How it works
Load the indexes (in bulk withload_indexes), then query them together with
query_multi_index. Every result document carries an index_name so you know where it
came from.
Behavior
- All indexes must be loaded locally (via
load_index/load_indexes) and share the same embedding model. top_kis global, not per-index - it caps the merged result set.- Embedding-only: scoring uses vectors, so
alphais ignored (BM25 is unsound across separate corpora, where term statistics differ).filterandembeddingwork the same as in a single-index query. - Bulk lifecycle:
load_indexes(names)returns aLoadIndexesResultwithloadedandfailed(best-effort; a typo in one name doesn’t roll back the others), andunload_indexes(names)releases them and is idempotent.
Implementation
Multi-index search is a Python SDK capability. See the Python guide for a runnable example.Related
Retrieval
Single-index querying, filters, and hybrid search.
Python reference
query_multi_index, load_indexes, unload_indexes.