- Vector similarity (semantic) - matches on meaning
- Keyword / BM25 - matches on exact terms
- Hybrid - blends both, tuned with
alpha
load_index()
(or open a session) first; queries then run entirely in-memory
(~1-10 ms).
Basic query
Go deeper
Hybrid search
Blend semantic and keyword scoring with
alpha.Metadata filtering
Narrow results by document metadata.
Custom embeddings
Bring your own query and document vectors.
Multi-index search
Query several loaded indexes in one call.
Tuning
- Adjust
topK/top_kand score thresholds - Layer metadata filters to narrow candidate sets
- Group queries by intent (returns, billing, onboarding) and tune per index
- Choose model per index:
moss-minilm(fast) ormoss-mediumlm(more accurate)