Moss - Real-time Semantic Search for Conversational AI

Semantic (vector) search captures meaning; keyword (BM25) search captures exact terms. Hybrid search blends both with one parameter, alpha, so you can tune relevance per query or per index. As with all queries, load the index first (or open a session).

The `alpha` parameter

`alpha`	Behavior
`1.0`	Pure semantic (embeddings only)
`0.0`	Pure keyword (BM25 only)
between	Blends the two; default is semantic-heavy at `0.8`

Choosing alpha

Lower alpha (toward keyword) when queries contain exact identifiers, SKUs, names, or jargon.
Higher alpha (toward semantic) when queries are natural-language paraphrases.
Tune per index and per intent (returns, billing, onboarding, etc.).

Implementation

Runnable, per-language examples live in the SDK guides:

Python
JavaScript

Metadata filtering

Constrain results by document metadata.

Custom embeddings

Bring your own vectors.

Retrieval Metadata Filtering

⌘I

Getting Started

Capabilities

Use Cases

How it works

Pricing

Hybrid Search

The `alpha` parameter

Choosing alpha

Implementation

Metadata filtering

Custom embeddings

​The alpha parameter

​Choosing alpha

​Implementation

​Related

Metadata filtering

Custom embeddings

The `alpha` parameter

Choosing alpha

Implementation

Related