agno-moss. Moss manages embeddings internally and serves queries from an in-memory runtime, so Agno agents get fast response without running a separate embedder or vector database.
Why use Moss with Agno?
Agno’sKnowledge interface is the standard way to plug external knowledge into agents. Moss delivers sub-10ms semantic search that slots directly into this interface via MossRuntime, giving your agents fast, accurate retrieval without the latency overhead of a standalone vector database.
Required tools
- Moss project credentials from the Moss Portal
- Python 3.10+
- An Agno-compatible model provider, such as OpenAI or Anthropic
Integration guide
Configure credentials
Set your Moss credentials in the environment.
MossRuntime reads these automatically when project_id and project_key are omitted.Configuration
MossRuntime
| Parameter | Default | Description |
|---|---|---|
index_name | Required | Name of the Moss index |
project_id | MOSS_PROJECT_ID env var | Moss project ID |
project_key | MOSS_PROJECT_KEY env var | Moss project key |
embedding_model | "moss-minilm" | "moss-minilm" for speed or "moss-mediumlm" for higher accuracy |
alpha | 0.8 | Hybrid search blend. 1.0 = semantic only, 0.0 = keyword only |
auto_refresh | False | Auto-refresh the in-memory index when new docs are added |
polling_interval_in_seconds | 600 | Refresh interval when auto_refresh=True |