- Metadata Filtering: `query()` now accepts an optional `filter` dict to narrow results by document metadata on locally loaded indexes
  - Comparison operators: `$eq`, `$ne`, `$gt`, `$gte`, `$lt`, `$lte`
  - Set operators: `$in`, `$nin`
  - Composable with `$and`/`$or` for complex predicates (supports arbitrary nesting)
  - Numeric coercion: int and float filter values are automatically converted to strings for consistent matching
- Geo-distance filtering: the new `$near` operator filters documents by haversine distance from a `"lat,lng,radiusMeters"` value
- When `filter` is passed to `query()` but the index is not loaded locally, a warning is logged and the filter is skipped (the cloud query API does not yet support filtering)
- Updated `inferedge-moss-core` dependency to `0.6.0`
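Combined, these operators compose into a nested filter dict. The sketch below shows an assumed shape based on the operator list above (the `query()` call itself is omitted, and the field names `category`, `views`, `lang`, `stars`, `location` are made up), with the documented numeric coercion mimicked locally:

```python
# Hypothetical filter for query(); operator names come from the changelog,
# the metadata field names are illustrative.
metadata_filter = {
    "$and": [
        {"category": {"$in": ["docs", "blog"]}},
        {"views": {"$gte": 100}},  # numeric values are coerced to strings
        {"$or": [
            {"lang": {"$eq": "en"}},
            {"stars": {"$lt": 5}},
        ]},
        {"location": {"$near": "37.77,-122.42,5000"}},  # "lat,lng,radiusMeters"
    ]
}

def coerce_numbers(node):
    """Mimic the documented numeric coercion: int/float values become strings."""
    if isinstance(node, dict):
        return {k: coerce_numbers(v) for k, v in node.items()}
    if isinstance(node, list):
        return [coerce_numbers(v) for v in node]
    if isinstance(node, (int, float)) and not isinstance(node, bool):
        return str(node)
    return node

coerced = coerce_numbers(metadata_filter)
print(coerced["$and"][1]["views"]["$gte"])  # "100"
```

Because `$and`/`$or` nest arbitrarily, the same walk applies at any depth.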
- Bumped `inferedge-moss-core` dependency to `0.5.0` to support session index telemetry and `push_index` improvements
- All index mutations and reads now go through the Rust ManageClient, replacing the Python HTTP layer
- Index creation uses an async bulk pipeline: binary upload → server-side build → poll until completion
- `load_index` supports both V1 and V2 binary formats, with cloud query fallback when the index isn't loaded locally
- New return type `MutationResult` (with `job_id`, `index_name`, `doc_count`) for `create_index`, `add_docs`, and `delete_docs`
- `get_docs` takes `doc_ids` directly instead of wrapping them in `GetDocumentsOptions`
- Query latency reduced from ~2,300ms to ~10ms for 100K vectors
- Optimized the search pipeline to reduce memory allocations
- Significantly reduced memory overhead for hybrid (keyword + semantic) search on large indexes (100K+ documents)
- Enhanced performance across all index sizes
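The new `MutationResult` return shape can be sketched as a plain dataclass. This is an illustrative stand-in using the field names noted above, not the SDK's actual class:

```python
from dataclasses import dataclass

# Illustrative stand-in for the SDK's MutationResult return type.
@dataclass
class MutationResult:
    job_id: str      # identifier of the async bulk-pipeline job
    index_name: str
    doc_count: int   # document count after the mutation

# create_index, add_docs, and delete_docs now return this shape, e.g.:
result = MutationResult(job_id="job-123", index_name="articles", doc_count=42)
print(result.index_name, result.doc_count)  # articles 42
```

Correspondingly, `get_docs` now takes the `doc_ids` list directly rather than a `GetDocumentsOptions` wrapper.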
- Hot Reload & Auto-Refresh: Indexes can now automatically detect and reload when updated in the cloud.
- `load_index()` now accepts optional `auto_refresh` and `polling_interval_in_seconds` parameters
- When `auto_refresh` is enabled, the SDK polls for updates at the configured interval (default: 600 seconds)
- To stop auto-refresh, call `load_index()` again without the `auto_refresh` option
- `load_index()` now allows reloading an already-loaded index (previously this threw an error)
- Index management now uses the Rust core for improved performance and reliability
- Adds partial support for Python 3.14 by disabling local embedding service functionality. Full support coming soon.
- Adds support for user-supplied embeddings.
- `query()` now automatically falls back to the cloud API when the index is not loaded locally, enabling queries without calling `load_index()` first.
- Adds better scoring evaluation for search results.
- Removes the `<2` upper bound on the numpy dependency.
- Drops support for Python 3.9 and below.
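The local-or-cloud routing described above can be sketched as a simple dispatch; the function and parameter names here are illustrative, not the SDK's:

```python
def query_with_fallback(index_loaded, run_local_query, run_cloud_query):
    """Illustrative routing: use the in-process index when it has been
    loaded, otherwise fall back to the cloud query API."""
    if index_loaded:
        return run_local_query()
    return run_cloud_query()

# Without a prior load_index(), the query transparently goes to the cloud:
result = query_with_fallback(False, lambda: "local hits", lambda: "cloud hits")
print(result)  # cloud hits
```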
- Bug fix: keyword search now functions correctly after `load_index()`.
- New service endpoint with significant infrastructure upgrades: management operations are now ~3× faster across most real-world use cases and support larger payloads.
- Updates `inferedge-moss-core` dependency to version 0.2.3 for new ARM64 wheel support.
- Adds IntelliSense support in all IDEs.
- Adds support for keyword search and alpha blending between keyword and semantic search.
- Removes the Pipecat integration and `MossContextRetriever` from the SDK; they will soon be offered as a Pipecat extension instead.
- Performance improvements for `query()` calls.
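Alpha blending of keyword and semantic scores is typically a convex combination, as in the sketch below; the weighting convention (higher alpha favours semantic) is an assumption, not taken from the SDK docs:

```python
def blend(keyword_score: float, semantic_score: float, alpha: float = 0.5) -> float:
    """Convex combination of the two rankings' scores: alpha=1.0 is purely
    semantic, alpha=0.0 purely keyword (convention assumed)."""
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must be in [0, 1]")
    return alpha * semantic_score + (1.0 - alpha) * keyword_score

print(blend(keyword_score=0.2, semantic_score=0.8, alpha=0.5))  # 0.5
```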
- `MossContextRetriever`: Added Pipecat integration for real-time voice AI applications
- Automatically enhances LLM conversations with semantic search results from Moss indexes
- Seamless integration with OpenAI LLM context frames
Initial release of `inferedge-moss` with core features:
- Semantic search using transformer-based embeddings
- Lightweight embedding models for edge computing; supports the proprietary `moss-minilm` model
- API key validation with secure host access
- Cloudflare CDN support for fast model loading
- Multi-index support for isolated search spaces
- Add, update, and remove items across indexes
- Query interface with configurable result count
- Performance metrics tracking