- Python
- JavaScript
- API Reference
- Platform
- Hot Reload & Auto-Refresh: Indexes can now automatically detect and reload when updated in the cloud.
load_index()now accepts optionalauto_refreshandpolling_interval_in_secondsparameters- When
auto_refreshis enabled, the SDK polls for updates at the configured interval (default: 600 seconds) - To stop auto-refresh, call
load_index()again without theauto_refreshoption
load_index()now allows reloading an already-loaded index (previously threw an error)- Index management now uses Rust core for improved performance and reliability
- Adds partial support for Python 3.14 by disabling local embedding service functionality. Full support coming soon.
- Adds support for user-supplied embeddings.
query()now automatically falls back to the cloud API when the index is not loaded locally, enabling queries without requiringload_index()first.- Adds better scoring evaluation for search results
- Removes the ’< 2’ upper bound on numpy dependency.
- Dropped support for Python 3.9 and below
- Bug fix: Keyword search now functions correctly after
load_index(). - New service endpoint with significant infrastructure upgrades. Management operations are now ~3× faster across most real-world use cases, providing faster index operations while also supporting larger payloads.
- Updated
inferedge-moss-coredependency to version 0.2.3 for new ARM64 wheel support
- Added IntelliSense support in all IDEs
- Added support for keyword search
- Added alpha blending between keyword and semantic search
- Removed Pipecat integration and MossContextRetriever from the SDK
- Will be offered as a separate Pipecat extension soon
- Performance improvements for
query()calls
New Features & Updates:
- Integrations:
- Added MossContextRetriever for Pipecat integration in real-time voice AI applications
- Automatically enhances LLM conversations with semantic search results from Moss indexes
- Seamless integration with OpenAI LLM context frames
New Features & Updates:
-
Core:
- Initial release of inferedge-moss
- Multi-index support for isolated search spaces
- API key validation with secure host access
- Performance metrics tracking
-
Search:
- Semantic search using transformer-based embeddings
- Query interface with configurable result count
- Add, update, and remove items across indexes
-
Embeddings:
- Lightweight embedding models for edge computing
- Support for proprietary “moss-minilm” model
- Cloudflare CDN support for fast model loading