Skip to main content
2026-02-02
v1.0.0-beta.13
  • Hot Reload & Auto-Refresh: Indexes can now automatically detect and reload when updated in the cloud.
    • load_index() now accepts optional auto_refresh and polling_interval_in_seconds parameters
    • When auto_refresh is enabled, the SDK polls for updates at the configured interval (default: 600 seconds)
    • To stop auto-refresh, call load_index() again without the auto_refresh option
  • load_index() now allows reloading an already-loaded index (previously threw an error)
  • Index management now uses Rust core for improved performance and reliability
2026-01-30
v1.0.0-beta.12
  • Adds partial support for Python 3.14 by disabling local embedding service functionality. Full support coming soon.
2026-01-29
v1.0.0-beta.11
  • Adds support for user-supplied embeddings.
  • query() now automatically falls back to the cloud API when the index is not loaded locally, enabling queries without requiring load_index() first.
  • Adds better scoring evaluation for search results
2026-01-26
v1.0.0-beta.10
  • Removes the ’< 2’ upper bound on numpy dependency.
2026-01-14
v1.0.0-beta.9
  • Dropped support for Python 3.9 and below
  • Bug fix: Keyword search now functions correctly after load_index().
  • New service endpoint with significant infrastructure upgrades. Management operations are now ~3× faster across most real-world use cases, providing faster index operations while also supporting larger payloads.
2025-12-15
v1.0.0-beta.8
  • Updated inferedge-moss-core dependency to version 0.2.3 for new ARM64 wheel support
2025-12-01
v1.0.0-beta.7
  • Added IntelliSense support in all IDEs
2025-11-29
v1.0.0-beta.6
  • Added support for keyword search
  • Added alpha blending between keyword and semantic search
2025-10-23
v1.0.0-beta.5
  • Removed Pipecat integration and MossContextRetriever from the SDK
  • Will be offered as a separate Pipecat extension soon
2025-10-09
v1.0.0-beta.4
  • Performance improvements for query() calls
2025-10-09
v1.0.0-beta.3
New Features & Updates:
  • Integrations:
    • Added MossContextRetriever for Pipecat integration in real-time voice AI applications
    • Automatically enhances LLM conversations with semantic search results from Moss indexes
    • Seamless integration with OpenAI LLM context frames
2025-09-14
v1.0.0-beta.1
New Features & Updates:
  • Core:
    • Initial release of inferedge-moss
    • Multi-index support for isolated search spaces
    • API key validation with secure host access
    • Performance metrics tracking
  • Search:
    • Semantic search using transformer-based embeddings
    • Query interface with configurable result count
    • Add, update, and remove items across indexes
  • Embeddings:
    • Lightweight embedding models for edge computing
    • Support for proprietary “moss-minilm” model
    • Cloudflare CDN support for fast model loading