Embedding index (in-memory)
This backend builds an embedding index in memory and queries it using exact cosine similarity.
It is intended for textbook demos and small corpora where you want a “real” embedding retrieval loop without running an external vector database.
Backend ID
embedding-index-inmemory
What it builds
This backend builds a retrieval snapshot that materializes:
chunk records (text + boundaries + provenance)
embedding vectors for each chunk
All of this lives in memory while the process is running. For safety, the backend enforces explicit caps so a build does not accidentally consume unbounded memory.
Chunking
Embeddings are computed over chunks. Chunking is configured per configuration by selecting a chunker and its configuration.
Chunking is part of the index contract: evidence references chunk boundaries so you can trace retrieval outputs back to the original item text.
Dependencies
Requires an embedding provider configuration.
This backend does not require a database or server.