13.2 Vector Memory Stores
Vector memory stores are a critical component for building AI agents with long-term memory. They address the limitation of finite context windows by providing a scalable way to persist and retrieve information. The core idea is to convert text into numerical vectors (embeddings) and store them in a specialized database optimized for similarity search.
Key Concepts
- Embeddings: Dense vector representations of data. Semantically similar items are mapped to nearby points in the vector space.
- Vector Database: A database designed to store and query high-dimensional vectors, finding the "most similar" items rather than exact matches.
- Similarity Search: The process of finding vectors closest to a query vector, often using Approximate Nearest Neighbor (ANN) algorithms for speed.
The Retrieval Process
- Indexing: New information is converted into a vector by an embedding model and stored in the vector database.
- Querying: A user's query is also converted into a vector.
- Retrieval: The database searches for the most similar vectors in its memory.
- Augmentation: The retrieved information is added to the LLM's context to generate an informed response (Retrieval-Augmented Generation).
This visualization demonstrates the process of indexing new data and retrieving it via semantic search. Click the buttons to see it in action.