ATLAS
Vector Search
Build intelligent applications powered by semantic search and generative AI using native, full-featured vector database capabilities.
What is vector search?
Generative AI uses vectors to enable intelligent semantic search over unstructured data (text, images, and audio). Vectors are critical in building recommendation engines, anomaly detection, and conversational AI. The wide range of use cases, made possible with native capabilities in MongoDB, deliver transformative user experiences.
The combined power of vectors and MongoDB
Head of Content Digitalisation, Novo Nordisk
Learning hub
FAQ
What are vectors or vector embeddings?
How does Atlas Vector Search differ from Atlas Search?
Atlas Vector Search allows searching through data based on semantic meaning captured in vectors, whereas Atlas Search allows for keyword search (i.e., based on the actual text and any defined synonym mappings).
Can I use MongoDB Atlas instead of a standalone vector database?
Yes, MongoDB Atlas is a vector database. Atlas is a fully managed, multi-cloud developer data platform with a rich array of capabilities that includes text or lexical and vector search. Rather than use a standalone or bolt-on vector database, the versatility of our platform empowers users to store their operational data, metadata, and vector embeddings on Atlas and seamlessly use Atlas Vector Search for indexing, retrieval, and building performant generative AI applications.
What's the difference between K-Nearest Neighbor (KNN) Search, Approximate Nearest Neighbor (ANN) Search, and Exact Nearest Neighbor (ENN) Search? When to use what?
KNN stands for "K-Nearest Neighbors," which is the algorithm frequently used to find vectors near one another.
ANN stands for "Approximate Nearest Neighbors" and it is an approach to finding similar vectors that trades accuracy in favor of performance. This is one of the core algorithms used to power Atlas Vector Search. Our algorithm for Approximate Nearest Neighbor search uses the Hierarchical Navigable Small World (HNSW) graph for efficient indexing and querying of millions of vectors.
ENN stands for “Exact Nearest Neighbors” and it is an approach to finding similar vectors that might trade some performance in favor of accuracy. This method returns the exact closest vectors to a query vector, with the number of vectors specified by the variable limit. Exact vector search (ENN) query execution can maintain sub-second latency for unfiltered queries up to 10,000 documents. It can also provide low-latency responses for highly selective filters that restrict a broad set of documents into 10,000 documents or less, ordered by vector relevance.
What is $vectorSearch and how does it differ from the knnBeta operator in $search?
$vectorSearch is an aggregation stage in MongoDB Atlas that lets you execute an Approximate Nearest Neighbor (ANN) or Exact Nearest Neighbor (ENN) query with MongoDB Query API filtering (e.g., “$eq” or “$gte”). This stage is supported on Atlas clusters version 6.0 and higher. The Atlas Search knnVector field type and knnBeta operator in $search are now deprecated.
Which vector embeddings does Atlas Vector Search support? Is there support for vector quantization?
Atlas Vector Search supports embeddings from any provider that is under the 4096-dimension limit on the service.
We support the ingestion, indexing, and querying of scalar and binary quantized vectors from embedding providers. We also provide the option to implement automatic scalar and binary quantization of full-fidelity vectors in Atlas Vector Search.
Does Atlas Vector Search work with images, media files, and other types of data?
Yes, Atlas Vector Search can query any kind of data that can be turned into a vector embedding. One of the benefits of the document model is that you can store your embeddings right alongside your rich data in your documents.
Get started with Atlas Vector Search
- Simplified deployment
- Unified developer experience
- Horizontal, vertical, independent scale
- Integrated AI ecosystem
- 118+ regions worldwide