WEBINARHow to build smarter AI apps with Python and MongoDB. Register now >
NEWLearn MongoDB with expert tutorials and tips on our new Developer YouTube channel. Subscribe >

On-Demand Webinar

Scaling Vector Database Operations with MongoDB and Voyage AI

The performance and scalability of your AI application depend on efficient vector storage and retrieval. In this webinar, we explore how MongoDB Vector Search on Atlas and Voyage AI embeddings optimize these aspects through quantization—a technique that reduces the precision of vector embeddings (e.g., float32 to int8) to decrease storage costs and improve query performance while managing accuracy trade-offs.

Vector embeddings are the foundation of AI-driven applications, along with powerful capabilities such as retrieval-augmented generation (RAG), semantic search, and agent-based workflows. However, as data volumes grow, the cost and complexity of storing and querying high-dimensional vectors increase.

Senior Staff Developer Advocate Anant Srivastava covered practical strategies for converting embeddings to lower-bit representations, balancing performance with accuracy. In a step-by-step tutorial, he shows you how to apply these optimizations using Voyage AI embeddings to reduce both query latency and infrastructure costs.

Key Takeaways:

  • How quantization works to dramatically reduce the memory footprint of embeddings

  • How MongoDB Vector Search on Atlas integrates automatic quantization to efficiently manage millions of vector embeddings

  • Real-world metrics for retrieval latency, resource utilization, and accuracy across float32, int8, and binary embeddings

  • Combining binary quantization with a rescoring step yields near float32-level accuracy with a fraction of the computational overhead

  • Best practices and tips for balancing speed, cost, and precision—especially at the 1M+ embedding scale essential for RAG, semantic search, and recommendation systems


More like this

View all resources
general_content_tutorial

Introduction to MongoDB

Watch to learn the fundamentals of the world’s most popular NoSQL database, MongoDB.

Learn More
mdb_vector_search

Intro to Vector Search

Explore how AI and MongoDB Vector Search on Atlas are enabling a new generation of smart, context-aware applications.

Learn More
atlas_performance_advisor

AI-Driven Outcomes: How MongoDB Is Helping Organizations Win

See how real companies are using generative AI technologies to accelerate time to value, optimize costs, and improve customer satisfaction.

Learn More