Building AI with MongoDB: Retrieval-Augmented Generation (RAG) Puts Power in Developers’ Hands

Mat Keep
November 28, 2023 | Updated: August 8, 2024
#genAI #Vector Search

As recently as 12 months ago, any mention of retrieval-augmented generation (RAG) would have left most of us confused. However, with the explosion of generative AI, the RAG architectural pattern has now firmly established itself in the enterprise landscape.

RAG presents developers with a potent combination. They can take the reasoning capabilities of pre-trained, general-purpose LLMs and feed them with real-time, company-specific data. As a result, developers can build AI-powered apps that generate outputs grounded in enterprise data and knowledge that is accurate, up-to-date, and relevant. They can do this without having to turn to specialized data science teams to either retrain or fine-tune models — a complex, time-consuming, and expensive process.

Over this series of Building AI with MongoDB blog posts, we’ve featured developers using tools like MongoDB Atlas Vector Search for RAG in a whole range of applications. Take a look at our AI case studies page and you’ll find examples spanning conversational AI with chatbots and voice bots, co-pilots, threat intelligence and cybersecurity, contract management, question-answering, healthcare compliance and treatment assistants, content discovery and monetization, and more.

Further reflecting its growing adoption, Retool’s State of AI survey from a couple of weeks ago shows Atlas Vector Search earning the highest net promoter score (NPS) among developers.

Check out our AI resource page to learn more about building AI-powered apps with MongoDB.

In this blog post, I’ll highlight three more interesting and novel use cases:

Unlocking geological data for better decision-making and accelerating the path to net zero at Eni
Video and audio personalization at Potion
Unlocking insights from enterprise knowledge bases at Kovai

Eni makes terabytes of subsurface unstructured data actionable with MongoDB Atlas

Based in Italy, Eni is a leading integrated energy company with more than 30,000 employees across 69 countries. In 2020, the company launched a strategy to reach net zero emissions by 2050 and develop more environmentally and financially sustainable products.

Sabato Severino, Senior AI Solution Architect for Geoscience at Eni, explains the role of his team: “We’re responsible for finding the best solutions in the market for our cloud infrastructure and adapting them to meet specific business needs.”

Projects include using AI for drilling and exploration, leveraging cloud APIs to accelerate innovation, and building a smart platform to promote knowledge sharing across the company. Eni’s document management platform for geosciences offers an ecosystem of services and applications for creating and sharing content. It leverages embedded AI models to extract information from documents and stores unstructured data in MongoDB.

The challenges for Severino’s team were to maintain the platform as it ingested a growing volume of data — hundreds of thousands of documents and terabytes of data — and to enable different user groups to extract relevant insights from comprehensive records quickly and easily.

With MongoDB Atlas, Eni users can quickly find data spanning multiple years and geographies to identify trends and analyze models that support decision-making within their fields. The platform uses MongoDB Atlas Search to filter out irrelevant documents while also integrating AI and machine learning models, such as vector search, to make it even easier to identify patterns.

“The generative AI we’ve introduced currently creates vector embeddings from documents, so when a user asks a question, it retrieves the most relevant document and uses LLMs to build the answer,” explains Severino.

“We’re looking at migrating vector embeddings into MongoDB Atlas to create a fully integrated, functional system. We’ll then be able to use Atlas Vector Search to build AI-powered experiences without leaving the Atlas platform — a much better experience for developers.”

Read the full case study to learn more about Eni and how it is making unstructured data actionable.

Video personalization at scale with Potion and MongoDB

Potion enables salespeople to personalize prospecting videos at scale. Already over 7,500 sales professionals at companies including SAP, AppsFlyer, CaptivateIQ, and Opensense are using SendPotion to increase response rates, book more meetings, and build customer trust.

All a sales representative needs to do is record a video template, select which words need to be personalized, and let Potion’s audio and vision AI models do the rest. Kanad Bahalkar, co-founder and CEO at Potion explains:

“The sales rep tells us what elements need to be personalized in the video — that is typically provided as a list of contacts with their name, company, desired call-to-action, and so on. Our vision and audio models then inspect each frame and reanimate the video and audio with personalized messages lip-synced into the stream. Reanimation is done in bulk in minutes. For example, one video template can be transformed into over 1,000 unique video messages, personalized to each contact.”

Potion’s custom generative AI models are built with PyTorch and TensorFlow, and run on Amazon Sagemaker. Describing their models, Kanad says “Our vision model is trained on thousands of different faces, so we can synthesize the video without individualized AI training. The audio models are tuned on-demand for each voice.”

And where does the data for the AI lifecycle live? “This is where we use MongoDB Atlas,” says Kanad.

“We use the MongoDB database to store metadata for all the videos, including the source content for personalization, such as the contact list and calls to action. For every new contact entry created in MongoDB, a video is generated for it using our AI models, and a link to that video is stored back in the database. MongoDB also powers all of our application analytics and intelligence. With the insights we generate from MongoDB, we can see how users interact with the service, capturing feedback loops, response rates, video watchtimes, and more. This data is used to continuously train and tune our models in Sagemaker."

On selecting MongoDB Kanad says, “I had prior experience of MongoDB and knew how easy and fast it was to get started for both modeling and querying the data. Atlas provides the best-managed database experience out there, meaning we can safely offload running the database to MongoDB. This ease-of-use, speed, and efficiency are all critical as we build and scale the business."

To further enrich the SendPotion service, Kanad is planning to use more of the developer features within MongoDB Atlas. This includes Atlas Vector Search to power AI-driven semantic search and RAG for users who are exploring recommendations across video libraries. The engineering team is also planning on using Atlas Triggers to enable event-driven processing of new video content.

Potion is a member of the MongoDB AI Innovators program. Asked about the value of the program, Kanad responds, “Access to free credits helped support rapid build and experimentation on top of MongoDB, coupled with access to technical guidance and support."

Bringing the power of Vector Search to enterprise knowledge bases

Founded in 2011, Kovai is an enterprise software company that offers multiple products in both the enterprise and B2B SaaS arena. Since its founding, the company has grown to nearly 300 employees serving over 2,500 customers.

One of Kovai’s key products is Document360, a knowledge base platform for SaaS companies looking for a self-service software documentation solution. Seeing the rise of GenAI, Kovai began developing its AI assistant, “Eddy.” The assistant provides answers to customers' questions utilizing LLMs augmented by retrieving information in a Document360 knowledge base.

During the development phase Kovai’s engineering and data science teams explored multiple vector databases to power the RAG portion of the application. They found the need to sync data between its system-of-record MongoDB database and a separate vector database introduced inaccuracies in answers from the assistant.

The release of MongoDB Atlas Vector Search provided a solution with three key advantages for the engineers:

Architectural simplicity: MongoDB Vector Search's architectural simplicity helps Kovai optimize the technical architecture needed to implement Eddy.
Operational efficiency: Atlas Vector Search allows Kovai to store both knowledge base articles and their embeddings together in MongoDB collections, eliminating “data syncing” issues that come with other vendors.
Performance: Kovai gets faster query response from MongoDB Vector Search at scale to ensure a positive user experience.

Atlas Vector Search is robust, cost-effective, and blazingly fast!
Said Saravana Kumar, CEO, Kovai, when speaking about his team's experience

Specifically, the team has seen the average time taken to return three, five, and 10 chunks between two and four milliseconds, and if the question is a closed loop, the average time reduces to less than two milliseconds.

You can learn more about Kovai’s journey into the world of RAG in the full case study.

Getting started

As the case studies in our Building AI with MongoDB series demonstrate, retrieval-augmented generation is a key design pattern developers can use as they build AI-powered applications for the business. Take a look at our Embedding Generative AI whitepaper to explore RAG in more detail.

Head over to our quick-start guide to get started with Atlas Vector Search today.

← Previous

使用 MongoDB Atlas构建客户单一视图

开展一项持久、成功的业务，关键在于了解客户。如果你真正了解你的客户，你便掌握了他们的需求和欲望，进而在对的时间以对的方式交付适宜的商品。然而，对于绝大多数 B2C 企业而言，由于存在大量分散的数据，很难构建单一的客户视图。企业收集客户数据的场景有很多，比如电商平台、CRM、ERP、忠诚度计划、支付端口、网络 APP、手机 APP 等等。各数据集可能是结构化、半结构化或非结构化，以流处理的形式交付或需要批处理，进一步加重了让碎片化的客户数据编辑工作。一些企业开始寻求定制解决方案，但只能提供部分客户视图。孤岛数据集让运营变得极具挑战性，包括客户服务、定向市场营销和高级分析（如流失预测和推荐）等。只有获得 360 度的客户视图，企业才能真正理解客户的需要、欲望和要求，进而谈及满足客户需求。因此，360 度数据的单一视图成为实现持久客户关系的关键。在本篇文章中，我们将详细分析如何通过 MongoDB 数据库和 Cogniflare Calledio Customer 360 工具架构客户单一视图，并依照现实世界中的使用案例了解情感分析。使用 Calleido Customer 360 构建单一视图借助 Customer 360 数据库，企业机构能够获得和分析多种类型的个体交互和触点，进而构建客户整体视图。实现途径是通过一系列不同的资源获取数据。不过，这些数据信息的发送和转换既复杂又耗时，现有的许多大数据手段也并不适配云环境。为了解决这些挑战和困难，Cogniflare 推出了 Calleido 。图 1：Calleido Customer 360 用例架构 Calleido 是一个数据处理平台，基于久经考验的开源工具，如 Apache NiFi。Calleido 拥有 300 个处理器，可轻松异动结构化和非结构化数据，打破地域束缚。它提供批量和实时更新，处理简单的数据转换。更重要的是，Calleido 能够与 Google Cloud 无缝整合，实现一键式部署。它利用 Google Kubernetes Engine 按需进行纵向和横向扩展，打造直观、流畅的低代码开发环境。图 2：使用 Calleido 数据管道，将客户数据从 PostgreSQL 复制到 MongoDB 现实世界用例：客户电子邮件的情感分析下面通过对客户电子邮件的情感分析用例来演示 Cogniflare Calleido 、 MongoDB Atlas 和 Customer 360 视图。为简化 Customer 360 数据库的构建，Cogniflare 团队创建了工作流模板，可在几秒钟完成数据管道部署。在接下来的章节中，我们将详细介绍一些常用的数据转移模式，来演示本 Customer 360 用例和控制面板样本。图三：控制面板样本自处理器从电子邮件服务器 (ConsumeIMAP) 提取 IMAP 消息起，工作流便开始了。进入所选收件箱（即客户服务）的每封新邮件都会触发一次事件。接着，会提取电子邮件标题来判断有关电子邮件内容的重要详情 (ExtractEmailHeaders)。 Calleido 会借助发送人的电子邮件识别客户 (UpdateAttribute)，并通过执行脚本来提取电子邮件全文 (ExecuteScript)。此时，基于已收集到的所有数据，可形成消息有效负载，并通过 Google Cloud Platform (GCP) Pub/Sub（还可使用 Kafka）发布，满足下游工作流和其他服务的使用。图 4：将电子邮件翻译为 Cloud PubSub 消息接着，会用到前一工作流中的 GCP Pub/Sub 消息 (ConsumeGCPPubSub)。此时，我们会借助 MongoDB Atlas 的整合功能来验证MongoDB 数据库中的每一位发件人 (GetMongo)。如果某位客户已存在于我们的系统中，那么我们会把该电子邮件数据发送至下一工作流，然后忽略其他电子邮件。图 5：使用 MongoDB 和 Calleido 确认客户电子邮件随后，开始分析电子邮件正文。在本工作流中，我们使用处理器准备一份请求正文，发送至 Google云自然语言 AI获取消息的语气和情感信息。语言处理 API 的结果会直接发送至 MongoDB Atlas，进入控制面板。图 6：使用 Calleido 拨打云 AutoML 电话控制面板中的最终结果 Customer 360 数据库可用于内部后台业务系统，补充和通知客户支持。在单一视图的加持下，故障排除、退货和投诉处理都变得更加便捷、高效。利用之前的客户通话信息，可确保为每一位客户提供最恰当、有效的回应。这些数据集还可导入分析系统，促进学习和优化，例如将负面情感和流失率相关联。 MongoDB 文档数据库的作用在上述示例中，Calleido 负责将企业源系统中的数据复制和发送至 MongoDB Atlas——运营数据存储 (ODS)。得益于 MongoDB 灵活的数据架构，我们能够以原始格式传输数据，后续还能够以迭代的方式执行必要的模式转换，无需运行复杂的模式迁移，快速交付单一视图数据库。图 7 和 8：使用 Calleido 数据管道，将产品和订单从 PostgreSQL 复制到 MongoDB Atlas Calleido 可以让我们通过简单的几步便完成此转变。此工具运行自定义 SQL 查询 (ExecuteSQL)，汇总来自外部图表的全部所需数据，编译结果，以便进行并行处理。收到的数据为 Avro 格式，Calleido 随后将其转换为 JSON (ConvertAvroToJSON)，并转移至 MongoDB (JoltTransformJSON) 的模式中。 Customer 360 控制面板中的最终结果 MongoDB Atlas 是面向 Customer 360 数据库的行业领先之选。以下是其称为世界级标杆的主要原因： MongoDB 可有效处理来自原有系统的非标准化模式，并存储为任意自定义属性数据模型包括作为嵌套文档的所有相关数据。有别于 SQL 数据库，MongoDB 可规避难以写入和操作的复的加入查询。 MongoDB 非常快速，当前的客户视图能够在几毫秒内呈现，无需引入缓存层。 MongoDB 灵活的模式模型可通过迭代的方式实现敏捷性。在最初的提取中，数据几乎可以按照原始形状进行复制，进而大幅降低延迟。在后续阶段中，无需繁琐的 SQL 迁移，即可标准化模式，提升数据质量。 MongoDB 可跨越多个数据中心存储几十 TB 数据，轻松实现横向扩展可跨越多个区域分享数据，有效应对合规性要求。可设置独立的分析节点，避免影响生产系统的性能。 MongoDB 在作为单一视图数据库运行上具有有迹可循的业绩记录，曾有多家大型传统组织在两周内即运行原型，一个业务季度内即投入生产。 MongoDB Atlas 可直接自动扩展，降低成本，应对流量高峰。数据可实现动态和静态加密，有助于满足安全和隐私标准，包括 GDPR、HIPAA、PCI-DSS 和 FERPA。向客户追加销售：产品推荐向客户追加销售是现代业务的关键环节之一，其成功的诀窍在于：减少直接推销，更多专注培养和引导。即使用数据识别客户所处的购买阶段，他们的所思所想以及通过何种产品或服务能够满足需求。基于客户的购买记录，Calleido 可将数据发送至相应的工具（如 BigQuery ML），进而协助完成产品推荐。接着，这些内容可通过客服中心和市场营销团队进行线上或手机 APP 推送。实现这一目标，涉及两个工作流：准备训练数据和生成产品推荐：准备训练数据首先，使用 ExecuteSQL 处理器将合适的数据从 PostgreSQL 转移至 BigQuery。数据管道可以编排为定期执行。下一步，从 PostgreSQL 获取合适的数据，借助 ExecuteSQLRecord 处理器分割为 1,000 行的数据块。接着，这些文件会传送至下一个处理器，通过负载平衡利用所有可用的节点。然后，所有上述数据会通过 PutBigQueryStreaming 处理器插入至 BigQuery 表中。图 9：通过 Calleido 从 PostgreSQL 复制数据至 BigQuery 生成产品推荐接下来，我们介绍产品推荐的生成。首先，必须购买 Big Query 容量槽，以最经济的方式使用 BigQuery ML 的各项功能。此时，Calleido 会通过 ExecuteSQL 处理器调用 SQL 程序，确保所需的 BigQuery 容量可正常使用。下一个处理器 (ExecuteSQL) 将执行 SQL 查询，使用从第一个工作流中复制的数据创建和训练 Matrix Factorization 机器学习模型。随后，Calleido 使用 ExecuteSQL 处理器查询已受训的模型获取所有预测，并存储在专属的 BigQuery 表格中。最后，Wait 处理器等待所有容量槽的移除，因为已不再需要。图 10 和 11：通过 Calleido 生成产品推荐接着，我们借助两个处理器移除旧的推荐。首先，ReplaceText 处理器会更新即将开始的工作流文件内容，设置查询主体，方便DeleteMongo 处理器用于执行移除操作。图 12：移除旧的推荐将推荐复制到 MongoDB 便完成了整个工作流。ExecuteSQL 处理器获取和集合每位用户的前 10 项推荐，均以 1,000 行的数据块呈现。接着，以下两个处理器（ConvertAvroToJSON 和 ExecuteScript）备好数据，通过 PutMongoRecord 处理器插入 MongoDB 集合。图 13：将推荐复制到 MongoDB Customer 360 控制面板中的最终结果（本示例中所用的数据为自动生成）： MongoDB Atlas 上 Calleido 360 客户数据库的优势如果数据位于集中操作数据存储（如 MongoDB）中，那么可通过 Calleido 与分析数据存储（如 Google BigQuery）进行同步。借助Customer 360 数据库，内部相关方可将数据用于：通过细分和定向市场营销来提升客户满意度精准、快速访问合规性审计构建需求规划展望和市场趋势分析奖励客户忠诚，降低流失率最终，客户单一视图不仅能够帮助企业机构向潜在的买家精准交付消息，还能将处于品牌认知阶段的客户引流到转化阶段，并确保客户保留和售后机制高效运转。在过去，客户 360 视图是个繁杂、碎片化的过程；但现在依托 Cogniflare 的 Calleido 和 MongoDB Atlas，Customer 360 数据库已成为企业机构放心使用的功能强大、成本可控的数据管理堆栈。

November 28, 2023

Next →

Building Gen AI with MongoDB & AI Partners | December 2024

Now that 2024 is behind us, we can see clearly how much change, innovation, and progress there was across the AI landscape in 2024. For MongoDB, the year was particularly marked by collaboration with our AI partners, and by the possibilities that AI collaboration holds; as the saying goes, it takes a village. From the release of breakthrough tools and frameworks, to AI-enriched workflows (for both prototyping and production), together we empowered customers and developers alike to build cutting-edge AI applications. To help you prepare for the rest of 2025, below is a selection of content developed by MongoDB’s Developer Relations team. This work will equip you with the knowledge (and tools!) from MongoDB and our AI partners to create the hottest AI applications in the new year. Building an Agent with Fireworks.AI, MongoDB, and LangChain Learn how to create an intelligent agent that combines Fireworks AI’s advanced capabilities, LangChain’s framework, and MongoDB's robust database. This guide walks you through developing an agent capable of reasoning and decision-making with structured and unstructured data. Claude 3.5 and MongoDB: Revolutionizing Retrieval-Augmented Generation Learn how Anthropic's Claude 3.5 integrates with MongoDB to enhance retrieval-augmented generation (RAG) pipelines. This post demonstrates using Claude for contextual and nuanced text generation while leveraging MongoDB Atlas for efficient data retrieval. Build an AI Agent with LangGraph.js and MongoDB Atlas Explore how LangGraph.js simplifies AI agent development for JavaScript and TypeScript developers. This tutorial showcases building an AI-powered agent and managing data with MongoDB Atlas for seamless functionality and scalability. Ingesting Quantized Vectors with Cohere and MongoDB Discover how to leverage Cohere’s quantized vector representations and MongoDB Atlas for efficient vector storage and retrieval. This guide demonstrates workflows for building scalable, high-performance applications that use vector embeddings for AI-driven solutions. And if you’d like to dig into building with MongoDB and gen AI, explore our GenAI Showcase repository on GitHub for a wide range of sample projects, tools, and inspiration to kickstart your AI journey into 2025! Happy New Year—and happy building! Welcoming new AI and tech partners In December 2024, we welcomed six new AI and tech partners that offer product integrations with MongoDB. Read on to learn more about each great new partner! Apigene Apigene enables users to operate all software applications through a single AI assistant, providing complete control of popular services and platforms. " We're excited to partner with MongoDB to bring natural language capabilities to Atlas users, transforming how teams interact with their data”, said Michal Geva, VP of Business of Apigene. “This collaboration makes database operations as intuitive as having conversations, empowering businesses to unlock Atlas’s full potential without complexity." Bauplan Bauplan is a programmable data lake where users can load, transform, query, run, schedule, and replay all from their code, driving superior cost-efficiency and less management from data teams. “ We're pretty darn excited about partnering up with MongoDB because the combination of Bauplan and MongoDB Atlas makes it so incredibly easy to build full-stack AI applications”, said Ciro Greco, CEO and founder of Bauplan. “One can build powerful applications like embedded analytics, feature stores, recommender systems, and RAG based search in a simple Python script. Zero infrastructure overhead, compute is purely serverless and everything's version controlled in the data lake by default.” Botnoi BOTNOI Group offers innovative AI technologies that enhance business operations such as a conversational AI chatbot for enterprise, speech-to-text, text-to-speech, and computer vision. " We’re excited to announce our partnership with MongoDB ”, said Piyoros Tungthamthiti, CTO of BOTNOI Group. “By integrating MongoDB Atlas, we’re enhancing Botnoi’s capabilities to deliver top-tier conversational AI performance. This collaboration will enable seamless data management, advanced analytics, and reliable system performance, ultimately providing greater value to our clients." Jiva.ai Jiva.ai is a zero-code platform for rapid multimodal AI development using structured and unstructured data. " We are thrilled to join MongoDB's ecosystem and bring our no-code AI platform together with their powerful vector search and multimodal data capabilities,” said Dr. Manish Patel, CEO of Jiva.ai. “MongoDB enables us to help businesses rapidly transform complex data into intelligent solutions, democratizing AI development across industries. By combining Jiva.ai's patented model fusion technology with MongoDB's flexible document model, we're accelerating enterprise AI adoption and helping organizations unlock unprecedented insights from their data." mple.ai mple.ai is an AI-powered sales training platform for enterprises, designed to deliver scalable, measurable, and impactful training through role-plays and AI-driven evaluations. " Our collaboration with MongoDB is redefining AI-driven team training”, said Riddhesh Ganatra, Co-Founder of mple.ai. “With MongoDB's reliable and scalable data solutions, we're delivering real-world scenario-based coaching to help organizations achieve faster, more impactful results." TrueFoundry TrueFoundry is a Kubernetes-based platform designed to simplify the process of building, deploying and scaling compound AI systems across any cloud or on-premise infrastructure. “ We’re thrilled to partner with MongoDB to accelerate the development of compound AI applications”, said Nikunj Bajaj, CEO of TrueFoundry. “With TrueFoundry’s powerful accelerators, including AI Gateway, Model Deployment & Finetuning, and RAG Framework, combined with MongoDB’s scalable vector database, enterprises can quickly build, deploy, and scale production-grade AI solutions. TrueFoundry’s platform ensures robust governance, cost optimization, and faster time to value, empowering enterprises to innovate efficiently and at scale.” But wait, there's more! To learn more about building AI-powered apps with MongoDB, check out our AI Resources Hub and stop by our Partner Ecosystem Catalog to read about our integrations with MongoDB’s ever-evolving AI partner ecosystem.

January 16, 2025