Leveraging Database Observability at MongoDB: Real-Life Use Case

Frank Sun and Sabina Friden

This post is the second in our three-part series, Leveraging Database Observability at MongoDB.

Welcome back to the Leveraging Database Observability at MongoDB series. In our last discussion, we explored MongoDB's unique observability strategy, which uses out-of-the-box tools designed to automatically monitor and optimize customer databases. These tools provide continuous feedback to answer critical questions: What is happening? Where is the issue? Why is it occurring? And how do I fix it? Answering them ensures enhanced performance, increased productivity, and minimized downtime.

So let’s dive into a real-life use case, illustrating how different tools in MongoDB Atlas come together to address database performance issues. Whether you're a DBA, developer, or just a MongoDB enthusiast, our goal is to empower you to harness the full potential of your data using the MongoDB observability suite.

Why is it essential to diagnose a performance issue?

Identifying database bottlenecks and pinpointing the exact problem can be daunting and time-consuming for developers and DBAs.

When your application is slow, several questions may arise:

  • Have I hit my bandwidth limit?

  • Is my cluster under-provisioned and resource-constrained?

  • Does my data model need to be optimized, or is it causing inefficient data access?

  • Do my queries need to be more efficient, or are they missing necessary indexes?

MongoDB Atlas provides tools to zoom in, uncover insights, and detect anomalies that might otherwise go unnoticed in vast data expanses.

Let’s put it into practice

Let's consider a hypothetical scenario to illustrate how to track down and address a performance bottleneck.

Setting the context

Imagine you run an online e-commerce store selling a popular item. On average, you sell about 500 units monthly. Your application comprises several services, including user management, product search, inventory management, shopping cart, order management, and payment processing.

Recently, your store went viral online, driving significant traffic to your platform. This surge increased request latencies, and customers began reporting slow website performance.

Identifying the bottleneck

With multiple microservices, finding the service responsible for increased latencies can be challenging. Initial checks might show that inventory loads quickly, search results are prompt, and shopping cart updates are instantaneous. However, the issue might be more nuanced and time-sensitive, potentially leading to a full outage if left unaddressed.

The five-step diagnostic process

To resolve the issue, we’ll use a five-step diagnostic process:

  1. Gather data and insights: collect the relevant metrics and measurements.

  2. Generate hypotheses: formulate possible explanations for the problem.

  3. Prioritize hypotheses: use the data to identify the most likely cause.

  4. Validate hypotheses: confirm or disprove the top hypothesis.

  5. Implement and observe: make the change and observe the results.

Applying the five-step diagnostic process for resolution

Let’s see how this diagnostic process unfolds:

Step 1: Gather Data and Insights

Customers report that the website is slow, so we start by checking for possible culprits. Inefficient queries, resource constraints, or network issues are the primary suspects.
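
As a quick first check, we can also look at what is running against the cluster right now. The following is a minimal sketch in Python with PyMongo (the connection string and the five-second threshold are placeholders, not values from this scenario) that lists in-progress operations that have been running for more than a few seconds:

    # Sketch: surface long-running, in-progress operations as a first data-gathering step.
    from pymongo import MongoClient

    client = MongoClient("mongodb+srv://<user>:<password>@<cluster-host>/")

    # $currentOp runs as an aggregation against the admin database.
    slow_ops = client.admin.aggregate([
        {"$currentOp": {"allUsers": True, "idleConnections": False}},
        {"$match": {"secs_running": {"$gte": 5}}},
    ])

    for op in slow_ops:
        # ns is the database.collection the operation is touching.
        print(op.get("ns"), op.get("secs_running"), op.get("command"))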

Step 2: Generate Hypotheses

Given the context, the application could be making inefficient queries, the database could be resource-constrained, or network congestion could be causing delays.

Step 3: Prioritize Hypotheses

We begin by examining the Metric Charts in Atlas. The host-level charts show no obvious resource constraints or network problems, which makes inefficient queries our most likely hypothesis, so we investigate the query workload further.
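
The same host-level measurements behind the Metric Charts can also be pulled programmatically through the Atlas Administration API, which is handy when comparing metrics across hosts or time windows. Here is a rough sketch; the project ID, hostname, and API keys are placeholders, and the measurement names shown should be checked against the process measurements reference in the Atlas docs:

    # Sketch: fetch recent host-level measurements for one Atlas process.
    import requests
    from requests.auth import HTTPDigestAuth

    BASE = "https://cloud.mongodb.com/api/atlas/v1.0"
    GROUP_ID = "<project-id>"
    PROCESS = "<hostname>:27017"

    resp = requests.get(
        f"{BASE}/groups/{GROUP_ID}/processes/{PROCESS}/measurements",
        params={
            "granularity": "PT1M",   # one data point per minute
            "period": "PT1H",        # over the last hour
            "m": ["OP_EXECUTION_TIME_READS", "OPCOUNTER_QUERY", "CONNECTIONS"],
        },
        auth=HTTPDigestAuth("<public-key>", "<private-key>"),
    )
    resp.raise_for_status()

    for series in resp.json()["measurements"]:
        print(series["name"], series["dataPoints"][-1])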

Step 4: Validate Hypotheses

Using Atlas' Namespace Insights, we break the host-level measurements down into collection-level data. We notice that the transactions.transactions collection has much higher latency than the others. Increasing our lookback period to a week shows that the latency rose just over 24 hours ago, when customers began reporting slow performance. Since this collection stores transaction details, we use the Atlas Query Profiler and find that the queries are inefficient because they scan every document in the collection rather than using an index. This validates our hypothesis that the application slowness was caused by query inefficiency.

Figure 1: The new Query Insights tab in MongoDB Atlas, displaying latency graphs and options for optimizing performance.
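
The same inefficiency can be confirmed from a driver by running the query through the explain command. A minimal sketch with PyMongo follows (the connection string and customerID value are placeholders); a COLLSCAN winning plan that examines far more documents than it returns is the signature of a missing index:

    # Sketch: confirm the query is doing a full collection scan.
    from pymongo import MongoClient

    client = MongoClient("mongodb+srv://<user>:<password>@<cluster-host>/")
    db = client["transactions"]

    plan = db.command(
        "explain",
        {"find": "transactions", "filter": {"customerID": "C-12345"}},
        verbosity="executionStats",
    )

    winning_plan = plan["queryPlanner"]["winningPlan"]
    stats = plan["executionStats"]
    # A COLLSCAN stage in the winning plan, combined with many more documents
    # examined than returned, indicates the query is not using an index.
    print(winning_plan)
    print(stats["totalDocsExamined"], "examined vs", stats["nReturned"], "returned")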

Step 5: Implement and Observe

We need to create an index to resolve the collection scan issue. The Atlas Performance Advisor suggests an index on the customerID field. Adding this index enables the database to locate and retrieve transaction records for the specified customer more efficiently, reducing execution time. After creating the index, we return to our Namespace Insights page to observe the effect. We see that the latency on our transactions collection has decreased and stabilized. We can now follow up with our customers to update them on our fix and assure them that the problem has been resolved.
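
If you prefer to apply the fix from code rather than from the Performance Advisor UI, the index creation itself is a one-liner. A short sketch with PyMongo (the connection string is a placeholder):

    # Sketch: create the single-field index suggested by the Performance Advisor.
    from pymongo import MongoClient, ASCENDING

    client = MongoClient("mongodb+srv://<user>:<password>@<cluster-host>/")
    db = client["transactions"]

    index_name = db.transactions.create_index([("customerID", ASCENDING)])
    print("Created index:", index_name)

    # Re-running the explain from Step 4 should now show an index scan (IXSCAN)
    # instead of a collection scan, with far fewer documents examined.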

Conclusion

By gathering the correct data, working iteratively, and using the MongoDB observability suite, you can quickly resolve database bottlenecks and restore your application's performance.

In our next post in the "Leveraging Database Observability at MongoDB" series, we’ll show how to integrate MongoDB metrics seamlessly into central observability stacks and workflows. This 'plug-and-play' experience works with popular monitoring systems like Datadog, New Relic, and Prometheus, offering a unified view of application performance and deep database insights in a single, comprehensive dashboard.

Sign up for MongoDB Atlas, our cloud database service, to see database observability in action. For more information, see our Monitor Your Database Deployment docs page.