Introducing Gap Filling for Time Series Data in MongoDB 5.3

Jane Fine
April 6, 2022 | Updated: November 21, 2022

At MongoDB we are all about letting developers innovate with data. Time series is the fastest-growing data-intensive workload, and our native time series capabilities let you build applications faster and get more insight from time series data with less cognitive load.

It’s common for time series data to have gaps, such as when an IoT sensor goes offline. But to perform analytics and ensure correct results, time series data needs to be continuous. You may also want to create histograms or correlate data sets to enable more complex operational analytics as part of app development. Gap filling, now available in MongoDB 5.3 Rapid Release, works in combination with the densification we introduced in MongoDB 5.1 to help you handle missing data and surface insight from it.

The two new aggregation stages create a simple, streamlined way to deal with missing data across both time series and regular collections, powering analytics for any use case. The $densify stage creates new documents to eliminate gaps in the time or numeric domain at the required level of granularity, and the $fill stage sets a value for a field wherever it is null or missing. Missing values can be filled with a constant, by linear interpolation, by carrying the last observation forward, or by carrying the next observation backward.
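To make the fill strategies concrete, here is a minimal pure-Python sketch of three of them: a constant, last observation carried forward, and linear interpolation. It is an illustration of the idea, not MongoDB's implementation. The function names are hypothetical, and the linear version assumes evenly spaced points (as you would have after densifying at a fixed step), with `None` standing in for a null or missing value.

```python
from typing import List, Optional

Series = List[Optional[float]]


def fill_constant(values: Series, constant: float) -> Series:
    """Replace every missing value with the same constant."""
    return [constant if v is None else v for v in values]


def fill_locf(values: Series) -> Series:
    """Carry the last observed value forward into each gap.

    Leading gaps stay None because there is nothing to carry yet.
    """
    filled: Series = []
    last: Optional[float] = None
    for v in values:
        if v is not None:
            last = v
        filled.append(last)
    return filled


def fill_linear(values: Series) -> Series:
    """Interpolate each gap on a straight line between its known neighbors.

    Assumes the points are evenly spaced. Gaps without a known value on
    both sides are left as None in this sketch.
    """
    filled = list(values)
    known = [i for i, v in enumerate(values) if v is not None]
    for left, right in zip(known, known[1:]):
        span = right - left
        for i in range(left + 1, right):
            fraction = (i - left) / span
            filled[i] = values[left] + (values[right] - values[left]) * fraction
    return filled


temperature = fill_linear([20.0, None, 22.0])
motion = fill_constant([1, None, 0], 0)
quantity = fill_locf([5, None, None, 3])
```

With these inputs, the missing temperature lands halfway between its neighbors, the missing motion reading becomes 0, and the two missing quantities repeat the last known value of 5.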

Input documents: tracking of temperature, motion, and inventory in a storage room

The aggregation for this example produces an hourly view of the metrics for each storage room. When temperature data is missing, it should be interpolated linearly; motion should default to 0; and the quantity of inventory should be carried over from the last known point.
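The requirements above could be expressed as a two-stage aggregation pipeline along the following lines. This is a sketch under stated assumptions rather than the exact pipeline from the original example: the field names `timestamp`, `roomId`, `temperature`, `motion`, and `quantity` are hypothetical, as is the choice of `"partition"` bounds. The pipeline is written as a plain Python list of dictionaries, in the shape a driver such as PyMongo accepts.

```python
from typing import Any, Dict, List


def hourly_room_pipeline() -> List[Dict[str, Any]]:
    """Build a $densify + $fill pipeline for an hourly view per storage room.

    All field names are assumptions made for illustration.
    """
    return [
        {
            # Create one document per hour for each room where none exists.
            "$densify": {
                "field": "timestamp",
                "partitionByFields": ["roomId"],
                "range": {"step": 1, "unit": "hour", "bounds": "partition"},
            }
        },
        {
            # Fill the null or missing fields, including those in the
            # documents that $densify just generated.
            "$fill": {
                "partitionByFields": ["roomId"],
                "sortBy": {"timestamp": 1},
                "output": {
                    "temperature": {"method": "linear"},  # interpolate linearly
                    "motion": {"value": 0},               # default to a constant
                    "quantity": {"method": "locf"},       # carry last value forward
                },
            }
        },
    ]


pipeline = hourly_room_pipeline()
```

With PyMongo, a pipeline like this would typically be passed to `collection.aggregate(pipeline)` on a deployment running MongoDB 5.3 or later. The stage order matters: $densify runs first so that $fill has documents to populate.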

Output: The second document was generated based on the two surrounding documents.

Previously, these types of complex analytics were possible only in specialized systems such as dedicated time series databases or data warehouses. Architecturally, technology practitioners faced a no-win trade-off. One option was a niche, often immature technology dedicated solely to time series workloads and disconnected from the systems of record that hold the full complement of enterprise data. The other was exporting time series data into a data warehouse, which makes insights hard to operationalize. Both involve managing multiple data silos and fragile ETL pipelines, driving up complexity and cost. Workarounds often involve developers building complex data pipelines to fill in the gaps, potentially at the application layer, which leads to poor query performance or limits analytics to small data sets. With MongoDB 5.3, developers can build rich analytics on time series data in flight and deliver operational insight to their users as part of the application experience.

MongoDB 5.3 is available now. If you are running Atlas Serverless instances or have opted in to receive Rapid Releases in your dedicated Atlas cluster, then your deployment will be automatically updated to 5.3 starting today. MongoDB 5.3 is also available as a Development Release for evaluation purposes only from the MongoDB Download Center. Consistent with the new release cadence announced last year, the functionality available in 5.3 and the subsequent Rapid Releases will roll up into MongoDB 6.0, our next Major Release scheduled for delivery later this year.

For a more in-depth explanation on gap filling for time series, check out our article on the MongoDB Developer Hub.

Safe Harbor Statement

The development, release, and timing of any features or functionality described for our products remains at our sole discretion. This information is merely intended to outline our general product direction, and it should not be relied on in making a purchasing decision. Nor is this a commitment, promise, or legal obligation to deliver any material, code, or functionality.
