Sachin Smotra

2 results

Using Agentic RAG to Transform Retail With MongoDB

In the competitive world of retail and ecommerce, it’s more important than ever for brands to connect with customers in meaningful, personalized ways. Shoppers today expect relevant recommendations, instant support, and unique experiences that feel tailored just for them. Enter retrieval-augmented generation (RAG) : a powerful approach that leverages generative AI and advanced search capabilities to deliver precise insights on demand. For IT decision-makers, the key challenge lies in integrating operational data with unstructured information—which can span object stores (like Amazon S3 and SharePoint), internal wikis, PDFs, Microsoft Word documents, and more. Enterprises must unlock value from curated, reliable internal data sources that often hold critical yet hard-to-access information. By combining RAG’s capabilities with these data assets, retailers can find contextually accurate information. For example, they can seamlessly surface needed information like return policies, refund processes, shipment details, and product recalls, driving operational efficiency and enhancing customer experiences. To provide the most relevant context to a large language model (LLM) , traditional RAG (which has typically relied on vector search) needs to be combined with real-time data in an operational database, the last conversation captured in a customer relationship management API call to a REST endpoint, or both. RAG has evolved to become agentic—that is, it’s capable of understanding a user inquiry and translating it to determine which path to use and which repositories to access to answer the question. MongoDB Atlas and Dataworkz provide an agentic RAG as a service solution that enables retailers to combine operational data with relevant unstructured data to create transformational experiences for their customers. MongoDB Atlas stores and unifies diverse data formats—such as customer purchases, inventory levels, and product descriptions—making them easily accessible. Dataworkz then transforms this data into vector embeddings, enabling a multistep agentic RAG pipeline to retrieve and create personalized, context-aware responses in real time. This is especially powerful in the context of customer support, product recommendations, and inventory management. When customers interact with retailers, Dataworkz dynamically retrieves real-time data from MongoDB Atlas, and, where needed, combines it with unstructured information to generate personalized AI responses, enhancing the customer experience. This architecture improves engagement, optimizes inventory, and provides scalable, adaptable AI capabilities, ultimately driving a more efficient and competitive retail operation. Reasons for using MongoDB Atlas and Dataworkz MongoDB Atlas and Dataworkz work together to deliver agentic RAG as a service for a smarter, more responsive customer experience. Here’s a quick breakdown of how: Vector embeddings and smart search: The Dataworkz RAG builder enables anyone to build sophisticated retrieval mechanisms that turn words, phrases, or even customer behaviors into vector embeddings—essentially, numbers that capture their meaning in a way that’s easy for AI to understand—and store them in MongoDB Atlas. This makes it possible to search for content based on meaning rather than exact wording, so search results are more accurate and relevant. Scalable, reliable performance: MongoDB Atlas’s cloud-based, distributed setup is built to handle high-traffic retail environments, minimizing disruptions during peak shopping times. Deep context with Dataworkz’s agentic RAG as a service: Retailers can build agentic workflows powered by RAG pipelines that combine lexical and semantic search with knowledge graphs to fetch the most relevant data from unstructured operational and analytical data sources before generating AI responses. This combination gives ecommerce brands the power to personalize experiences at a vastly larger scale. Figure 1: Reference architecture for customer support chatbots with Dataworkz and MongoDB Atlas Retail e-commerce use cases So how does this all work in practice? Here are some real-world examples of how MongoDB Atlas and Dataworkz are helping ecommerce brands create standout experiences. Building smarter customer-support chatbots Today’s shoppers want quick, accurate answers, and RAG makes this possible. When a customer asks a chatbot, “Where’s my order?” RAG enables the bot to pull the latest order and shipping details stored in MongoDB Atlas. Even if the question is phrased differently—say, “I need my order status”—the RAG-powered vector search can interpret the intent and fetch the correct response. As a result, the customer gets the help they need without waiting on hold or navigating complex menus. Personalizing product recommendations Imagine a customer who’s shown interest in eco-friendly products. With MongoDB Atlas’s vector embeddings, a RAG-powered system can identify this preference and adjust recommendations accordingly. So when the customer returns, they see suggestions that match their style—like organic cotton clothing or sustainably sourced kitchenware. This kind of recommendation feels relevant and thoughtful, making the shopping experience more enjoyable and increasing the chances of a purchase. Creating dynamic marketing content Marketing thrives on fresh, relevant content. With MongoDB Atlas managing product data and Dataworkz generating personalized messages, brands can send out dynamic promotions that truly resonate. For example, a customer who browsed outdoor gear might receive a curated email with top-rated hiking boots or seasonal discounts on camping equipment. This kind of targeted messaging feels personal, not pushy, building stronger customer loyalty. Enhancing site search experiences Traditional e-commerce searches often rely on exact keyword matches, which can lead to frustrating dead ends. But with MongoDB Atlas Vector Search and Dataworkz’s agentic RAG, search can be much smarter. For example, if a customer searches for “lightweight travel shoes,” the system understands that they’re looking for comfortable, portable footwear for travel, even if none of the product listings contain those exact words. This makes shopping smoother and more intuitive and less of a guessing game. Understanding trends in customer sentiment For e-commerce brands, understanding how customers feel can drive meaningful improvements. With RAG, brands can analyze reviews, social media comments, and support interactions to capture sentiment trends in MongoDB Atlas. Imagine a brand noticing a spike in mentions of “too small” in product reviews for a new shoe release—this insight lets them quickly adjust sizing info on the product page or update their stock. It’s a proactive approach that shows customers they’re being heard. Interactions that meet customers where they are In essence, MongoDB Atlas and Dataworkz’s RAG models enable retailers to make e-commerce personalization and responsiveness smarter, more efficient, and easier to scale. Together, they help retailers deliver exactly what customers are looking for—whether it’s a personalized recommendation, a quick answer from a chatbot, or just a better search experience. In the end, it’s about meeting customers where they are, with the information and recommendations they need. With MongoDB and Dataworkz, e-commerce brands can create that kind of connection—making shopping easier, more enjoyable, and ultimately more memorable. Learn more about Dataworkz on MongoDB by visiting dataworkz.com . The Dataworkz free tier is powered by MongoDB Atlas Vector Search .

December 23, 2024

Overcome the Biggest Obstacle to True Customer 360: with MongoDB Atlas and Dataworkz

The more you know about your customers, the better you can attract them and increase their lifetime value to your business. The ability of modern systems to collect, store, and leverage customer data across disparate systems has raised the bar for what it means to understand your customer. To the winner goes the spoils, and the spoils go to the companies that navigate the last mile in the journey to attain true Customer 360: they leave no customer data behind. They hunt down every relevant source of customer data in their reach and turn them into valuable insight – without breaking a sweat. How customer acquisition costs and customer lifetime value impacts business success Two calculations are critical to understanding modeling how to grow your business. Customer Acquisition Costs (CAC): the amount of money a company spends to get a new customer Customer Lifetime Value (CLV): the net profit attributed to the customer relationship for the duration of the time they use a product Note that “customer” is front and center in both terms. Capturing the data behind these two calculations correctly and continuously on a timely basis across the whole customer journey – from first touch to renewals – and making sense of it, is essential to understanding how to grow and retain your customers. These calculations must be performed on multiple customers and their interactions over a span of years. Analytical and AI models that calculate CLV and LTV need to be fed up-to-date, contextual, reliable data to identify relevant patterns. Historical customer data guides future CLV and influences decisions on how to optimize investments in sales, marketing, and customer service to positively impact the customer’s future purchase and product adoption behaviors. Similarly, customer data analysis guides decisions on how to optimize sales and marketing investments to dial-in CAC for the fastest and most profitable conversion of prospects to customers. How successful a company is at leveraging CLV, and LTV calculations depends on their ability to link different data sources and formats, and integrate all of the internal and external systems required for modeling. Until now, companies have struggled to accomplish this because data joining technologies have fallen short for one or more of these reasons: cost, complexity, and scalability. Forrester1 declared Activation Of Unstructured “Dark” Data a 2022 mega trend. According to Forrester” "The question is no longer about collecting enough data — it’s about ensuring that the data is usable and liberated from company silos to create new streams of value. In 2022, being “data driven” must become more than just a slogan — the next wave of business transformation will hinge on activating “dark” data (data that organizations collect but do not effectively use) to drive differentiated experiences for both customers and employees." Creating that holistic 360° view of your customer is a business imperative for every organization–but creating a unified view or golden record of your customers is hard and expensive, especially if you want to capture and make sense of the structured and coherent unstructured data and include unstructured “dark” data. Why creating a customer 360 is a technical challenge Customer 360 data resides in your ecommerce platforms, Customer Relationship Management (CRM), Sales Force Automation (SFA), ERP, Customer Service systems, Partner Relationship Management programs, loyalty programs, payment portals, web apps, mobile apps and more–across business units and functional departments. Cross-object reporting in a single application (like Salesforce) can be a challenge but cross application reporting (SFDC + Zoom + Zendesk + Parquet files in a data lake) present an even bigger set of challenges with no common identifiers in the data collected from various applications. The data not only comes from a variety of sources but also in different formats significant time and data engineering effort is required to combine data to create the elusive golden record. To compound the challenge, different teams require different views of the same data which demands different flavors of customer 360: The power to harness customer data where it belongs: Business users The traditional approach to creating a customer 360 takes the view that data from disparate sources like Prospecting (outreach.io), CRM(Salesforce), Marketing Automation (Marketo), Web Meetings (Zoom) and Support (Zendesk) needs to be combined, converted into a star or snowflake schema and deposited in an enterprise data warehouse. Or buy a Customer Data Platform (CDP) that makes yet another copy of your data that is a black box with no visibility into how data is processed. Keep in mind, this approach requires months of back and forth with the IT organization, resulting in cost and time overruns that lead to costly, stale customer 360 data that is no longer relevant. Dataworkz gives anyone from marketing ops to sales leaders the ability to assemble a complete, reliable customer 360 in minutes or hours. Unlike a legacy CDP, Dataworkz eliminates data security concerns as a barrier because it does the job without storing customer data. Users configure their sources and destinations, then Dataworkz acts as the “personal data engineer” bringing together data, processing, and ML with a visual no-code interface for business users – with the option to use SQL. A visual, easy-to-use with a no-code approach, to seamlessly gain a 360 view of your entire customer journey is the most innovative and efficient way to gain the contextual and quick customer insights necessary to grow your business. No more kinks in your Customer 360 data: Dataworkz and MongoDB Atlas Getting the best of all breeds is the mantra of smart, future-proof solutions. Capitalizing on Dataworkz and MongoDB Atlas together is a “no brainer.” Connecting Dataworkz and Atlas Add the Dataworkz cluster IP to the list of white listed IP addresses in the “network access” section of the Atlas console. Navigate to the “Databases'' section in the Dataworkz configuration section and specify the Atlas cluster IP address, username and password. Here are three ways this partnership can help: Get out of your RDBMS: Combining data from different sources, in different formats, with changing schema, is exceptionally hard, if not impossible with a rigid relational model. But MongoDB Atlas uses a JSON data model with a flexible schema which makes it ideal for building a customer 360. Work with a flexible, adaptive construct: Agility is a key tenet for organizations constantly adapting to their customer’s ever changing needs. With dynamic schemas, you are no longer worried about changes in source data breaking downstream applications. In minutes, not months: Dataworkz and MongoDB Atlas are SaaS services with usage based pricing. You get started without an upfront investment or hiring a team of data engineers and scientists and get to value in a matter of minutes and not weeks or months. Customer case study - Gunderson Dettmer Gunderson Dettmer is the preeminent international law firm with an exclusive focus on the innovation economy. In order to serve their clients, they need to leverage the latest and greatest information about corporate innovators anywhere in the world, from a wide variety of disparate data sources. To accomplish this, they had to transform and integrate multiple data sources into a coherent, evolving data warehouse and attempt to match financial, legal, and personnel information across “un-mappable” data sets without any common identifiers or keys. The process had to be scalable, reliable, flexible, and continuous since new information and new data sources are added daily. Gunderson needed to compare attributes for venture-backed companies from multiple internal and external datasets, but there were no globally unique identifiers available to join the datasets in most cases, and there were no off-the-shelf, end-to-end solutions to solve the problem. Working with Dataworkz, we developed an automated, scalable fuzzy matching pipeline that feeds likely matches to our internal infrastructure for manual review and approval, ultimately flowing into our MongoDB data warehouse. Maria Concordia, Data Operations and Analytics Architect The objective Gunderson's data warehouse and analytics team were tasked with creating a brand new process for unlocking data insights by weaving together previously un-mappable data sets and serving them up to the organization to create an evolving, holistic view of the Venture Capital universe. The resulting data warehouse would be leveraged by downstream users, including data analysts, upper management, marketing, and other firm stakeholders. The challenge with no single source of truth Gunderson Dettmer needed to compare attributes for companies and venture capital investors across multiple internal and external datasets, but there were no globally unique identifiers available to join the datasets in most cases, and there were no off-the-shelf, end-to-end solutions to solve the problem. Data sources included Gunderson’s on-premise proprietary data held in a SQL-based relational database, CSV data dumps from Pitchbook, Crunchbase, and various PDF files, and other potential sources, as available. Although the data sets overlapped, there were no common keys or identifiers across them. How they used power of MongoDB Atlas + Dataworkz Before the data warehouse team at Gunderson got involved, the de facto approach was to create interim data sets for the entity matching process by extracting data from the relational database, performing multiple Excel spreadsheet lookups to enrich it, and then manually matching the exported data to the external data sources. This process typically took 3-4 weeks in order to create a unified data set that could be used for analytics. The process was static and needed to be repeated (complete with manual record matching) when any of the data sources were updated. Not thrilled with this process, the data warehouse team created python scripts using Request and Fuzzy-Wuzzy modules to call a REST API endpoint, programmatically joining the datasets and then performing fuzzy logic for matching. Unfortunately, the REST API and Fuzzy matched datasets created were also one-time-only events and could not be leveraged for automating future analysis or ongoing data integration. This is where MongoDB Atlas was introduced to change the one-time only dataset into a JSON collection that could be reused. It required transforming the resulting datasets into rich, nested JSON objects that could be searched and queried using MongoDB Atlas. While this reduced the data set preparation time to about one week, there was still a large human review process that required Subject Matter Experts (SMEs) for difficult matches that fell out of the simplistic, fuzzy match parameters. For the final step in this project, they replaced the python scripts for transforming raw data into the JSON data model with Dataworkz to ingest data into MongoDB Atlas. The Dataworkz platform provided crucial features that vastly simplified the process of weaving the disparate “unmappable” data sets together into a coherent whole. Using Dataworkz allowed Gunderson’s team to eliminate their python scripts and enabled: Discovery, introspection, and manipulation of Parquet, BSON, JSON, CSV, structured, and unstructured data sources by bringing them all onto the same playing field for transformation, A clearer understanding of the possible challenges to creating easy-to-match collections, Automatically perform JSON format transformations without code, The ability to describe and execute no-code multi-stage pattern matching Automate their ingestion, transformation, and data storage pipeline in a day (rather than the months that it took to write the original python scripts), Provide continuous automatic data integration within minutes, with improved accuracy, lower human intervention, and built-in data monitoring and governance to ensure against data stream contamination in the future. Don't struggle any more for the metrics you need Whether you are a Chief Revenue Officer, CMO, head of a regional team, leading customer service, or a data analyst/scientist; you no longer need to spend extended time and dollars to gain the insights that should be available at your fingertips. Timely, reliable CAC and CLV metrics that are correct no matter what changes occur in your processes, definitions, calculations, or systems can be yours – when you need them – across departments, product lines, and business units. Dataworkz marks the entry of a new class of data platform that delivers the insights you need – accessible across any data, from any source, combined for any use, when you need it.

April 11, 2023