

(13_Phunkod/Shutterstock)
Retrieval-augmented generation (RAG) is now an accepted part of the generative AI (GenAI) workflow and is widely used to feed custom data into foundation AI models. While RAG works, calls to outside tools can add complexity and latency, which is what led the folks at MongoDB to work with in-database technology to speed things up.
As one of the most popular databases on the planet, MongoDB has developed integrations to support LangChain and LlamaIndex, two popular tools that developers use to build GenAI applications. Developers can also use any external vector database they want to store vector embeddings, indexes, and power queries at runtime.
“There’s of a multitude of ways” to build RAG workflows, says Benjamin Blast, director of product for MongoDB. “But in essence, it’s just adding friction. As a developer, I’m now responsible for finding an embedding model, procuring access to it, monitoring it, metering it — everything associated with pulling in some new component of the stack.”
While MongoDB users have options, the options are not all equal, Blast says. Anytime you go outside of the database, you’re adding friction and latency to the workflow, he says, and a bigger surface areas is also more complex to monitor and fix when things go wrong.
“We see ton of confusion and complexity in the overall market about kind of how to build these systems and how to string things together,” Blast says. “So we’re looking to dramatically simplify that.”
MongoDB wants to simplify things by building more of what GenAI developers need for RAG directly into its database. The company added a vector store by way of the Atlas Vector Search functionality in the fourth quarter of 2023. And earlier this year, it made another big move toward simplification in February when it acquired a company called Voyage AI.

MongoDB says its integration of Voyage AI embedding and reranking models will lead to simpler GenAI architectures (Image courtesy MongoDB)
Voyage AI developed a series of embedding and reranking models designed to accelerate information retrieval in GenAI workloads and improve the overall performance of the apps. These models are offered on Huggingface and are considered to be state-of-the-art.
The Voyage AI embedding models work hand in hand to convert source data into vector embeddings that are stored in the MongoDB vector store. Voyage AI developed a range of embedding models for specific use cases and even specific domains.
“They have a range of embedding models that are of different sizes, that let you choose how good are the results going to be,” Blast tells BigDATAwire in a recent interview. “And then we let you also choose to use what are called domain-specific models, which are fine-tuned on industry specific data, so you can have one for code or one for finance or one for law, so it’ll be even better results on that.”
The Voyage AI reranking models, meanwhile, continuously optimizes the embeddings to ensure the highest accuracy during runtime, for both text and image models. These models boost performance by analyzing the vector queries and responses, and assessing which ones are the best. It will then rerank the queries and the answers (i.e. the pre-created vector embeddings) to ensure the best ones are near the top.
“That will reorder the result set and give you the highest accuracy by giving you another 5% to 7% of performance around accuracy for that result,” Blast says.
The combination of the embedded vector store and the Voyage reranking and embedding models help customers to tune their RAG workflows to ensure their foundation models are getting the data they need to provide good decisions in a timely manner.
“We can do more clever things around the integration to improve the accuracy of the results past just what the models give on their own,” Blast says. “We can make really selective improvements to that overall workflow, from the embedding model to the database to the index, that our customers just would either have a lot of trouble doing and would require a bunch of complexity, or would be fundamentally unable to do on their own.”
MongoDB is currently bringing the vector store and Voyage AI models to MongoDB Atlas, its managed database offering running in the cloud. Vector search will eventually be made available as open source; the company hasn’t determined if Voyage AI models will also be made available as open source, Blast says. Customers can also use the Voyage AI models with LangChain and LlamaIndex if they like.
MongoDB is a notoriously developer-friendly database. Other databases will likely follow its lead in building these types of specialized embedding and reranking models directly into the database. But for now, the New York company is happy to lead in this department.
“We’ve taken, I think, a pretty unique approach that gives customers the benefit of integration,” Blast says. “You get to take advantage of the same set of drivers and other capabilities to make it really easy to use, but on the back end, still scale independently, which is one of the real advantages of MongoDB.”
Related Items:
MongoDB 8.0 Release Raises the Bar for Database Performance
IBM to Buy DataStax for Database, GenAI Capabilities
MongoDB Automates Resharding, Adds Time-Series Support
May 23, 2025
- COMPUTEX 2025 Closes with Record Attendance, Highlights AI Momentum
- MBZUAI Launches Institute of Foundation Models and Establishes Silicon Valley AI Lab
- Elastic Brings Hybrid Retrieval to Microsoft Semantic Kernel
- Capital One Integrates Databolt with Databricks and Snowflake for Scalable Data Security
May 22, 2025
- Qlik Enhances AutoNation’s Marketing ROI with Cloud-Based Insights
- Databricks Announces 2025 Data + AI Summit Keynote Lineup and Data Intelligence Programming
- LMArena Secures $100M in Seed Funding to Bring Scientific Rigor to AI Reliability
- Virtualitics Launches GenAI Toolkit to Power Next-Gen Readiness Agents for Defense
- Nom Nom Secures Patent for Self-Healing Data and AI Pipeline Tech
- Red Hat Unlocks GenAI for Any Model and Any Accelerator Across the Hybrid Cloud with Red Hat AI Inference Server
- SEMI and Purdue University Launch AI and Data Analysis Online Courses
- AIC Collaborates with Micron and Intel to Launch CXL-Enabled Next-Gen Storage Server
May 21, 2025
- VAST Data Unveils AI Operating System to Power Large-Scale Agentic Workflows
- Alluxio Expands AI Platform with Faster Checkpointing and Multi-Tenant Support
- Collibra Expands SAP Partnership with New Data Quality Offer for BDC Users
- Ataccama Strengthens Data Trust with Automated Lineage and Cloud-Native Processing
- Sigma Expands AI-Powered Analytics with Embedded Writeback and File Integration
- SAS Hackathon Winner Develops AI Tool to Combat Misinformation
- Confluent Unveils Snapshot Queries to Power Smarter Agentic AI
- Clarifai Joins Vultr Cloud Alliance to Deliver Scalable, Cost-Optimized, Full-Stack AI
- Informatica Goes All-In on AI Agents for Data Management
- Fine-Tuning LLM Performance: How Knowledge Graphs Can Help Avoid Missteps
- Slash Your Cloud Bill with Deloitte’s Three Stages of FinOps
- PayPal Feeds the DL Beast with Huge Vault of Fraud Data
- Inside the Chargeback System That Made Harvard’s Storage Sustainable
- Ambari Hadoop Cluster Manager is Back on the Elephant
- AI Today and Tomorrow Series #4: Frontier Apps and Bizops
- Databricks to Open Source Unity Catalog
- Three Ways AI Can Weaken Your Cybersecurity
- Vendors Rush to Adopt MCP, the USB-C Cord for AI Integration
- More Features…
- Dataminr Bets Big on Agentic AI for the Future of Real-Time Data Intelligence
- Databricks and KPMG Invest in LlamaIndex to Unlock Scalable Enterprise AI
- Do You Own Your Data? Third-Party Doctrine Says No
- Supabase’s $200M Raise Signals Big Ambitions
- Fivetran Aims to Close Data Movement Loop with Census Acquisition
- Sigma Secures $200M Round to Advance Its BI and Analytics Solutions
- Anaconda Simplifies Open Source Python Stack with AI Platform Launch
- GigaOM Report Highlights Top Performers in Unstructured Data Management for 2025
- Big Data Career Notes April 2025
- These Are the Top Challenges to GenAI Adoption According to AWS
- More News In Brief…
- Gartner Predicts 40% of Generative AI Solutions Will Be Multimodal By 2027
- SAS Unveils AI Agents with Customizable Human-AI Interaction for Transparent Decisioning
- Dataminr Raises $100M to Accelerate Global Push for Real-Time AI Intelligence
- Deloitte Survey Finds AI Use and Tech Investments Top Priorities for Private Companies in 2024
- Databricks Announces Data Intelligence Platform for Communications
- Kroger and NVIDIA to Reinvent the Shopping Experience Through AI-Enabled Applications and Services
- Dataminr Unveils Agentic AI Roadmap to Advance Real-Time Decision-Making
- Argonne Examines Opportunities and Risks of GenAI Tools
- LogicMonitor Expands AI Observability Platform with Agentic AIOps and New Partnerships
- Adastra Named AWS Data Foundation Partner, Helping Organizations Ready Their Data for GenAI
- More This Just In…
Sponsored Partner Content
-
Mainframe data: A powerful source for AI insights
-
CData recognized in the 2024 Gartner ® Magic Quadrant™ Report
-
Introducing AIStor, the most powerful version of MinIO to date
-
Designing a Copilot for Data Transformation
-
Get your Data AI Ready – Celebrate One Year of Deep Dish Data Virtual Series!
-
Supercharge Your Data Lake with Spark 3.3