
LlamaIndex Announces Investments from Databricks and KPMG LLP
SAN FRANCISCO, May 2, 2025 — LlamaIndex, a leading platform for agentic document workflows, has announced minority equity investments from the data and AI company Databricks and global professional services firm KPMG LLP. These investments will accelerate development and adoption of LlamaIndex’s flagship offerings—LlamaCloud and LlamaParse—which have become foundational tools for enterprises building production-grade knowledge agents on top of their unstructured data.
LlamaIndex provides a flexible, developer-first framework for building intelligent agents that can reason, retrieve, and act on complex enterprise documents. Its platform includes LlamaParse, the industry-leading parser for complex documents, and LlamaCloud, a secure, scalable SaaS and on-prem offering that handles ingestion, indexing, and retrieval workflows. By combining deep knowledge management capabilities with advanced chunking, embedding, and retrieval infrastructure, LlamaIndex enables organizations to operationalize Retrieval-Augmented Generation (RAG) and agent-based systems faster and more reliably than ever before.
“Databricks and LlamaIndex share a common vision of democratizing access to AI by making it easier for organizations to harness the power of their data,” said Patrick Wendell, Co-Founder & VP Engineering at Databricks. “LlamaIndex’s technology addresses a critical need in the enterprise AI stack, enabling companies to quickly build production-ready AI applications that leverage their proprietary data. This investment aligns perfectly with our mission to help customers drive innovation through data intelligence.”
“As we continue to innovate and push boundaries in applied AI, a robust data foundation is essential for building effective AI systems, particularly sophisticated knowledge assistants and agentic solutions,” said Swami Chandrasekaran, Principal, AI & Data Labs Leader at KPMG. “LlamaCloud and LlamaIndex provide the frameworks necessary to access, curate, and ingest data at-scale, enabling KPMG to develop differentiated, industry-specific solutions that deliver measurable business outcomes for our clients.”
The investment from Databricks comes through its Databricks Ventures’ AI Fund, which was established to support innovative startups utilizing or enabling AI in conjunction with the Databricks Data Intelligence Platform. This move follows Databricks’ recent investments in other AI-focused companies including Mistral AI, Perplexity, and Cleanlab. KPMG’s investment is spearheaded by KPMG Ventures, which is dedicated to collaborating with and investing in early-stage start-ups in areas like agentic AI, data infrastructure, cybersecurity, and more. KPMG Venture’s minority equity investment follows recent investments in other AI-driven startups including Ema, Wokelo and Rhino.AI.
“We’re excited to work with industry leaders like Databricks and KPMG to bring enterprise-grade LLM infrastructure to more organizations worldwide,” said Jerry Liu, Co-Founder and CEO of LlamaIndex. “These investments will accelerate our product development and go-to-market efforts, allowing us to help more enterprises solve the complex challenges of building production-ready AI systems.”
Databricks and LlamaIndex have customers in common, such as Cemex and Carlyle Group. “LlamaCloud completely changed our velocity,” said Daniel G Zapata, Senior Data Scientist at Cemex. “Our small team of data scientists have spun up 10 production‑grade use‑cases in a couple of months. Tasks that used to drag on for weeks now ship in days, because LlamaCloud takes care of the heavy‑lifting (data ingestion, various retrieval techniques etc) so we can focus on product. Additionally, the LlamaIndex team is very very collaborative and have helped us design multiple agents.”
“As the Applied AI Lead at Carlyle, one of Databricks’ enterprise clients, I’ve evaluated numerous document processing solutions for our AI initiatives. LlamaIndex’s LlamaParse stands out as the premier solution for integrating complex documents into our advanced analytics pipeline,” said Dean Barr, MD, Head of Applied AI and Data Scientist at The Carlyle Group. Its exceptional handling of nested tables, complex layouts, and image extraction has been instrumental in our data-driven investment strategies, particularly when combined with Databricks’ powerful data processing capabilities.”
The investments come at a time of significant momentum in the enterprise AI space. The combination of Databricks’ Data Intelligence Platform with LlamaIndex’s data infrastructure for LLMs offers enterprises powerful solutions for implementing generative AI applications. Customers can benefit from streamlined integrations, enabling faster deployment of AI solutions that leverage both structured and unstructured data.
Financial terms of the investments were not disclosed.
About LlamaIndex
LlamaIndex is the world’s leading framework for building agentic document workflows on top of unstructured enterprise data. Founded in 2023, LlamaIndex provides a complete platform for knowledge agent development—combining industry-leading tools for parsing, indexing, and retrieval with a scalable event-driven architecture for agent orchestration. Llamaindex is trusted by hundreds of thousands of developers as well as by enterprise gen AI teams at the Fortune 50.
Source: LlamaIndex