
Hortonworks Hocks Hadoop Upgrade
Apache Hadoop contributor Hortonworks announced Hortonworks Data Platform version 2. HDPv2 will be using the most recent version of Hadoop (0.23). According to the Apache Software Foundation, curators and cultivators of Hadoop, the newest release is enterprise ready.
The Hortonworks Data Platform, which is powered by Hadoop, is the company’s scalable open source platform for handling big enterprise and research data. As with the other Hadoop distros floating around out there, the key to the success of the platform is the ability to integrate data from just about any source imaginable and provide a more simplified way to make use of it.
The company describes how they differentiate themselves from others offering Hadoop simplification for the enterprise, noting:
“Unlike other Hadoop solutions that lock away management features within proprietary extensions, Hortonworks Data Platform includes Ambari, an open source installation and management system out of the box. Hortonworks Data Platform also includes HCatalog, a metadata management service for simplifying data sharing between Hadoop and other enterprise information systems, along with a complete set of open APIs, including WebHDFS and those for Ambari and HCatalog, to make it easier for ISVs to integrate and extend Apache Hadoop.”
On Jan.6th, when the Apache Software Foundation made news announcing Hadoop v1.0 after 6 years of development, a number of notable new features and enhancements were made. With the release of Hadoop version 0.23, improvements have been made to both HDFS and MapReduce including:
- NextGen MapReduce (also known as YARN)
- HDFS Federation, which allows Namenodes to act independently and without coordination with eachother
- Splitting MapReduce JobTracker into 2 components (resource management and life-cycle management)
- The Resource manager will now manage global assignment of compute resources for each application while ApplicationMaster will manage scheduling and coordination.
According to Eric Baldeschwieler, CEO of Hortonworks, “With more than three years of development and much anticipation, Apache Hadoop 0.23 delivers important advancements in scalability, performance, high availability and data integrit.
He continued, “Apache Hadoop 0.23 is currently being tested across hundreds of applications in the world’s largest Hadoop deployment. We are excited to make the technology advancements in Apache Hadoop 0.23 available through an easily consumable version via the Hortonworks Data Platform v2.”
HDP was created to extremely scalable and fully open-source platform for storage, processing, analysis of large scale data. Along with HDFS and MapReduce, Hortonworks Data Platform includes Pig, Hive, HBase and Zookeeper.
Hortonworks was created by Yahoo! and Benchmark Capital to facilitate Apache Hadoop development. They provide tech support, training and certifications for vendors, enterprises, service providers and systems integrators.
Related Stories
Hadoop Hits Primetime with Production Release
RainStor Brings Database to Hadoop
Karmasphere Ushers in New Hadoop Partner
August 6, 2025
- LF AI & Data Foundation Hosts Vortex Project to Power High Performance Data Access for AI and Analytics
- NetApp Accelerates VMware Migrations with Amazon Elastic VMware Service Integration
- BigID Powers AI Data Readiness with New Cleansing Capabilities for Sensitive and Regulated Data
- Gathr.ai Named a High Performer in G2’s Summer 2025 Grid Reports
- Accenture Invests in Snorkel AI to Help Financial Services Firms Transform Data into AI Solutions
- Espresso AI Launches Kubernetes for Snowflake to Renovate Data Warehouses
- BigID Redefines Data Classification with First-Ever AI-Powered Prompt Engine
- Redpanda Partners with Databricks to Deliver One‑Step Stream‑to‑Table Iceberg Integration for Real‑Time Lakehouses
August 5, 2025
- DataPelago Launches World’s 1st Accelerator for Apache Spark That Leverages Both CPUs and GPUs
- Reltio Unveils AgentFlow, A Set of Agents for Data Governance
- PCI-SIG Announces PCI Express 8.0 Specification to Reach 256.0 GT/s
- Monte Carlo Launches Native Salesforce Integrations for Data and AI Observability
- DDN Showcases AI400X3 Performance in Latest MLPerf Storage Benchmarks
- MLPerf Storage v2.0 Results Highlight Storage’s Role in AI Training at Scale
- Pantomath Raises $30M in Series B to Automate Data Operations with AI DRE Agent
- Cribl Unveils Cribl Guard, Redefining Sensitive Data Protection with Groundbreaking AI Capabilities
August 4, 2025
- Cloudera Acquires Taikun to Deliver Cloud Experience to Data Anywhere for AI Everywhere
- Qbeast Secures $7.6M in Seed Funding to Help Open Data Platforms Scale Efficiently
- DataBahn.ai Unveils Smart Agent to Unify Security and Observability Telemetry
- TDengine Releases IDMP, Enhancing How Industrial Data Is Consumed
- Scaling the Knowledge Graph Behind Wikipedia
- Rethinking Risk: The Role of Selective Retrieval in Data Lake Strategies
- Top 10 Big Data Technologies to Watch in the Second Half of 2025
- LinkedIn Introduces Northguard, Its Replacement for Kafka
- What Are Reasoning Models and Why You Should Care
- Apache Sedona: Putting the ‘Where’ In Big Data
- Top-Down or Bottom-Up Data Model Design: Which is Best?
- Rethinking AI-Ready Data with Semantic Layers
- LakeFS Nabs $20M to Build ‘Git for Big Data’
- How To Keep AI From Making Your Employees Stupid
- More Features…
- Supabase’s $200M Raise Signals Big Ambitions
- Mathematica Helps Crack Zodiac Killer’s Code
- Promethium Wants to Make Self Service Data Work at AI Scale
- Solidigm Celebrates World’s Largest SSD with ‘122 Day’
- McKinsey Dishes the Goods on Latest Tech Trends
- AI Is Making Us Dumber, MIT Researchers Find
- Ryft Raises $8M to Help Companies Manage Their Own Data Without Relying on Vendors
- With $20M in Seed Funding, Datafy Advances Autonomous Cloud Storage Optimization
- The Top Five Data Labeling Firms According to Everest Group
- Toloka Expands Data Labeling Service
- More News In Brief…
- Seagate Unveils IronWolf Pro 24TB Hard Drive for SMBs and Enterprises
- Gartner Predicts 40% of Generative AI Solutions Will Be Multimodal By 2027
- Promethium Introduces 1st Agentic Platform Purpose-Built to Deliver Self-Service Data at AI Scale
- OpenText Launches Cloud Editions 25.3 with AI, Cloud, and Cybersecurity Enhancements
- TigerGraph Secures Strategic Investment to Advance Enterprise AI and Graph Analytics
- StarTree Adds Real-Time Iceberg Support for AI and Customer Apps
- Gathr.ai Unveils Data Warehouse Intelligence
- Graphwise Launches GraphDB 11 to Bridge LLMs and Enterprise Knowledge Graphs
- Databricks Announces Data Intelligence Platform for Communications
- Data Squared Announces Strategic Partnership with Neo4j to Accelerate AI-Powered Insights for Government Customers
- More This Just In…