Follow BigDATAwire:

Tag: databricks

What’s In the Pipeline for Apache Spark?

According to Apache Spark creator Matei Zaharia, Spark will see a number of new features and enhancements to existing features in 2017, including the introduction of a standard binary data format, better integration with Read more…

How These Banking, Energy, and Pharma Firms Use Spark

Few frameworks have gained so much popularity as quickly as Apache Spark.  The open source technology may not be ubiquitous yet in the analytics world, but it's fast approaching that point. Spark has certainly caught Read more…

Spark ML Runs 10x Faster on GPUs, Databricks Says

Apache Spark machine learning workloads can run up to 10x faster by moving them to a deep learning paradigm on GPUs, according to Databricks, which today announced that its hosted Spark service on Amazon's new GPU cloud. Read more…

Databricks CEO on Streaming Analytics, Deep Learning, and SQL

As Apache Spark continues to gain steam, so too does Databricks, the company behind the popular distributed processing framework. At the recent Strata + Hadoop World conference, we caught up with Databricks CEO and co-fo Read more…

Apache Spark Adoption by the Numbers

It's been about three years since Apache Spark burst onto the big data scene and became one of the hottest technologies on the planet. Judging by the numbers surrounding Spark's adoption—including things like salaries, Read more…

Spark 2.0 to Introduce New ‘Structured Streaming’ Engine

The folks at Databricks last week gave a glimpse of what's to come in Spark 2.0, and among the changes that are sure to capture the attention of Spark users is the new Structured Streaming engine that leans on the Spark Read more…

Spark Streaming: What Is It and Who’s Using It?

A recent study of over 1,400 Spark users conducted by Databricks, the company founded by the creators of Spark, showed that compared to 2014, 56 percent more Spark users globally ran Spark Streaming applications in 2015. Read more…

Apache Spark Gets IBM Mainframe Connection

IBM's recent embrace of Apache Spark is beginning to generate dividends in the form of open source contributions for a mainframe big data link to Spark. Big data software vendor Syncsort, Woodcliff Lake, N.J., said Tu Read more…

Spark 1.5 to Incorporate ‘Tungsten’ Upgrades

A preview release of the Apache Spark open source in-memory processing framework incorporates major performance upgrades, according to Databricks Inc., the big data processing company founded by Spark's creators. Data Read more…

IBM, Databricks Join Forces to Advance Spark

IBM has jumped on the Apache Spark bandwagon, revealing it would throw its considerable weight behind the open source in-memory processing framework that has been gaining momentum over the last year. Separately, Datab Read more…

Apache Spark Ecosystem Continues To Build

Apache Spark was everywhere at the recent Strata + Hadoop World conference. From Tableau's new Spark interface to the new Spark as a service (SaaS) offerings and Intel's new Spark initiative, the big data framework was v Read more…

Apache Spark Continues to Spread Beyond Hadoop

Apache Spark is most often thought of as a faster replacement for MapReduce, the batch-oriented programming framework that enabled first-gen Hadoop to catch traction 10 years ago. Indeed, Spark was initially created with Read more…

Where Does Spark Go From Here?

The excitement behind Apache Spark reached an apex last week during the 2014 Spark Summit put on by Databricks, the company behind the in-memory analytics phenomenon. With a large community of users and growing support f Read more…

BigDATAwire