The Hortonworks Blog

Cisco and Hortonworks established their official alliance back in 2013. Together, they have been bringing to life the vision of a single big data platform for the enterprise. As every industry is witnessing unprecedented quantities of data and a variety of new data types e.g. clickstream and behavior, machine and sensor, geographic data, server logs, sentiment and web…, Cisco and Hortonworks have been collaborating to empower companies with their data. Oftentimes, organizations need to optimize their IT infrastructure and free up their Enterprise Data Warehouse (EDW) to make the most of all of their data, building new analytic applications and moving towards the vision of the Data Lake.…

This is the second post in a series exploring the theme of long-running service workloads in YARN. See for the introductory post.

Long-running services deployed on YARN are by definition expected to run for a long period of time—in many cases forever. Services such as Apache™ HBase, Apache Accumulo and Apache Storm can be run on YARN to provide a layer of services to end users, and they usually have a central master running in conjunction with an ApplicationMaster (AM).…

Analysts and data scientists⎯not to mention business executives⎯want Big Data not for the sake of the data itself, but for the ability to work with and learn from that data. As other users become more savvy, they also want more access. But too many inefficient queries can create a bottleneck in the system.

The good news is that Apache™ Hive 0.14—the standard SQL interface for processing, accessing and analyzing Apache Hadoop® data sets—is now powered by Apache Calcite.…

Leading enterprise organizations have concluded that YARN-enabled Hadoop is foundational to their modern data architectures. These companies subscribe with Hortonworks (and implement Hortonworks Data Platform) to bring additional types of data under management, merge those with legacy datasets, and unlock new business insight.

But don’t take our word for it.

Watch these brief videos and hear our customers describe how a data-first approach is transforming their businesses.

Advertising

Luminar is the leading big data analytics and modeling provider uniquely focused on delivering actionable insights on U.S.…

This is the third post in a series exploring recent innovations in the Hadoop ecosystem that are included in Hortonworks Data Platform (HDP) 2.2. In this post, we introduce the theme of supporting rolling upgrades and downgrades of a Hadoop YARN cluster.

HDP 2.2 offers substantial innovations in Apache™ Hadoop YARN, enabling Hadoop users to efficiently store and interact with their data in a single repository, simultaneously using a wide variety of engines.…

Hortonworks provides enterprise Hadoop for the telecommunications service provider, and Hortonworks Data Platform (HDP) is architected from the ground up with the centralized YARN-based architecture and core enterprise services for data governance, security and cluster operations that can revolutionize your telecommunications business.

As the originators of Hadoop, leaders in the developer community, and partners for your success, nobody is better to help you become a data-centric telecommunications enterprise.

As a data scientist working with Hadoop, I often use Apache Hive to explore data, make ad-hoc queries or build data pipelines.

Until recently, optimizing Hive queries focused mostly on data layout techniques such as partitioning and bucketing or using custom file formats.

In the last couple of years, driven largely by the innovation of the Hive community around the Stinger initiative, Hive query time has improved dramatically, enabling Hive to support both batch and interactive workloads at speed and at scale.…

We are excited to be working with and announcing ClearStory Data’s integration with Hortonworks Data Platform (HDP) during Strata + Hadoop World 2015. This partnership with Hortonworks is significant as it brings ClearStory’s business-ready, fast-cycle, scalable analysis on Hadoop Data Lakes and specifically on the Hortonworks Data Platform (HDP).…

This is a unique moment in time. Fueled by open source, Apache Hadoop has become an essential part of the modern enterprise data architecture and the Hadoop market is accelerating at an amazing rate.

The impressive thing about successful open source projects is the pace of the “release early, release often” development cycle, also known as upstream innovation. The process moves through major and minor releases at a regular clip and the downstream users get to pick the releases and versions they want to consume for their specific needs.…

Today we’re excited to be jointly announcing with EMC that the Isilon OneFS file system has been certified to work with the Hortonworks Data Platform (HDP). Now Isilon customers who are looking for a robust, enterprise-ready, stable Apache Hadoop platform can use HDP on their Isilon implementations.

Joint Engineering Delivering Choice

We’re excited to see the results of the months of engineering and testing efforts that now provide customers even greater deployment choice for their Hadoop projects as they are implementing a modern data architecture towards a data lake.…

OspreyData is a Hortonworks® technology partner whose solution is certified both for Hortonworks Data Platform and YARN. The company delivers agile big data analytics solutions for the oil and gas industry. In this blog, Al Brown, CTO at OspreyData, shares his thoughts on how the industry is addressing a big problem: unplanned interruptions to production.

A Mandate for Operational Efficiency and Margin Growth

The oil and gas industry is constantly challenged with a mandate to operate more efficiently—both in the oilfield and within the data center.…

Today Microsoft announced two important new updates to their Azure HDInsight Service with Apache Hadoop 2.6, now available on new clusters.

We are excited to continue to work alongside Microsoft in expanding the deployment options to the Linux Operating System for managed Hadoop as a Service Azure HDInsight clusters. The HDInsight on Linux Preview leverages the completely open Apache Ambari framework to deploy, manage and monitor Hadoop clusters on premise or in the cloud.…

There are lots of ways to interact with Hortonworks at this weeks Strata +Hadoop World event.

Exhibitor Booth 1321

While at our booth you can talk with our experts and get the latest on Hortonworks, get an overview of Apache Hadoop or hear more about how we are helping organizations drive success with Hadoop. You can also get one of the popular Hortonworks elephants!

Passport Program

While at our booth you can pick up a Passport Card to that you can enter for a chance to win some great prizes from one of the 24!…

Hortonworks has expanded its certification program to create an industry-recognized certification program where individuals prove their Hadoop knowledge by performing hands-on tasks on a Hortonworks Data Platform (HDP) cluster, as opposed to answering multiple-choice questions. Hortonworks University will be offering three new certification exams:

HDP Certified Developer

HDP Certified Java Developer

HDP Certified Administrator

The HDP Certified Developer (HDPCD) exam is the first of our new hands-on, performance-based exams designed for Hadoop developers working with frameworks like Pig, Hive, Sqoop and Flume.…