Products and Service

Using Cloudera

Conferences and Events

Hadoop World

HBaseCon

Other Events

Resource Type

Resource Type

Video

Presentation Slides

Document

Resource Library

Cloudera offers a variety of materials on big data consolidation, storage and processing. The library includes high-level overviews as well as detailed information on Apache Hadoop and the surrounding ecosystem.

Learn about Cloudera Impala––an open source project that's opening up the Apache Hadoop software stack to a wide audience of database analysts, users, and developers. The Impala massively parallel processing (MPP) engine makes SQL queries of Hadoop data simple enough to be accessible to analysts familiar with SQL and to users of business intelligence tools––and it’s fast enough to be used for interactive exploration and experimentation.

Intel CIO Kimberly Stevenson explains how together, Cloudera and Intel are paving the way for the next era of computing, empowering businesses to place big data analytics at the forefront of their strategies to move faster and more efficiently than competition. Intel and Cloudera are enabling businesses of every industry to solve untapped market challenges and deliver new industry leading profitability.

In this interview that aired on March 22, 2013, Cloudera’s Chief Scientist Jeff Hammerbacher offers fascinating insights into the origins of Big Data and data science techniques, their re-implementation into open source, and their positive application across science as well as business.

At Pinterest, we have been increasingly using HBase for a variety of applications – real-time, interactive, and batch oriented. In this talk, Pinterest's Varun Sharma discusses its experience with architecting and scaling our Feed storage on HBase. “Feeds” are central to user experience at Pinterest and lie on a critical path for user requests.

Understand why content addressable storage is the right pattern for many web use cases, how to foster an already existing HBase cluster for better usage of possibly underutilized resources, and operational gotchas to store and serve BLOBs from HBase at scale.