Apache Hadoop for System Administrators

Allen Wittenauer, LinkedIn, Inc.

Abstract:

Knowledge is power! As a result, Apache Hadoop is taking the world by storm as a way to mine data and increase knowledge. For system administrators, however, Hadoop is a large, complicated system that isn't well understood. In this talk, Allen will cover some Hadoop basics from an operations perspective: what it is, how it works, key data points to monitor, metrics that are important to gather, and the secrets to making it work securely and reliably.

Allen Wittenauer has been involved with Apache Hadoop since May 2007, when he was hired by Yahoo! to bring large-scale operational experience to the fledgling project. His work there helped create the basic blueprints that almost all Hadoop deployments follow today. At LinkedIn, his experience provided key insight and a foundation for its award-winning data science team.

Open Access Media

USENIX is committed to Open Access to the research presented at our events. Papers and proceedings are freely available to everyone once the event begins. Any video, audio, and/or slides that are posted after the event are also free and open to everyone.