Hadoop Certifications

The Hadoop certification landscape

The Apache Hadoop project provides tutorials on Hadoop technology but does not offer certifications or endorse any certification programs. Hadoop distribution vendors have taken on professional development by creating their own education, testing, and certification programs. These programs are specific to each vendor's distribution, but the fundamentals of core Hadoop components such as HDFS, YARN, Hive, Pig, and Spark are transferable to other environments. The following sections provide an overview of the certification programs from the three leading Hadoop distribution vendors.

Cloudera

Cloudera University provides certification training and other courses in tracks for administrators, data analysts, and developers. Certifications currently available through the Cloudera Certified Professional (CCP) Program include:

CCP Data Engineer: Focuses on building the data "pipelines" that produce data sets optimized for different types of workloads.

MapR

MCHBD - MapR Certified HBase Developer: This certification reflects proficiency in developing programs that use HBase as a distributed NoSQL datastore.

MCSD - MapR Certified Spark Developer: This certification demonstrates proficiency in programming with Apache Spark to work with large data sets.

MapR Academy also offers numerous on-demand courses and other resources in addition to the certification courses.

Note that there are other big data training and certification programs that are not Hadoop-specific. For example, Microsoft, Oracle, and SAS Institute are among the technology companies that have their own big data education programs.