What Will I Learn?Understand what is Big Data, the challenges with Big Data and how Hadoop propose a solution for the Big Data problemWork and navigate Hadoop cluster with easeInstall and configure a Hadoop cluster on cloud services like Amazon Web Services (AWS)Understand the difference phases of MapReduce in detailWrite optimized Pig Latin instruction to perform complex data analysisWrite optimized Hive queries to perform data analysis on simple and nested datasetsWork with file formats like SequenceFile, AVRO etcUnderstand Hadoop architecture, Single Point Of Failures (SPOF), Secondary/Checkpoint/Backup nodes, HA configuration and YARNTune and optimize slowing running MapReduce jobs, Pig instructions and Hive queriesUnderstand how Joins work behind the scenes and will be able to write optimized join statementsWherever possible, students will be introduced to difficult questions that are asked in real Hadoop interviews

RequirementsAlthough you don't have to be an expert in Java, basic knowledge in Java programming is required as we will be looking at programs in Java.Basic Linux commands

DescriptionFrom the creators of the successful Hadoop Starter Kit course hosted in, comes Hadoop In Real World course. This course is designed for anyone who aspire a career as a Hadoop developer. In this course we have covered all the concepts that every aspiring Hadoop developer must know to SURVIVE in REAL WORLD Hadoop environments.

The course covers all the must know topics like HDFS, MapReduce, YARN, Apache Pig and Hive etc. and we go deep in exploring the concepts. We just don't stop with the easy concepts, we take it a step further and cover important and complex topics like file formats, custom Writables, input/output formats, troubleshooting, optimizations etc.

All concepts are backed by interesting hands-on projects like analyzing million song dataset to find less familiar artists with hot songs, ranking pages with page dumps from wikipedia, simulating mutual friends functionality in Facebook just to name a few.Who is the target audience?This course is for anyone who aspire a career as a Hadoop DeveloperThis course is for anyone who want to learn and understand in depth about Hadoop and Big Data