Talend For Hadoop Training Curriculum

Talend For Hadoop Training Course Content

Getting started with Talend

How Talend Works, Introduction to Talend Open Studio and its Usability, What is Metadata?

Jobs

Creating a new Job, Concept and creation of a Delimited file, Using Metadata and its Significance, What is propagation?, Data integration schema, Creating Jobs using tFilterRow and string filter, Creating a delimited input file

Overview of Schema and Aggregation

Job design and its features, What is tMap?, Data Aggregation, Introduction to tReplicate and its Working, Significance and working of tLogRow, tMap and its properties

Connectivity with Data Source

Extracting data from the source, Source and Target in Database (MySQL), Creating a connection, Importing Schema or Metadata

Getting started with Routines/Functions

Calling and using Functions, What are Routines?, Use of XML files in Talend, How data formatting functions work, What is type casting?

Data Transformation

Defining Context variables, Learning Parameterization in ETL, Writing an example using tRowGenerator, Defining and Implementing Sorting, What is an Aggregator?, Using t flow for publishing data, Running a Job in a loop

Hands on Exercises

Installing Hadoop in Pseudo-Distributed Mode, Understanding Important configuration files, their Properties and Daemon Threads, Accessing HDFS from the Command Line

MapReduce – Basic Exercises, Understanding the Hadoop Ecosystem, Introduction to Sqoop, use cases and Installation, Introduction to Hive, use cases and Installation, Introduction to Pig, use cases and Installation, Introduction to Oozie, use cases and Installation, Introduction to Flume, use cases and Installation, Introduction to YARN

Mini Project – Importing MySQL Data using Sqoop and Querying it using Hive

Macros and Imports, UDFs, Using Other Languages to Process Data with Pig, Hands-On Exercise: Extending Pig with Streaming and UDFs

Pig Jobs

Impala

Introduction to Impala

What is Impala?, How Impala Differs from Hive and Pig, How Impala Differs from Relational Databases, Limitations and Future Directions, Using the Impala Shell

Choosing the Best Tool (Hive, Pig, Impala)

Major Project – Putting it all together and Connecting the Dots

Putting it all together and Connecting the Dots, Working with Large data sets, Steps involved in analyzing large data

ETL Connectivity with Hadoop Ecosystem

How ETL tools work in the big data industry, Connecting to HDFS from an ETL tool and moving data from the Local system to HDFS, Moving Data from a DBMS to HDFS, Working with Hive from an ETL Tool, Creating a MapReduce job in an ETL tool, End-to-End ETL PoC showing Hadoop integration with an ETL tool