Talend is an open source data integration platform. It provides various software and services for data integration, data management, enterprise application integration, data quality, cloud storage and Big Data. Talend is considered to be the next generation leader in the cloud and Big Data integration software. It helps companies in taking real-time decisions and become more data-driven. Using Talend, data becomes more accessible, its quality enhances and it can be moved quickly to the target systems.

Training Outcomes

Complete understanding of the ETL concepts and ability to solve the real-time business problems using Talend

Comprehensive knowledge of Talend Architecture and its various Components

Understanding of Big Data and Hadoop concepts and the benefits of integrating Talend with Hadoop

Easy integration and Access to Hadoop Ecosystem using Talend

Implementation of Talend with HDFS, Pig, and Hive (the most demanded and futuristic skills)

Rigorous involvement of an SME throughout the Talend Training to learn industry standards and best practices

COURSE OUTLINE

Topic : Overview of the concept of Data Warehouse.

Topic : · Dimensions, Hierarchy, Facts

Topic : DW models:- Star and Snowflake schemas.

Topic : SCD: - Slowly changing Dimension types and their maintenance.

Topic : Introduction to talend.

Topic : Installation of talend TOS_DI 6.0

Topic : ·Architecture of the Talend server client enterprise version and the comparison with the TOS.

Topic : Explaining the model design and it’s palette.

Topic : ·Explaining the job designer and it’s palette.

Topic : ·Explaining the project settings and perspective and views on the GUI.

Topic : Starting talend job design and development.

Topic : Basic understanding of various types of components in talend.

Topic : Importance of the schema in configuration and setting of the components.

Topic : Database connectivity testing.

Topic : Simple jobs with random data generation.

Topic : Job versioning

SQLAlchemy

SQLAlchemy is a popular SQL toolkit and Object Relational Mapper. It is written in Python and gives full power and flexibility of SQL to an application developer. It is an open source and cross-platform software released under MIT license. SQLAlchemy is famous for its object-relational mapper (ORM), using which, classes can be mapped to the database, thereby allowing the object model and database schema to develop in a cleanly decoupled way from the beginning.

Training Outcomes

Data access techniques

Use SQLAlchemy to access your DB

What an ORM is and why you should use it

Map classes to the database

Generate the database from your in-memory models

Use the more flexible SQLAlchemy core layer

COURSE OUTLINE

Module 1 : SQLAlchemy Core

Module 2 :SQLAlchemy ORM

Module 3 : Simple statements

Module 4 : Simple ORM

Module 5 : SQLAlchemy Philosophy

Module 6 :Advanced statements

Module 7 : ORM and relations

Module 8 : Advanced ORM

Splunk

Splunk, one of the highly-used data analysis software is utilized by global organizations for searching, analyzing and monitoring through huge amount of data. Other benefits of Splunk includes report generation, dashboard creation, and visualizing of data on real-time basis. The areas where it is mostly effective, are application management, security and web analytics.

Training Outcomes

Understand Splunk Power User/ concepts.

Apply various Splunk techniques to visualize data using different graphs and dashboards.

Implement Splunk in the organization to Analyze and Monitor systems for operational intelligence.

This training is designed to provide you the knowledge and skills that are required to become a successful Spark Developer using Python and prepare you for the Cloudera Hadoop and Spark Developer Certification Exam (CCA175). Throughout the PySpark Training, you will get an in-depth knowledge of Apache Spark and the Spark Ecosystem