Talend Resources

MapReduce Program

Use Hadoop's MapReduce program to manage big data.

Hadoop MapReduce (YARN in Hadoop) is the software framework for efficiently processing massive data sets, making it possible for organizations to work with and big data. By managing distributed processing of data on commodity hardware, the MapReduce program enables massive scalability across hundreds or thousands of servers in a Hadoop cluster. Organizations today use the MapReduce program to manage a wide variety of real time big data projects, from sensor analytics and social data mining to tracking capital markets and personalizing marketing messages for targeted audiences.

While the MapReduce program is critical to managing big data, it is also new enough that few developers are skilled in it – which presents a major dilemma for most organizations. Developers trained on the MapReduce program are, for the most part, are highly paid and already employed by major players in the big data arena. Few organizations have the resources to train existing developers and MapReduce skills, potentially limiting the organization's ability to use big data. To reap all the rewards big data has to offer, organizations need a solution that lets them work with big data using the people and skills they already have in place.

Talend: easy-to-use tools for Hadoop's MapReduce program.

Talend's software for big data redefines the developer skillset required to work with big data. By providing easy-to-use tools for the MapReduce program and other big data technologies, Talend's versatile big data platform lets organizations harness the power of big data without needing to invest heavily in training or in new teams of developers.

Using Talend, developers can quickly connect to hundreds of data sources, integrate data easily into massive data sets, and expertly manipulate and transform it to make faster and more informed business decisions.

Let developers program with MapReduce using existing skills.

With Talend, developers can use the MapReduce program to quickly load, extract and improve disparate data.

Talend includes more than 800 pre-built connectors that let developers quickly connect to any data source. They can use a graphical environment to easily map data sources and targets without needing to learn MapReduce skills or write complicated code. And they can work with Hadoop, NoSQL, Pig and other platforms to manipulate and analyze massive amounts of data in very little time.

In addition to tools for the MapReduce program, Talend includes components for other big data frameworks like HBase, Hive, HCatalog, Oozie, Sqoop and Pig. And Talend supports major Hadoop distributions, including the Hortonworks Data Platform, Amazon EMR, Cloudera, IBM PureData and others.