
Hortonworks for Hadoop

Hortonworks for Hadoop is an open source big data platform that specializes in the development and support of Apache Hadoop for enterprises working in big data. Founded by engineers from the original Yahoo! Hadoop development team, Hortonworks for Hadoop is designed to facilitate integration of Hadoop with existing enterprise data. The Hortonworks Data Platform includes YARN, which supports multi-workload data processing across an array of methods, from batch to interactive to real-time, and is supported by solutions for integration, security, governance, and operations. By enabling enterprises to integrate Hadoop with their strategic technologies and existing team capabilities, Hortonworks makes it easier to manage sensor data, social data analytics, and other big data projects.

But working with Hortonworks for Hadoop can pose a challenge. It takes developers with deep skills in big data, NoSQL and Hadoop technologies to implement and manage Hortonworks, and because these technologies are so new, very few developers with adequate skills are available. Many organizations lack the data integration tools to accommodate all the new sources big data entails – their legacy integration tools simply don't cut it. And for most organizations, their existing infrastructure is insufficient to handle massive volumes of big data.

Fortunately, Talend provides a big data solution that lets organizations work with big data and Hortonworks for Hadoop using the developers and infrastructure they already have.

Talend makes it easy to work with Hortonworks for Hadoop

Talend provides a powerful and versatile open source platform that enables developers to use Hortonworks for Hadoop to integrate and manipulate massive data sets. With Talend, developers can use their existing skill set to work with Hadoop, Pig, Hive, and other big data technologies, as well as NoSQL databases such as MongoDB, Cassandra, and HBase.

Talend also provides support for other Hadoop distributions, including Amazon EMR, MapR, IBM PureData, and the Cloudera distribution.

Integrate and manipulate big data in Hortonworks for Hadoop

Talend provides developers with all the tools they need to manage big data in Hortonworks for Hadoop:

With more than 800 pre-built connectors, developers can connect to any data source – including NoSQL databases – without needing to learn new skills.

An easy-to-use graphical environment allows developers to visually map data sources to targets without having to write complicated code.

Using the skills they already possess, developers can quickly extract, transform, and load massive data sets to produce complex analytics.

Achieving massive scalability is simple – Talend automatically generates the underlying code for every data connection as new clusters are added to the system.