Pentaho Tackles Big Data Challenges by Leveraging Amazon Web Services

Pentaho Corporation, the leader in open source business intelligence (BI) and data integration, today announced plans to enable Pentaho Data Integration for Hadoop to easily integrate with Hadoop data stored in Amazon Elastic MapReduce. As a result, Amazon Web Services LLC (AWS) customers can leverage Pentaho’s ETL capabilities to deploy a hybrid data model whereby they can easily move data between Amazon Elastic MapReduce and databases, data warehouses and other cloud based and on-premise data stores.

Pentaho’s offering for Amazon Elastic MapReduce is a tightly integrated report designer that will give AWS customers the option to build production or ad hoc reports from data spanning AWS and on-premise data sources. Available in the November timeframe, this will be a pay-as-you-go utility offering that leverages the elastic nature of Amazon Elastic Compute Cloud (Amazon EC2) and Amazon Elastic MapReduce.

Amazon Elastic MapReduce is a web service that enables businesses, researchers, data analysts, and developers to easily and cost-effectively process vast amounts of data. It utilizes a hosted Hadoop framework running on the web-scale infrastructure of Amazon EC2 and Amazon Simple Storage Service (Amazon S3).

Using Amazon Elastic MapReduce, organizations can instantly provision as much or as little capacity as they like to perform data-intensive tasks for applications such as web indexing, data mining, log file analysis, machine learning, financial analysis, scientific simulation, and bioinformatics research. With Amazon Elastic MapReduce users focus on analyzing their data without having to worry about time-consuming set-up, management or tuning of Hadoop clusters or the compute capacity upon which they sit.

TOPICS

ITBriefcase brought to you by: Virtual Star MediaCopyright by IT Briefcase - IT Briefcase is a targeted online publication that attracts qualified business and IT professionals who are actively researching business integration solutions. Some of the topics we cover include BI, BPM, Cloud Computing, Data Storage, Health IT and Open Source. A full list of the topics we cover can be found on the right hand side of our website.