In my previous blog, “Talend & Apache Spark: A Technical Primer” I walked you through how Talend Spark jobs equate to Spark Submit. In this blog post, I want to continue evaluating Talend Spark configurations with Apache Spark Submit. First, we are going to look at how you can map the options in the Apache Spark Configuration tab in the Talend Spark Job, to what you can pass as a...

Many businesses today are scrutinizing their operations to figure out how to join the digital transformation revolution. They understand that to become more competitive and customer-centric, they need processes that are flexible, integrated, insightful and scalable. They understand harnessing data and infusing business processes with it is the key to success. Unfortunately, poor data practices,...

We all know that enterprise data needs change constantly, and recently that change has come at an increasing pace. Companies that were once processing all their big data on-prem have suddenly moved into the cloud. Frameworks we used to know and love suddenly become obsolete. However, an interesting debate that still rages on is how to get data processed faster. There are generally two heralded ways of processing data today: Batch Processing Stream Processing...

In my previous blog, “Talend & Apache Spark: A Technical Primer” I walked you through how Talend Spark jobs equate to Spark Submit. In this blog post, I want to continue evaluating Talend Spark configurations with Apache Spark Submit. First, we are going to look at how you can map the options in the Apache Spark Configuration tab in the Talend Spark Job, to what you can pass as a...

Authored by Darius Kemeklis, Myers-Holum, Inc It’s hard to believe that Data Warehousing (DW) has been around since 1970 when Bill Inmon first defined the term. The 1990’s saw Bill Inmon and Ralph Kimball dueling on two different Data Warehousi...

You may have seen recently that the first stable version of Apache Beam (v.2.0) was recently released. Apache Beam is an advanced unified programming model designed for batch and streaming data processing. It’s extremely powerful and portable which is why we’ve been actively contributing to the project since the very beginning. Recently, we’ve integrated Apache Beam into Talend Data Preparation. François Lacas wr...

This year, the Apache Software Foundation announced that Apache Beam was established as a new top-level project. A little over two years ago, Google committed its Dataflow SDK to the Apache Software Foundation, which provides a programming model used to express Data processing pipelines very easily. Talend has a long history with the Apache Software Foundation (and already has committers on key E...

In this blog, I want to go over how to set up and deploy a Talend Spark Streaming job into a new Elastic Stack instance. Spark is the engine of choice for near real-time processing, not only for Talend but also for many organizations who have a need for large-scale lightning fast data processing. The Elastic Stack is a highly versatile and widely adopted suite of tools built for monitoring that works perfectly for this scenario....

Talend was recently recognized as a certified partner on the MapR Converged Data Platform. This is exciting news not only for Talend and MapR, but also for current and future customers who are looking at Talend and MapR as the solution to their big data challenges. Today we are going to look at how you can implement a real-time recommendation model usi...