twitter

In this post, we will walk you through performing common ETL tasks on Hadoop using the open-source Cask Data Application Platform. A typical ETL pipeline consists of a data source, followed by a transformation, used for filtering or cleaning data, ending in a data sink. For example, an organization might take a snapshot of their … Read more

Application Templates are the major new feature added in CDAP 3.0. In this blog post we will introduce what they are, and the problems they solve. While building applications, we noticed that CDAP users would sometimes end up deploying multiple applications that all solved the same type of problem. Their code was mostly the same; … Read more