Sameer is a Client Services Engineer at Databricks, where he works with customers on Apache Spark deployments. He has extensive industry expertise in the Hadoop ecosystem, Cassandra, Couchbase and general NoSQL domain. Prior to Databricks, Sameer worked 2 years as a freelance big data consultant + trainer globally and taught 100+ big data courses. Before that, Sameer was a Systems Architect at Hortonworks, an Emerging Data Platforms Consultant at Accenture R&D and a Enterprise Consultant for Symantec/VERITAS (specializing in VCS, VVR, SF-HA).

Find Sameer Farooqui at

Workshop:
Continuous Application with Apache Spark 2.0

Location: Seacliff CD

Day of week: Thursday

Duration: 9:00am -
12:00pm

A Continuous Application is an end-to-end application that reacts to data in real-time. But it is more than a typical event-based streaming app. Continuous applications capture input streams, blend them when static/offline data and sometimes apply machine learning to the combined data before serving the results back out. These modern applications support quick ad-hoc queries along with long running batch queries.

In today's session Sameer and Jules from the Evangelism team at Databricks will show you how to build a continuous application using a single API. Apache Spark 2.0 provides a high-level API to easily combine SQL, DataFrames, Streaming, Machine Learning and Graph Processing. Through hands on coding sessions and using demo prototype code, we will show you how a small team or single developer can build these sophisticated modern applications.