Apache Spark

In this video from the GOTO Chicago 2017 conference, Dean Wampler (Big Data Architect at Lightbend & O’Reilly Author) explains the streaming Data Architecture and helps to demonstrate the key features to consider when choosing which streaming products to utilize. He speaks about Apache Beam, Flink, Akka Streams, Kafka Streams and Apache Spark and addresses determining factors like latency and volume.

“Spark Summit Europe 2015” wrapped up yesterday in Amsterdam. Here’s a couple of the top videos from the last few days. Juliet Houghland, a Data Scientist from Cloudera talks about the client-side need/demand for PySpark, and Aaron Davidson from Databricks talks about some more recent problems he sees emerging. Watch Videos