Questions During The Webinar

Q1: Difference between Spark Streaming and Storm

By default, Storm does ‘event-based processing’ (one event at a time). Using Trident on top of Storm, we can also do micro-batch processing.

Spark Streaming processes events in ‘micro-batches’. For example, I can define the batch interval to be 5 seconds. Spark will then process whatever number of events were captured in that interval (could be none, one, ten, or a thousand!). Currently, the lowest batch interval is about half a second (500 ms).
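To make the difference concrete, here is a small plain-Python sketch of the two models. It is only a simulation and does not use the Storm or Spark APIs. The function names, the sample events, and the `end_time` parameter are all made up for illustration.

```python
import math
from collections import defaultdict


def process_one_at_a_time(events, handler):
    """Event-based model: call the handler once for every single event."""
    return [handler([payload]) for _, payload in events]


def process_in_micro_batches(events, batch_interval, handler, end_time):
    """Micro-batch model: group events by batch interval and call the
    handler once per interval, even when the interval is empty."""
    batches = defaultdict(list)
    for timestamp, payload in events:
        # An event belongs to the interval its timestamp falls into.
        batches[int(timestamp // batch_interval)].append(payload)
    n_batches = math.ceil(end_time / batch_interval)
    return [handler(batches.get(i, [])) for i in range(n_batches)]


# Hypothetical events as (arrival time in seconds, payload).
events = [(0.4, "a"), (1.2, "b"), (4.9, "c"), (12.0, "d")]

# The handler here just counts how many events it was given.
per_event = process_one_at_a_time(events, len)
per_batch = process_in_micro_batches(events, batch_interval=5, handler=len, end_time=15)

print(per_event)  # [1, 1, 1, 1]
print(per_batch)  # [3, 0, 1]
```

With a 5-second interval, the first batch holds three events, the second holds none, and the third holds one, while the event-based version invokes the handler four times with one event each.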