Apache Spark architecture speeds data jobs, ousts MapReduce

Distinguished by fast in-memory processing, high-level machine learning libraries and integrated data streaming, open source Apache Spark architecture continues to find adherents both in Web startups and enterprise settings.At Databricks’ 2016 Spark Summit East in New York, users shared their reasons for employing the Spark architecture, which melds several useful APIs with a basic in-memory analytics engine. Their experiences and that of others add weight to recent Market Research Media estimates that, globally, the Spark market could reach $4.2 billion by 2020. Increasingly, Spark is at the heart of efforts to process data in motion, and fraud detection is one of the prime examples. Chris D’Agostino, vice president of technology at Capital One, based in McLean, Va., told a summit crowd his team is using Spark to harden its defenses…