This page tracks the users of Spark. To add yourself to the list, please email user@spark.apache.org with your organization name, URL, a list of which Spark components you are using, and a short description of your use case.

Spark powers NOW APPS, a big data, real-time, predictive analytics platform. We use Spark SQL, MLlib and GraphX components for both batch ETL and analytics applied to telecommunication data, providing faster and more meaningful insights and actionable data to the operators.

We are using Spark Core, Streaming, MLlib and Graphx. We leverage Spark and Hadoop ecosystem to build cost effective data center solution for our customer in teleco industry as well as other industrial sectors.

Bakdata – using Spark (and Shark) to perform interactive exploration of large datasets

Big Industries - using Spark Streaming: The Big Content Platform is a business-to-business content asset management service providing a searchable, aggregated source of live news feeds, public domain media and archives of content.

Formed by the creators of Apache Spark and Shark, Databricks is working to greatly expand these open source projects and transform big data analysis in the process. We're deeply committed to keeping all work on these systems open source.

Using Apache Spark for log processing and ETL. The data obtained feeds the recommender system powered by Spark MLLIB Matrix Factorization. We are evaluating the use of Spark Streaming for real-time analytics.

PredicitionIo - PredictionIO currently offers two engine templates for Apache Spark MLlib for recommendation (MLlib ALS) and classification (MLlib Naive Bayes). With these templates, you can create a custom predictive engine for production deployment efficiently.

Using Scala, Spark and MLLib for Radius Marketing and Sales intelligence platform including data aggregation, data processing, data clustering, data analysis and predictive modeling of all US businesses.