There are two clear trends in the big-data ecosystem: the growth of machine learning use cases that leverage large distributed data sets, and the growth of Spark’s machine learning libraries (often referred to as MLlib) for these use cases. In fact, Spark’s MLlib is arguably the leading solution for machine learning on large distributed data sets. Intel and Cloudera have collaborated to speed up Spark’s ML algorithms via integration with Intel’s Math Kernel Library (Intel® MKL).