Data Science with Spark

This course will provide an understanding of Spark framework, RDD the core data structure of Spark and how to use them to transform data. The participants will gain understanding on how the data is stored and processed in a distributed manner. Participants will also understand the data science process, key machine learning technique and how to choose the right one for their use case. course will also involve understanding and applying MLlib library for implementing machine learning models.