Archives

Nina Zumel & John Mount from Win-Vector, LLC with “Preparing Data for Analysis using R” Workshop. you can download the materials at this GitHub link.

Excerpt from abstract….“Data quality is the biggest determiner of data science project success or failure. Preparing data for analysis is one of the most important, laborious, and yet neglected aspects of data science. Many of the routine steps can be automated in a principled manner. This workshop will lay out the fundamentals of preparing data and provide interactive demonstrations in the open source R analysis environment.”

Subscribe to blog via email

Latest News

Big data remains a rapidly evolving field with new applications and infrastructure appearing every year. In this talk, Matei Zaharia will cover new trends in 2016 / 2017 and how Apache Spark is moving to meet them. More