Friday, May 8, 2015

At the last Tech Talk Tuesday we took an overview of Python's Data Science related packages.

The key packages for numerical computing are Numpy, Scipy and Scikit-learn. The documentation for python is great, and makes presentations like this easy. These packages are loaded with code samples, even for complex concepts like Grid search and cross validation. The machine learning package, scikit-learn also has exercises below the code samples. Doing the exercises enforces the concepts, and is great preparation for solving problems like the ones in Kaggle competitions.

We also demoed iPython Notebooks, a fantastic way to create live data analysis documents.