Friday, May 3, 2013

A new RefCard from the GlowingPython!

This Refcard is a collection of code examples that introduces the reader to the principal Data Mining tasks using Python. In the RefCard you will find the following contents:

How to import and visualize data.

How to classify and cluster data.

How to discover relationships in the data using regression and correlation measures.

How to reduce the dimensionality of the data in order to compress and visualize the information it brings.

How to analyze structured data with networkx.

Each topic is covered with code examples based on four of the major Python libraries for data analysis and manipulation: numpy, matplotlib,sklearn and networkx. Here is a preview of the first two pages:

Thanks for putting this together. I just finished Coursera classes in Data Analysis (using R) and Machine Learning (using Octave) and this was a perfect way to get an overview of those topics in a language I like a LOT better. I found a few typos:1. Page 2, last paragraph, left side: three instances of the word "rows" instead of "columns".2. Page 4, third code block, right side: refers to "tt" instead of "c" in three lines of code.3. Page 5, first code block, right side: reads "import arrange" instead of "import arange".