Descriptive Analytics-Part 1: Data Formatting Exercises

Descriptive Analytics is the examination of data or content, usually manually performed, to answer the question “What happened?”.

In order to be able to solve this set of exercises you should have solved the ‘part 0’ of this series, in case you haven’t you can find the solutions to run them in your machine here. This is the second set of exercise of a series of exercises that aims to provide a descriptive analytics solution to the ‘2008’ data set from here. This data set which contains the arrival and departure information for all domestic flights in the US from 2008 has become the “iris” data set for Big Data. In the exercises below we will try to make the format of the dates adequate for further processing. Before proceeding, it might be helpful to look over the help pages for the str_pad, substring, paste, chron, head.

For this set of exercises you will need to install and load the packages stringr, chron.