Author: Sharat

By Sharat Shashi Nayar, Operations Lead

3.1 trillion USD. That’s IBM’s estimate of the cost of poor-quality data in the US alone in 2016. How do we define good, clean data? “Cleaning” refers to the removal of invalid data points from a given dataset. The end goal of data cleaning is not just to “clean up” the data by removing its unwanted elements, but also to bring structure to it,…