Data cleaning

Data cleaning, or data cleansing, is the process of identifying, correcting, and deleting erroneous data in a database in order to produce a usable and adequate database. It can be carried out in different ways: filtering and correcting (e.g., fixing typos) can be done with automated tools or manually. Structuring the data is also a main part of the process.
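
The steps described above (deleting unusable records, correcting typos, and structuring the data) can be illustrated with a minimal Python sketch. The records, the field names, the typo lookup table `CITY_FIXES`, the age range, and the function `clean_rows` are all assumptions made up for this example; real cleaning rules depend on the database at hand.

```python
# Hypothetical raw records; field names and values are invented for illustration.
RAW_ROWS = [
    {"name": " Alice ", "city": "Budapset", "age": "34"},
    {"name": "Bob", "city": "Vienna", "age": "-5"},
    {"name": "", "city": "Vienna", "age": "41"},
    {"name": "Carol", "city": "vienna", "age": "29"},
]

# Assumed lookup table mapping known misspellings (lowercased) to corrected values.
CITY_FIXES = {"budapset": "Budapest", "vienna": "Vienna"}


def clean_rows(rows):
    """Return a cleaned copy of rows: filter, correct, and structure each record."""
    cleaned = []
    for row in rows:
        # Structuring: strip stray whitespace from text fields.
        name = row["name"].strip()
        if not name:
            continue  # Deleting: a record without a name is unusable here.

        # Structuring: store the age as an integer instead of a string.
        try:
            age = int(row["age"])
        except ValueError:
            continue  # Deleting: the age is not a number.
        if age < 0 or age > 120:
            continue  # Deleting: the age is outside the assumed plausible range.

        # Correcting: replace known typos, leave unknown values unchanged.
        city_raw = row["city"].strip()
        city = CITY_FIXES.get(city_raw.lower(), city_raw)

        cleaned.append({"name": name, "city": city, "age": age})
    return cleaned


if __name__ == "__main__":
    for record in clean_rows(RAW_ROWS):
        print(record)
```

In this sketch the automated part handles only the errors it has rules for. Values that match no rule pass through unchanged and would still need manual review.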

Data cleaning is an important and very time-consuming part of a data scientist's work.