The bigger our data sets, the harder it becomes to sift through them to find objective truth, and the higher the costs of getting it wrong. By lowering the bar to data science, open-source tools like Hadoop have actually increased our ability to misread our data, with serious consequences.