The dataset was built from official ATLAS simulation, with Higgs boson decays to tau pairs (H → ττ) mixed with different background processes. It was used in the 2014 HiggsML challenge on Kaggle and is hosted on the CERN Open Data Portal.

A version of the HiggsML dataset (used in the 2014 Kaggle challenge) is provided. It contains a mixture of events in which a Higgs boson decays into a tau pair and events from the principal background processes. One half of the data is unchanged, while the other half has been artificially distorted in various ways.
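
Because signal and background events are mixed in one table, a natural first check is to count how many events fall in each class. The following is a minimal sketch only: it runs on a small inline toy table rather than the real file, and the column names `EventId` and `Label`, the label values `s` and `b`, and the helper `count_by_label` are assumptions made for illustration, not details stated in this description.

```python
import csv
import io

# Toy stand-in for the real file. The column names "EventId" and "Label"
# and the values "s" (signal) / "b" (background) are assumptions.
SAMPLE_CSV = """EventId,Label
100000,s
100001,b
100002,b
100003,s
100004,b
"""


def count_by_label(csv_file, label_column="Label"):
    """Count rows per distinct value of label_column in an open CSV file."""
    counts = {}
    for row in csv.DictReader(csv_file):
        label = row[label_column]
        counts[label] = counts.get(label, 0) + 1
    return counts


# Hypothetical usage on the toy table above.
counts = count_by_label(io.StringIO(SAMPLE_CSV))
print(counts)
```

To apply the same check to a downloaded file, the `io.StringIO` object could be replaced with a file opened via `open(path, newline="")`, adjusting `label_column` to whatever the file actually uses.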