How It Works

Upload a Prediction Problem

The competition host provides the raw data and a description of the problem.

Predictive modeling requires participants to train algorithms that are evaluated against historical data. Highly performing models which show accuracy for past events can then be adopted by the host for prediciting the same behavior in the future.

For example, an insurance company might host a competition where participants build models that identify fraudulent claims in the dataset. The winning model(s) would then be deployed by the insurance company to predict which claims are fraudulent in the future.

Kaggle can offer consultative help to the host to prepare their data, frame the competition, anonymize the data, and set the rules for the challenge.

Submit

Over the course of several weeks or months, participants experiment with different statistical techniques and compete against each other to produce the best models. Their submissions are scored immediately (based objectively on their predictive accuracy, relative to a hidden solution file) and summarized on a live leaderboard for who is leading the competition.

Evaluate and Exchange

After the deadline passes, the host organization pays prize money in exchange for the winning model or models.