Contents

The infinite MNIST dataset

Why settle for 60000 MNIST training examples when you can have one trillion?

The MNIST8M dataset was generated using the elastic deformation code originally written
for (Loosli, Canu, and Bottou, 2007). Unfortunately the
original MNIST8M files were accidentally deleted from the NEC servers a couple weeks ago.
Instead of regenerating the files, I have repackaged the generation code in
a convenient form. You can now generate arbitrary amounts of pseudo-random
MNIST training examples. You can even use this code to generate your training
data on the fly. We call this the infinite MNIST dataset.