The version 1.03 of the open database contains 1,207,293 brain signals of
2
seconds each, captured with the stimulus of seeing a digit (from 0 to 9)
and thinking about it, over the course of almost 2 years between 2014 & 2015, from a single Test Subject David Vivancos.
In 2018 we started sharing also a new open dataset
"IMAGENET" of The Brain .

We built our own tools to capture them, but there is no post-processing on
our side, so they come raw as they are read from each EEG device, in total
395,072,896 Data Points.

Feel free
to test any machine learning, deep learning or whatever algorithm you think it
could fit, we only ask for acknowledging the source and please let us know of
your performance!

We choose not to differentiate the signals into
training/test/validation sets at this
point so pick the distribution you prefer.

A small portion of the signals were captured without the stimulus of seeing
the digits for contrast, all are random actions not related to thinking or
seeing digits, you can decide to use them or not in your tests, they use the
code -1.

SIGNAL
DISTRIBUTION:

This is the distribution of the signals per device and digit:

Device/Digit

0

1

2

3

4

5

6

7

8

9

-1

Total

MindWave (MW)

5,531

5,498

5,517

5,416

5,381

5,568

5,476

5,552

5,545

5,450

12,701

67,635

EPOC (EP)

91,224

88,914

90,930

92,652

88,886

91,994

91,322

88,718

91,728

91,882

2,226

910,476

Muse (MU)

11,904

11,632

11,920

11,832

11,536

12,052

12,368

12,080

12,208

11,988

44,412

163,932

Insight (IN)*

6,305

6,740

6,535

6,605

6,620

6,460

6,425

6,470

6,590

6,500

0

65,250

Total

114,964

112,784

114,902

116,505

112,423

116,074

115,591

112,820

116,071

115,820

59,339

1,207,293

* Insight captures started in September 2015, so soon
will be updated with more brain signals, last update 10/14/2015 v1.05

** EPOC dataset updated to fix the channel sepparation
by comma and use dot for the decimals, instead of commas only , last update
06/16/2018 v1.01

FILE FORMAT:

The
data is stored in a very simple text format including:

[id]:
a numeric, only for
reference purposes.

[event]id,
a integer, used to distinguish the same event captured at different brain
locations, used only by
multichannel devices (all except MW).

[device]:
a 2
character string, to identify the device used to capture the signals, "MW" for
MindWave, "EP" for Emotive Epoc, "MU" for Interaxon Muse & "IN" for Emotiv
Insight.

[channel]:
a
string, to indentify the 10/20 brain location of the signal, with possible
values:

[code]:
a
integer, to indentify the digit been thought/seen, with possible values
0,1,2,3,4,5,6,7,8,9 or -1 for random captured signals not related to any of the
digits.

[size]:
a
integer, to identify the size in number of values captured in the 2 seconds of
this signal, since the Hz of each device varies, in "theory" the value is close
to 512Hz for MW, 128Hz for EP, 220Hz for MU & 128Hz for IN, for each of the 2
seconds.

[data]:
a
coma separated set of numbers, with the time-series amplitude of the signal, each
device uses a different precision to identify the electrical
potential captured from the brain: integers in the case of MW & MU or
real numbers in the case of EP & IN.

There is no headers in the files, every
line isa signal, and the fields are
separated by a tab

For example one line of each device could be (without
the headers)

[id]

[event]

[device]

[channel]

[code]

[size]

[data]

27

27

MW

FP1

5

952

18,12,13,12,5,3,11,23,37,36,26,24,35,42……

67650

67636

EP

F7

7

260

4482.564102,4477.435897,4484.102564…….

978210

132693

MU

TP10

1

476

506,508,509,501,497,494,497,490,490,493……

1142043

173652

IN

AF3

0

256

4259.487179,4237.948717,4247.179487,4242.051282……

BRAIN LOCATIONS:

Each
EEG device capture the signals via different sensors,
located in these areas of
my
brain, the color represents the device: MindWave, EPOC,
Muse,
Insight

This MindBigData The "MNIST" of Brain Digits is made available under the Open Database License: http://opendatacommons.org/licenses/odbl/1.0/. Any rights in individual contents of the database are licensed under the Database Contents License: http://opendatacommons.org/licenses/dbcl/1.0/