Presentation

It consists of 10.000 pairs of 20 seconds audio chunks that are very precisely aligned (i.e. remain simultaneous during the whole duration when played at the same time). Each pair also comes with a stereo mix of the to chunks. These pairs of audio samples were extracted from a real-world corpus of french radio broadcast stations, using the method described in the following article, and its complementary page about audio temporal alignment.

To improve the durability and consistancy of this corpus, we followed the documentation protocol advised in the recent paper by Peeters and Fort, where they propose a methodology for providing new corpora to the MIR community:

Examples

As explained in the corpus XML description, each item comes in the form of three audio files :

The scaled file is the original item, as learnt in the database, after a time-warping (hence the "scaled" attribute) to synchronize it with the stream file.

The stream file is the occurrence of the item of the test audio stream (radio broadcast recordings).

The mix file combines the two previous audio chunks in a stereo mix in order to assess the synchronicity.

The table hereafter provides a few examples of the corpus content.

Item

Scaled (Mono)

Stream (Mono)

Mix (Stereo)

oc00002

oc00003

oc00004

oc00005

oc00006

Download

The XML description and annotations are freely available on github athttps://github.com/hibooo/syncoccur.
Please contact the author (see contact panel on the right) to ask for the audio part of the corpus.