Corpora

My work over the last several years includes the creation of the phonetic transcription component of the Weist-Jarosz Corpus of Child Polish, which is freely available as part of the CHILDES project on child language. The corpus includes audio recordings of spontaneous productions of four children acquiring Polish and their interactions with their primary caregivers.

The audio-linked phonetic and orthographic transcripts of the child speech can be viewed online at:

Please let me know if you use the data for any projects. I would love to hear what it is being used for. If you use this corpus in published materials, please cite the following two papers for the phonological component of the corpus: