a longitudinal record of the early language development of 8 Cantonese-speaking children

subject.linguisticField

language_acquisition

subject.monoMultilingual

monolingual

subject.resourceSubject

corpus

description

The files contain episodes of conversational exchanges between children and adults, with each utterance represented in Chinese characters, romanizations as well as corresponding parts-of-speech tags. http://hum.shoppingshop.us/~cancorp/index.html

description.price
*This metadata is only as a guide

description.language

English

description.inputDevice

description.inputEnvironment

description.speakingStyle

description.speechMode

description.samplingRate

description.additionalData

publisher

Queries about the corpus should be directed to Thomas Lee (htlee@netvigator.com)

contributor

depositorthe Arts Faculty Server of the Chinese University of Hong Kong

childone year from the time when they were between one and a half to two years old

contributor.attribute.speaker.gender

male4

female4

contributor.attribute.speaker.number

date.created

1991-00-00 1994-00-00

date.issued

date.modified

2000-00-00

type

TextThe Chinese version, The romanized version,

type.discourseType

interactive_discourse

type.linguisticType

language_description

type.purpose

analysis

type.style

speech

type.form

unfixed

type.sentence

short

long

type.annotation

annotatedcoded according to the internationally accepted CHAT format (Codes for the Human Analysis of Transcripts) and tagged with 33 parts-of-speech labels, http://hum.shoppingshop.us/~cancorp/project/tags.html