CHILDES is the child language component of the TalkBank system. TalkBank is a system for sharing and studying conversational interactions.

The CanCorp was collected and transcribed by Thomas Lee and Colleen Wong. From the full corpus, 128 files were selected for further processing at HKU for a project including Lee, Fletcher, Weizman, Stokes, and Leung. This project clarified some CHAT codes and created a romanization system that allowed for easy morphological analysis. However, once a full MOR system was available for Cantonese in 2006, this romanization was removed and the main line text was replaced by the original Chinese characters from CanCorp.