TRAD Chinese-English Web domain (blogs) Parallel corpus

ID:

ELRA-W0110

This is a parallel corpus of 15,000 characters in Chinese (equivalent to 10,000 words) and 2 reference translations in English. The source texts are blog articles dealing with various subjects such as economy, environment, society, technologies, etc. Articles are dated from June 2013. The translation has been conducted by two different translation teams following a strict protocol aimed at producing high quality translations.

The content has also been translated into French (see ELRA-W0109).

This corpus was produced by ELDA within the PEA TRAD project supported by the French Ministry of Defence (DGA). It was used as a test set for the TRAD evaluation campaign.