TRAD Arabic-French Mailing lists Parallel corpus - Development set

ID:

ELRA-W0107

This is a parallel corpus of 10,000 words in Arabic and a reference translation in French. The source texts are emails collected from Wikiar-I, a mailing list for discussions about the Arabic Wikipedia. The collected emails are dated from 2004 to 2007.

The translation has been conducted following a strict protocol aimed at producing high quality translations.

The content is also translated into English (see ELRA-W0108).

This corpus was produced by ELDA within the PEA TRAD project supported by the French Ministry of Defence (DGA). It was used as a development set for the TRAD MT evaluation campaign.