corpus alignés

English translation: aligned corpora

Answers

13 mins

parallel corpora

Explanation: Parallel corpora are corpora in which texts in two or more languages are aligned sentence by sentence or phrase by phrase.

Parallel Corpora
The term parallel corpora is typically used in linguistic circles to refer to texts that are translations of each other, whereas the term comparable corpora refers to texts in two languages that are similar in content but are not translations. To exploit a parallel text, some kind of text alignment, which identifies equivalent text segments (approximately sentences), is a prerequisite for analysis. (Some researchers take the next step and aim for lexical alignment.)
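
To make the idea of sentence-level alignment concrete, here is a minimal sketch in Python. It assumes the two texts have already been split into sentences and that each source sentence has exactly one translation in the same position; real alignment tools also have to cope with merged, split, or omitted sentences. The function name `align_by_position` and the sample sentences are purely illustrative and do not come from the references quoted here.

```python
from typing import List, Tuple


def align_by_position(source: List[str], target: List[str]) -> List[Tuple[str, str]]:
    """Pair sentences one-to-one by position (a naive assumption, see above)."""
    if len(source) != len(target):
        # A real aligner would try to detect 1-2 or 2-1 correspondences here.
        raise ValueError("texts have different sentence counts; naive alignment fails")
    return list(zip(source, target))


# Hypothetical sample data: a French text and its English translation.
french = ["Les corpus sont alignés.", "Chaque phrase a sa traduction."]
english = ["The corpora are aligned.", "Each sentence has its translation."]

bitext = align_by_position(french, english)
for fr, en in bitext:
    # Tab-separated output, one aligned sentence pair per line.
    print(f"{fr}\t{en}")
```

Each tuple in `bitext` is one aligned segment, which is roughly what a translation memory stores as a translation unit.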

Reference comments

19 hrs

Reference: DGT (not GDT!)

Reference information: As of November 2007, the European Commission's Directorate-General for Translation (DGT) made its multilingual Translation Memory for the Acquis Communautaire (the body of EU law) publicly accessible - a collection of parallel texts (texts and their translations, also referred to as bi-texts) in 22 languages. This is a page for technical users, where you will find a summary of this unique resource and instructions on where to download it and how to produce bilingual ***aligned corpora*** for any of the 231 language pairs (462 language pair directions).
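
The figures of 231 pairs and 462 directions follow directly from the 22 languages: 22 × 21 / 2 = 231 unordered pairs, and twice that when direction matters. A short sketch that checks the arithmetic by enumeration (the function name `count_language_pairs` is illustrative only, and languages are stood in for by plain integers):

```python
from itertools import combinations, permutations
from typing import Tuple


def count_language_pairs(n_languages: int) -> Tuple[int, int]:
    """Return (unordered pairs, ordered pair directions) for n languages."""
    languages = range(n_languages)  # placeholders for the actual language codes
    pairs = sum(1 for _ in combinations(languages, 2))       # e.g. FR-EN counted once
    directions = sum(1 for _ in permutations(languages, 2))  # FR>EN and EN>FR both counted
    return pairs, directions


pairs, directions = count_language_pairs(22)
print(pairs, directions)  # 231 462
```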