Manually aligned CES Polish-English parallel corpus

CESCorpus

ID:

441

A corpus of the Centre for Eastern Studies (CES) texts. This resource contains 56 Polish-English texts (6 CES reports, 28 issues of CES studies and 22 issues of the CES publication "Point of View") licensed under the CC-BY-NC license. The texts have been aligned manually on the sentence level using the MemoQ software. The resource is provided as TEI P5-compliant XML files with custom extensions and in the XLIFF and TMX formats.

Creation mode details: The texts were acquired as PDF and converted to plain text. Segmentation and manual alignment were performed using memoQ. Care was taken to represent all non-trivial translation equivalence types.