Croatian-English Parallel Corpus

Hr-En p-corp

ID: 303

The Croatian-English Parallel Corpus (Hr-En p-corp) is a unidirectional (Croatian to English) parallel corpus of contemporary standard Croatian, compiled from articles published in the Croatia Weekly newspaper from 1998 to 2000. The corpus samples were obtained entirely in digital form, converted to XML, aligned with Vanilla Aligner, manually checked, and stored in TMX format.
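
Since the corpus is stored in TMX format, aligned sentence pairs can in principle be read with any XML parser. The following is a minimal sketch in Python, not the corpus's own tooling. It assumes a conventional TMX layout (`tu` units containing `tuv` variants with a `seg` child) and the language codes `hr` and `en`; the actual files may use different codes or attributes. The sample segment is invented for illustration and is not taken from the corpus, and `read_pairs` is a hypothetical helper name.

```python
import xml.etree.ElementTree as ET

# ElementTree exposes the xml:lang attribute under the XML namespace URI.
XML_LANG = "{http://www.w3.org/XML/1998/namespace}lang"

# Invented sample in a typical TMX layout (assumption: not real corpus data).
SAMPLE_TMX = """<tmx version="1.4">
  <header srclang="hr" datatype="plaintext" segtype="sentence"
          adminlang="en" creationtool="example" creationtoolversion="0"
          o-tmf="none"/>
  <body>
    <tu>
      <tuv xml:lang="hr"><seg>Dobar dan.</seg></tuv>
      <tuv xml:lang="en"><seg>Good day.</seg></tuv>
    </tu>
  </body>
</tmx>
"""


def read_pairs(tmx_text, src="hr", tgt="en"):
    """Return (source, target) segment pairs from a TMX document string."""
    root = ET.fromstring(tmx_text)
    pairs = []
    for tu in root.iter("tu"):
        segs = {}
        for tuv in tu.findall("tuv"):
            # Older TMX versions use a plain "lang" attribute instead of xml:lang.
            lang = (tuv.get(XML_LANG) or tuv.get("lang") or "").lower()
            seg = tuv.find("seg")
            if seg is not None:
                # itertext() also collects text nested inside inline markup.
                segs[lang] = "".join(seg.itertext()).strip()
        # Keep only units that contain both languages.
        if src in segs and tgt in segs:
            pairs.append((segs[src], segs[tgt]))
    return pairs


if __name__ == "__main__":
    for hr_text, en_text in read_pairs(SAMPLE_TMX):
        print(hr_text, "|", en_text)
```

For a file on disk, the same function could be given the file's contents as a string; units missing either language are skipped rather than raising an error.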