Historical Lexicon of Polish

Abstract

The ground truth material for Polish consists of books published from 1617 to 1756, the Digital Library of Polish and Poland-Related News Pamphlets from 1570 to 1728.

Prior to IMPACT, there were practically no historical corpora of Polish, which caused various problems from the very beginning. One of them was the lack of standards for representing old Polish texts in Unicode, as several necessary characters and ligatures are not provided, neither by the Unicode proper nor by Medieval Unicode Font Initiative.

The primary resource was the Internet dictionary we shall refer to as the “Late Middle Polish dictionary”, its official name being “The dictionary of the Polish language of the sixteenth and the first half of the seventeenth century”.

The lexicon is available under . The download of this resource is available for members of the Impact Centre of Competence and registered users. Please, log in to download it or discover here how to become a member . For further information, please contact us.

Language resources

The Impact Centre of Competence provide historical and named entities lexica for the following languages. In addition, we offer access to the different corpora.