Distributional language models have consistently been demonstrated to capture semantic properties of words. However, research into the methods for evaluating the accuracy of the modeled semantics has been limited, particularly for less-resourced languages. This research presents three resources for evaluating the semantic quality of Finnish language distributional models: (1) semantic similarity judgment resource, as well as (2) a word analogy and (3) a word intrusion test set. The use of evaluation resources is demonstrated in practice by presenting them with different language models built from varied corpora.

About LinkĂ¶ping University Electronic Press LinkĂ¶ping University Electronic Press, LiU E-Press, is an Open Access publisher with the aim to make the research at LiU as visible as possible, internally, nationally and, most important, internationally and it is a part of the LiU marketing. LiU E-Press supply students and researchers with support and service about the publishing strategy at LiU.

We publish mainly research material produced at LinkĂ¶ping University and Region Ă–stergĂ¶tland such as Ph.D. and Licentiate theses, research articles, books, anthologies, chapter in books, reports and student theses.