Evaluation of Uryupina’s Coreference Resolution Features for Polish

Abstract

Coreference is usually defined as phenomenon consisting in different expressions relating to the same referent. Therefore automatic coreference resolution is an extremely difficult and complex task. It can be approached in two different ways: using rule-based tools or machine learning. This article is dedicated to the second approach and describes an evaluation of a set of surface, syntactic, discourse, salience and anaphoric features proposed by Uryupina and their usefulness for coreference resolution in Polish texts.

Keywords

The study was cofounded by the European Union from resources of the European Social Fund. Project PO KL “Information technologies: Research and their interdisciplinary applications”, Agreement UDA-POKL.04.01.01-00-051/10-00 and the Computer-based methods for coreference resolution in Polish texts project financed by the Polish National Science Centre (contract number 6505/B/T02/2011/40).