Valency corpus

Valentsikorpus

The Valency Corpus consists of orthographic passages from the Postimees daily, whose emotional tone (positive, negative, ambiguous, neutral) has been identified by readers. The identification was done using the method of dominant opinion (Pennebaker et al. 1997). The corpus is mainly intended to train statistical models, but it can also be used for other purposes. Queries can be done by rubrics (“Opinion“, “Estonia“, “Culture“, “Sports“, “Abroad“, “Criminal“) as well as by the emotional tone (positive, negative, ambiguous, neutral).