Croatian National Corpus v3.0

HNK v3.0

ID: 315

The Croatian National Corpus (HNK) is a representative corpus of written texts in the contemporary Croatian standard language published since 1990. The corpus is automatically lemmatised and tagged with morphosyntactic descriptions (MSDs). The documents are annotated with their genre, type and other information. The whole corpus is composed of faction, fiction and mixed texts. This is a pseudocorpus: only a query interface (the Bonito2 web interface) is available, while the original texts cannot be distributed for copyright reasons. The Bonito2 web interface supports complex queries through its elaborate query language, returning not only concordances but also word lists, collocations and other types of distributional data for tokens, lemmas and/or MSDs. This version of HNK features the Bonito2 web interface and additional texts.
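To illustrate the kinds of output mentioned above (concordances and frequency lists over tokens, lemmas or MSDs), the following is a minimal Python sketch. It does not use Bonito2 or any HNK data: the sample triples, the MSD strings, and the function names `frequency_list` and `concordance` are all hypothetical and serve only to show what such distributional data might look like.

```python
from collections import Counter

# Hypothetical mini-sample of (token, lemma, MSD) triples.
# This is NOT real HNK data; the MSD strings are illustrative only.
SAMPLE = [
    ("Vlada", "vlada", "Ncfsn"),
    ("je", "biti", "Var3s"),
    ("donijela", "donijeti", "Vmp-sf"),
    ("odluku", "odluka", "Ncfsa"),
    ("a", "a", "Cc"),
    ("vlada", "vlada", "Ncfsn"),
    ("je", "biti", "Var3s"),
    ("objavila", "objaviti", "Vmp-sf"),
    ("odluke", "odluka", "Ncfpa"),
]


def frequency_list(tokens, attr):
    """Count occurrences of one attribute: 'word', 'lemma' or 'msd'."""
    index = {"word": 0, "lemma": 1, "msd": 2}[attr]
    return Counter(t[index] for t in tokens)


def concordance(tokens, lemma, width=2):
    """Return (left context, keyword, right context) for each hit of a lemma."""
    lines = []
    for i, (_, lem, _) in enumerate(tokens):
        if lem == lemma:
            left = " ".join(t[0] for t in tokens[max(0, i - width):i])
            right = " ".join(t[0] for t in tokens[i + 1:i + 1 + width])
            lines.append((left, tokens[i][0], right))
    return lines
```

Calling `frequency_list(SAMPLE, "lemma")` groups the inflected forms "odluku" and "odluke" under one lemma, whereas `frequency_list(SAMPLE, "word")` keeps them apart; `concordance(SAMPLE, "odluka")` lists each hit with its neighbouring tokens.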