Building Russian Word Sketches as Models of Phrases

The paper describes the writing of Sketch Grammar for the Russian language as a part of the Sketch Engine system. The Sketch Engine representing itself a corpus tool which takes as input a corpus of any language and corresponding grammar patterns. The system gives information about a word’s collocability on concrete dependency models, and generates lists of the most frequent phrases for a given word based on appropriate models. The papers deals with different approaches to writing rules for the grammar, based on morphological and syntactic information, and also with applying word sketches to the Russian language. The results show that word sketches and information about collocation behaviour could facilitate lexicographic work with the Russian language.