The Database of the Lithuanian Nominal Phrases Dictionary is the first corpus-based dictionary of Lithuanian multi-word units (MWUs).
The lexicon covers a wide variety of multi-word units: collocations, phrasal combinations, and longer pieces of text (sometimes even several sentences).
The nominal phrases were identified using gravity counts computed over the 100-million-word Contemporary Corpus of the Lithuanian Language. The corpus reflects written Lithuanian from 1991 to 2002.
The automatically identified list of MWUs was edited by linguists, who kept only meaningful and grammatically correct phrases that include at least one noun.
The dictionary comprises almost 69 thousand nominal phrases.

Query

Whole words

You can search for a specific word (word form), e.g. kompiuteris or kompiuterio.

Start of the word

Entering the beginning of a word, e.g. kompiuteri, returns phrases containing word forms such as kompiuteris, kompiuteriais, kompiuterijos, kompiuterinių, etc.

Middle of the word

Entering part of a word, e.g. kel, returns phrases containing word forms such as keliu, iškelta, paskelbtas, pakelį, runkeliai, etc.

End of the word

Entering the end of a word, e.g. ris, returns phrases containing word forms such as seseris, sandoris, duris, kuris, numeris, etc.
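
The four word-level options above can be pictured with a minimal Python sketch. It is not the database's actual implementation: the function names, the mode labels and the sample phrases are invented for illustration, and it assumes that "Middle of the word" matches the query anywhere inside a word (which is what the example keliu suggests).

```python
def word_matches(word, query, mode):
    """Return True if `word` matches `query` under the given search mode."""
    if mode == "whole":
        return word == query
    if mode == "start":
        return word.startswith(query)
    if mode == "middle":
        # Assumption: the query may occur anywhere inside the word.
        return query in word
    if mode == "end":
        return word.endswith(query)
    raise ValueError(f"unknown mode: {mode}")


def search(phrases, query, mode):
    """Return the phrases in which at least one word matches the query."""
    return [p for p in phrases
            if any(word_matches(w, query, mode) for w in p.split())]


# Invented sample phrases, not actual dictionary entries.
phrases = [
    "asmeninis kompiuteris",
    "kompiuterinių programų",
    "telefono numeris",
    "iškelta byla",
]

print(search(phrases, "kompiuteris", "whole"))
print(search(phrases, "kompiuteri", "start"))
print(search(phrases, "kel", "middle"))
print(search(phrases, "ris", "end"))
```

In this sketch a phrase is returned as soon as any one of its words matches, which mirrors how the options above describe results as phrases containing the matching word forms.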

Phrase

You can search for one or more words. To search for parts of words, mark the missing part with an asterisk (e.g. *nam, nam*, *nam*).
When searching for phrases of two or more words, you can give only parts of those words (e.g. nam* gamyb*).
You can also put a space before and after a word (e.g. space, namas, space); the query then returns phrases with words before and after namas.
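
As a rough illustration of the asterisk patterns, the Python sketch below turns each pattern word into a regular expression and looks for consecutive words in a phrase that match. The function names and sample phrases are hypothetical, and the sketch assumes that pattern words must match neighbouring words in the same order; the real search may behave differently. The handling of surrounding spaces is left out.

```python
import re


def token_to_regex(token):
    """Compile one pattern word; '*' stands for any run of non-space characters."""
    parts = [re.escape(part) for part in token.split("*")]
    return re.compile(r"\S*".join(parts))


def phrase_matches(phrase, pattern):
    """Return True if consecutive words of `phrase` match the pattern words in order."""
    regexes = [token_to_regex(t) for t in pattern.split()]
    if not regexes:
        return False
    words = phrase.split()
    n = len(regexes)
    # Slide a window of n words over the phrase.
    for i in range(len(words) - n + 1):
        window = words[i:i + n]
        if all(rx.fullmatch(w) for rx, w in zip(regexes, window)):
            return True
    return False


# Invented sample phrases, not actual dictionary entries.
print(phrase_matches("namų gamybos įmonė", "nam* gamyb*"))
print(phrase_matches("gyvenamasis plotas", "*nam*"))
print(phrase_matches("gyvenamasis plotas", "nam*"))
```

Under these assumptions the first two calls match, while the third does not, because no word in that phrase begins with nam.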

Lemma

Entering the lemma of a word, e.g. ministras, returns phrases containing other forms of this lemma, e.g. ministro, ministrą, ministru, etc.
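
A lemma search needs some way of mapping word forms back to their lemma. The Python sketch below assumes a small hand-written lookup table; the table, the function name and the sample phrases are hypothetical and only illustrate the idea, not how the database stores its morphological information.

```python
# Hypothetical form-to-lemma table covering only the forms mentioned above.
FORM_TO_LEMMA = {
    "ministras": "ministras",
    "ministro": "ministras",
    "ministrą": "ministras",
    "ministru": "ministras",
}


def lemma_search(phrases, lemma, form_to_lemma=FORM_TO_LEMMA):
    """Return phrases containing any word form that belongs to `lemma`."""
    return [p for p in phrases
            if any(form_to_lemma.get(w) == lemma for w in p.split())]


# Invented sample phrases, not actual dictionary entries.
phrases = [
    "ministro pirmininko",
    "užsienio reikalų ministrą",
    "finansų ministerija",
]

print(lemma_search(phrases, "ministras"))
```

Here ministerija is not returned, because it is a different lemma even though it shares the same beginning.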

First Word

You can search by the first word of a phrase: entering atominė, for example, returns atominė bomba, atominė elektrinė, atominė energetika, etc.
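
The difference from a whole-word search is that only the first position of the phrase is checked. A minimal Python sketch, with a hypothetical function name and a sample list that adds one invented phrase to the examples above:

```python
def first_word_search(phrases, word):
    """Return phrases whose first word is exactly `word`."""
    return [p for p in phrases if p.split()[:1] == [word]]


phrases = [
    "atominė bomba",
    "atominė elektrinė",
    "atominė energetika",
    "Ignalinos atominė elektrinė",  # invented: atominė is not the first word
]

print(first_word_search(phrases, "atominė"))
```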