1. Introduction

MorphoDiTa: Morphological Dictionary and Tagger is an open-source tool for
morphological analysis of natural language texts. It performs morphological
analysis, morphological generation, tagging and tokenization and is distributed
as a standalone tool or a library, along with trained linguistic models. In the
Czech language, MorphoDiTa achieves state-of-the-art results with a throughput
around 10-200K words per second. MorphoDiTa is a free software under
Mozilla Public License 2.0 and the linguistic models
are free for non-commercial use and distributed under
CC BY-NC-SA license, although for some
models the original data used to create the model may impose additional
licensing conditions. MorphoDiTa is versioned using Semantic Versioning.

Copyright 2014 by Institute of Formal and Applied Linguistics, Faculty of
Mathematics and Physics, Charles University in Prague, Czech Republic.

3. Release

3.1. Download

MorphoDiTa releases are available on GitHub, both as
source code and as a pre-compiled binary package. The binary package contains Linux,
Windows and OS X binaries, Java bindings binary, C# bindings binary, and source
code of MorphoDiTa and all language bindings). While the binary
packages do not contain compiled Python or Perl bindings, packages for those
languages are available in standard package repositories,
i.e. on PyPI
and CPAN.

3.1.1. Language Models

To use MorphoDiTa, a language model is needed. The language models are available
from LINDAT/CLARIN infrastructure and described further
in the
MorphoDiTa User's Manual.
Currently the following language models are available:

3.2. License

MorphoDiTa is an open-source project and is freely available for non-commercial
purposes. The library is distributed under
Mozilla Public License 2.0 and the associated models and data
under CC BY-NC-SA, although
for some models the original data used to create the model may impose
additional licensing conditions.