Applications written in Caml

Scientific software

Mana is a Japanese input engine, which converts phonetic
representation to proper written forms. Mana models Japanese language
as a HMM (Hidden Markovian Model), and its output is quite natural
even for a long sentence. In addition to Japanese, mana can be used
for Chinese once we have a necessary data.

Courses

Libraries

Native OCaml libraries

This package contains the OCaml Interface to WordNet 2.1. It enables Ocaml programs to use
the Wordnet dictionary for (english) word forms and meanings. This library directly parses the WordNet dictionary files, and does not depend on any other libraries. This library requires WordNet 2.1 to be installed (WordNet is available as a free download).

This library can be used to segment Newspaper style (english) text articles into sentences. It uses the Ocaml lexer with a simple set of lexical rules to classify those periods ('.') in the text that end sentences.

This toolkit offers a few fundamental structures useful for computational linguistics. It is implemented in Pidgin ML, which is a core subset of the Objective Caml programming language under the so-called revised syntax.