Abstract

In the context of the FASiL project, we have studied natural language interactions
in a unimodal (speech only) and multimodal (speech and graphics) interface to a personal information management database. We collected multilingual corpora to investigate these interactions in three languages:
Portuguese, English and Swedish. The corpora are used to train language models,
to update acoustic models, to study semantic concepts, multimodal interactions, and dialogue
management strategies.
The corpora are annotated in a uniform way, with timings, transcriptions, and
semantics. In this paper, we report on the structure and design of the corpora which are
now available via ELRA.