Audiovisual

Abstract:
The GF Eclipse Plugin provides an integrated development environment (IDE) for developing grammars in the Grammatical Framework (GF).
Built on top of the Eclipse Platform, it aids grammar writing by providing instant syntax checking, semantic warnings and cross-reference resolution.
Inline documentation and a library browser facilitate the use of existing resource libraries, and compilation and testing of grammars is greatly improved through single-click launch conﬁgurations and an in-built test case manager for running treebank regression tests.
This IDE promotes grammar-based systems by making the tasks of writing grammars and using resource libraries more efficient, and provides powerful tools to reduce the barrier to entry to GF and encourage new users of the framework.

Abstract:
AceWiki-GF combines AceWiki with Grammatical Framework (GF) in order to make CNL-based semantic wikis multilingual.
The content languages of the wikis are defined by multilingual GF grammars. Each wiki article is stored in a language-neutral format but is viewable and editable via any of the languages supported by the grammar. The source of the grammar is integrated into the wiki and editable by the users.
The following screencast demonstrates the main features of AceWiki-GF using a geography wiki that is based on a multilingual version of Attempto Controlled English (ACE). The syntax of a large subset of ACE has been implemented in GF and ported to many languages using the GF Resource Grammar Library, resulting in the multilingual ACE-in-GF grammar. The ACE-based wikis offer automatic semantic consistency checking and question answering based on OWL reasoning via a meaning-preserving mapping from ACE to OWL.
<img src="/sites/default/files/Screencast wp11.png" width="80%" />
<p>You can download the screencast in the following formats:
<a href="http://attempto.ifi.uzh.ch/site/docs/screencast_acewikigf.webm">webm</a>,
<a href="http://attempto.ifi.uzh.ch/site/docs/screencast_acewikigf.mp4">mp4</a>.</p>

Abstract:
MOLTO‘s goal is to develop a set of tools for translating texts between multiple languages in real time with high quality. Prototypes covering up to 15 simultaneous languages are being built. As its main technique, MOLTO uses domain-specific semantic grammars and ontology-based interlinguas. These components are implemented in GF (Grammatical Framework), which is a grammar formalism where multiple languages are related by a common abstract syntax. MOLTO also develops hybrid methods where statistical machine translation is used for increasing robustness and for automatically extracting parts of translation grammars.
MOLTO runs from March 2010 to May 2013. The talk will explain the main ideas of MOLTO and show some highlights of its results from the first two years. Note: many ideas in MOLTO originate in the Multilingual Document Authoring project at XRCE in which Ranta worked in 1998-1999.

Abstract:
gfsage is an application of the Mathematical Grammar Library to multimodal input and output of software: it shows how it is possible to use natural language to issue commands to the computer algebra system Sage. The natural language answers can be also rendered aurally.

Abstract:
gfsage is an application of the Mathematical Grammar Library to multimodal input and output of software: it shows how it is possible to use natural language to issue commands to the computer algebra system Sage. Here we show how to customize separately the input and output language, e.g. ask in Spanish and let Sage answer in English.

Abstract:
The multilingual prototype for learning how to model word problems is being demonstrated in this video. This prototype shows how MOLTO technologies can be used to implement an assistant for solving word problems stated in natural languages, Spanish and English.

Abstract:
This Phrasebook is a program for translating touristic phrases between 14 European languages: Bulgarian, Catalan, Danish, Dutch, English, Finnish, French, German, Italian, Norwegian, Polish, Romanian, Spanish, Swedish. The Phrasebook is implemented in the Grammatical Framework programming language as the first demonstration for the MOLTO EU project (moltoproject.eu) and will be extended during the project.

Abstract:
In this paper we present an ongoing effort in the computational linguistic field to develop multilingual assistive technologies for mathematics. The approach is based on the Grammatical Framework used in combination with UMCL and semantic encodings of mathematics like OpenMath and MathML3. We review the formats for supporting alternative modalities of presentation of mathematics in order to evaluate the effectiveness of the proposed approach. Finally we describe a very initial prototype built using off-the-shelf speech synthesis to interface to the Sage suite of computer algebra systems.

Abstract:
The GF Eclipse Plugin provides an integrated development environment (IDE) for developing grammars in the Grammatical Framework (GF). Built on top of the Eclipse Platform, it aids grammar writing by providing instant syntax checking, semantic warnings and cross-reference resolution. Inline documentation and a library browser facilitate the use of existing resource libraries, and compilation and testing of grammars is greatly improved through single-click launch configurations and an in-built test case manager for running treebank regression tests. This IDE promotes grammar-based systems by making the tasks of writing grammars and using resource libraries more efficient, and provides powerful tools to reduce the barrier to entry to GF and encourage new users of the framework.

Abstract:
In this paper we report on our ongoing work in the EU project Multilingual Online Translation (MOLTO), supported by the European Union Seventh Framework Programme under grant agreement FP7-ICT-247914. More specifically, we present work workpackage 8 (WP8): Case Study: Cultural Heritage. The objective of the work is to build an ontology-based multilingual application for museum information on the Web. Our approach relies on the innovative idea of Reason-able View of the Web of linked data applied to the domain of cultural heritage. We have been developing a Web application that uses Semantic Web ontologies for generating coherent multilingual natural language descriptions about museum objects. We have been experimenting with museum data to test our approach and find that it performs well for the examined languages.

Abstract:
We describe a semantic wiki system with an underlying controlled natural
language grammar implemented in Grammatical Framework (GF).
The grammar restricts the wiki content to a well-defined
subset of Attempto Controlled English (ACE), and facilitates a precise
bidirectional automatic translation between ACE
and language fragments of a number of other natural languages,
making the wiki content accessible multilingually.
Additionally, our approach allows for automatic translation into the Web
Ontology Language (OWL), which enables automatic reasoning over the wiki content.
The developed wiki environment thus allows users to build, query and view
OWL knowledge bases via a user-friendly multilingual natural language interface.
As a further feature,
the underlying multilingual grammar is integrated into the wiki
and can be collaboratively edited to extend the vocabulary of the wiki or even
customize its sentence structures.
This work demonstrates the combination of the existing technologies of
Attempto Controlled English and Grammatical Framework, and is implemented
as an extension of the existing semantic wiki engine AceWiki.

Abstract:
In this paper we propose the components, the general architecture and
application areas for a controlled natural language based multilingual semantic
wiki. Such a wiki is a collaborative knowledge engineering environment that
makes its content available via multiple languages, both natural and formal, all of
them synchronized via their abstract form that is assigned by a shared grammar.
We also describe a preliminary implementation of such a system based on the
existing technologies of Grammatical Framework, Attempto Controlled English,
and AceWiki.

Abstract:
The goal of the tutorial is to give the knowledge needed for building GF applications or starting a new language implementation. The material is divided into three one-hour lectures:
The main concepts of GF and multilingual grammars
Building morphology implementations and lexica
Implementing syntactic rules for generation, parsing, and translation

Abstract:
This paper is devoted to present the Mathematics Grammar Library, a system for multilingual mathematical text processing. We explain the context in which it originated, its current design and functionality and the current development goals. We also present two prototype services and comment on possible future applications in the area of artificial mathematics assistants.

Abstract:
This volume is a collection of papers presented at FreeRBMT12, the Third International Workshop on Free/Open-source Rule-based Machine Translation, held in Gothenburg, Sweden, on 13-15 June 2012. The FreeRBMT series of workshops aims to bring together the experience of researchers and developers in the ﬁeld of rule-based machine translation who have decided to get on board the free/open-source train and are eﬀectively contributing to creating a commons of explicit knowledge: machine translation rules and dictionaries, and machine translation systems whose behaviour is transparent and clearly traceable through their explicit logic.
The workshops are also open for hybrid systems, which combine statistical and rule-based translation methods. The six papers in this volume address general methods and tools (Papers 1, 3, 4), systems for particular languages (Papers 2, 5), and evaluation (Paper 6). In addition to the contributed papers, the workshop featured an invited talk, The New Machine Translation — Getting blood from a stone by Martin Kay, as well as tutorials, demos, and discussions.

Abstract:
As the amount of cultural data available on the Semantic Web is expanding, the demand of accessing this data in multiple languages is increasing.
Previous work on multilingual access to cultural heritage information has shown that mapping from ontologies to natural language requires at least two different steps: (1) mapping multilingual metadata to interoperable knowledge sources; (2) assigning multilingual knowledge to cultural data.
This paper presents our work on making cultural heritage content available on the Semantic Web and accessible in 15 languages. The objective of our work is both to form queries and to retrieve semantic content in multiple languages. We describe our experiences with processing museum data extracted from two different sources, harmonizing this data and making its content accessible in natural language.

Deliverable

Abstract:
This phrasebook is a program for translating touristic phrases between 14 European languages included in the MOLTO project (Multilingual On-Line Translation): Bulgarian, Catalan, Danish, Dutch, English, Finnish, French, German, Italian, Norwegian, Polish, Romanian, Spanish, Swedish. A Russian version is not yet finished but will be added later. Also other languages may be added.
The phrasebook is implemented by using the GF [4] programming language (Grammatical Framework). It is the first demo for the MOLTO project, released in the third month (by June 2010). The first version is a very small system, but it will extended in the course of the project.
The phrasebook is available as open-source software, licensed under GNU LGPL, at http://code.haskell.org/gf/examples/phrasebook/[5].

Abstract:
This report describes the implementation of a large part of the Attempto Controlled English (ACE) syntax --- the subset of ACE that is accepted by the AceWiki semantic wiki system --- in Grammatical Framework (GF) and making it available via 10 languages that are supported by the GF Resource Grammar Library (RGL). As a result, ACE becomes available in multiple languages, making ACE-based knowledge representation possible also in languages other than English. Additionally, the GF-based implementation of the ACE language provides ACE users with new (GF-based) editing tools.

Abstract:
This report describes the user evaluation of two related software products --- ACE-in-GF and AceWiki-GF. The multilingual grammar ACE-in-GF is implemented in the Grammatical Framework (GF) with the goal to provide a multilingual interface to a large subset of Attempto Controlled English (ACE). We measure the accuracy with which the ACE-in-GF grammar translates ACE sentences to the other languages that it implements, and show that its translations are preferred to the translations obtained with a state-of-the-art statistical translation system. The semantic wiki engine AceWiki-GF enables collaborative knowledge engineering environments that are based on controlled natural language and implemented as GF grammars. We set AceWiki-GF up with the ACE-in-GF grammar and a geography domain lexicon, and then ask speakers of different languages to supply the wiki with geographical knowledge. We show that the automatic translation does not affect the basic functioning of the wiki: users who view the wiki content in a language different from that in which it was originally written are as likely to agree or disagree on the verity of the content than users who view the content in the same language.

Abstract:
Annual report on activities carried out in the framework of the MOLTO EU project. This report is designed for Web publishing, for a broad public outside the consortium. It documents the main results obtained by the MOLTO project during the first two years of activity and promotes the objectives of the project. MOLTO’s goal is to develop a suite of tools for translating texts between multiple languages in real time with high quality. MOLTO uses domain specific semantic grammars and ontology-based interlinguas implemented in GF [2] (Grammatical Framework), a grammar formalism where multiple languages are related by a common abstract syntax. Until now GF [2] has been applied in several small to-medium size domains, typically targeting up to ten languages, but during MOLTO we will scale this up in terms of productivity and applicability by increasing the size of domains and the number of languages.
MOLTO aims to make its technology accessible to domain experts who lack GF [2] expertise so that building a multilingual application will amount to just extending a lexicon and writing a set of example sentences. The most research-intensive parts of MOLTO are the two-way interoperability between ontology standards (such as OWL and RDF) and GF [2] grammars and the extension of rule-based translation by statistical methods. The OWL-GF [2] interoperability enables multilingual natural language based interaction with machine-readable knowledge while the statistical methods add robustness to the system when desired. MOLTO technology is released as open-source libraries for third-party translation tools and web pages and thereby fits into standard workflows.

Abstract:
The final dissemination and explotation report discusses how the project MOLTO has informed the public of the results. The industrial partners of the Consortium, Ontotext and Be Informed are the main contributors to the exploitation plan for the technologies developed by MOLTO. Exploitation of MOLTO aims to pursue sustainability for the tools and technologies and to further their uptake.

Abstract:
MOLTO’s goal is to develop a set of tools for translating texts between multiple languages in real time with high quality. MOLTO uses domainspecific semantic grammars and ontology-based interlinguas implemented in GF (Grammatical Framework), a grammar formalism where multiple languages are related by a common abstract syntax. GF has been applied in several small-to-medium size domains, typically targeting up to ten languages but MOLTO will scale this up in terms of productivity and applicability by increasing the size of domains and the number of languages. MOLTO aims to make the technology accessible for domain experts without GF expertise and to reduce the effort needed for building a translator to just extending a lexicon and writing a set of example sentences.
The most research-intensive parts of MOLTO are the two-way interoperability between ontology standards (OWL) and GF grammars, and the extension of rule-based translation by statistical methods. The OWL-GF interoperability will enable multilingual natural-language-based interaction with machine-readable knowledge while the statistical methods will add robustness to the system when desired.
MOLTO technology will be released as open-source libraries for standard translation tools and web pages and thereby fit into standard workflows.

Abstract:
During the review on March 20, 2012, an appendix was requested to better specify the methodology that MOLTO intends to adopt to carry evaluation of the work and results related to each workpackage. This document tries to clarify the goals and how they will be achieved in Workpackage 9.

Abstract:
This document is the written report of the first deliverable corresponding to WP7, <i>Case Study: Patents</i>.
It describes the preliminar prototype for patent translation and retrieval.
First, there is a general overview of the workpackage and we briefly summarize the scenarios considered within the prototype.
Then, we give the general layout of the prototype architecture, the demonstrator interface and the technologies integrated in the prototype.
Finally, we summarise the current status of the workpackage and the future directions for the final prototype.

Abstract:
This document is D4.3A, an annex to the D4.3 deliverable of WP4 in the scope of the MOLTO project.
It presents a final overview of the prototypes built in the scope of MOLTO, with respect to grammar-ontology interoperabilty. Also, it describes the further work on the topic, after M24 of the project, and gives some requested details on previous work. Next, the annex aims to address the reviewers' remarks and recommendations from their last report.
Finally, it serves to present a general overview of the achievements on the topic in MOLTO.

Abstract:
This report describes our work to extend the existing semantic wiki engine AceWiki — which is based
on the controlled natural language (CNL) ACE — with multilinguality features. In our approach, the
underlying multilingual CNL grammar is implemented in Grammatical Framework (GF). The grammar
facilitates precise automatic translation between different natural languages defined by the grammar,
making the wiki content multilingual. The underlying grammar itself is integrated into the wiki and can
be collaboratively edited. We discuss the current implementation of the system and its use cases.

Abstract:
The present document is Deliverable D7.2 of WP7.
It gives a description of the multilingual patents retrieval prototype produced in this workpackage and a brief user manual to access the demo.
The main highlights achieved in the prototype with respect to the beta version described in the Deliverable 7.1\cite{d71} are the following:
a) The demo allows for querying the system in the three languages addressed in this WP (English, French and German);
b) the patents in the database has original text in English, French and German and also the translated documents for all missing languages of each document;
c) the patent document translation can be done following a simple pipeline;
d) some improvements on the interface addressed several deficiencies detected during internal evaluation;
e) the new query library and its application to the patents use case have been presented at the Third Workshop on Controlled Natural Language (CNL 2012 (http://attempto.ifi.uzh.ch/site/cnl2012/), being held in Zurich at the end of August 2012.

Abstract:
In this deliverable we document the web services that have been provided by the MOLTO project. Many of them have been released with dedicated deliverables, for those we do not enter into the specific details. Instead we focus on the web services powering some of the MOLTO flagships at the end of the project's lifetime.

Abstract:
This document outlines the evaluation results from the verbalization techniques adopted in the verbalization component for the Be Informed Business Platform based on MOLTO Technologies in WP12 of the MOLTO project. First this document will focus on the evaluation of the adoption of GF technologies within our development department. Secondly results of the actual verbalizations will be presented and discussed.

Abstract:
This document presents the specification of the Knowledge Representation Infrastructure (KRI), which is based on pre-existing products. The KRI ensures a mature basis for storage and retrieval of both knowledge and content, covering all modalities of the data. The document provides descriptions of the technology building blocks, overall architecture, standards used, query languages and inference rules.

Abstract:
GF, Grammatical Framework, is a programming language for multilingual grammars, used in the MOLTO project to build translation systems. How to write GF grammars is specified in the numerous tutorials and manuals available via http://grammaticalframework.org. This report explains the compiler of the GF language: how GF source code is compiled to various formats usable in runtime systems, how the developers can test their grammars, how other formats (such as lexica and example sentences) can be converted to GF code, how to call the compiler in various ways.

Abstract:
This final report describes the evaluation of translation quality in five MOLTO use-cases that implement the Grammatical Framework (GF) for multilingual text generation and translation. Evaluations were made by using both automatically calculated metrics and manually by human volunteers.

Abstract:
In this paper we are concerned with the analysis of normative conflicts, or the detection of conflicting obligations, permissions and prohibitions in normative texts written in a Controlled Natural Language (CNL). For this we present AnaCon, a proof-of-concept system where normative texts written in CNL are automatically translated into the formal language CL using the Grammatical Framework (GF). Such CL expressions are then analysed for normative conflicts by the CLAN tool, which gives counter-examples in cases where conflicts are found. The framework also uses GF to give a CNL version of the counter-example, helping the user to identify the conflicts in the original text. We detail the application of AnaCon to two case studies and discuss the effectiveness of our approach.

Presentation

Abstract:
The demo presents a semi-supervised machine learning process, where new term dictionary candidates may be found in given text, by finding closest matches in previously known ontologies. (i.e. hierarchical vocabulary, term structure, usually industry or domain specific). We show how a corpus-harvested new term can be aligned with its closest matches in an prior existing term ontology. New term's functional and semantic environment is analyzed, and the feature variables extracted are compared to values of previously known terms. The user is given the supervision control to decide the best alignment match and thus refine the ontology incrementally.

Slide Presentation

Abstract:
This presentation will focus on the efforts of adoption GF for the business modelling tool from Be Informed, the Business Process Platfrom. As an industry partner of the MOLTO project we integrated GF in the product verbalisation component and evaluated its usability. Furthermore based on the obstacles we've encountered we developed a new approach that we think will increase chances of getting more traction from industry partners for GF.

Abstract:
In this talk we review the state of turning Grammatical Framework to a hybride combining lingustic knowledge with statistical evidence. Our statistical parser already outperforms the existing state of the art parsers for related formalisms. For one and the same grammar we show a speed-up of up to two orders of magnitude without sacrificing the quality. In addition the grammarians can still use the high-level GF language for developing their grammars. In MOLTO we have promised a scale of hundreds of lemmas while in this experiment we have shown that we can scale to thousands. On top of the new hybride we have built an experimental robust translator which is based entirely on GF.

Abstract:
Most people traditionally write GF grammars with generic text editors, but there are a number of motivations for using more sophisticated environments. A short presentation of the new GF plugin for Eclipse IDE, what it can do and where we would like to take it.

Abstract:
We present AceWiki-GF, a multilingual semantic wiki system where the content is expressed in GF-implemented controlled natural languages. Our main use case is a wiki that uses Attempto Controlled English (ACE). We implemented a large fragment of ACE using GF and its resource grammar library, making ACE available via multiple natural languages. We discuss the features of this wiki and its user evaluation.

Abstract:
We present a prototype of the hybrid English-French
translation system for patents, which comprises work on name entity
recognition, tokenizing numerals, a grammar for parsing and generating
claims and multilingual lexicon extraction. Perspectives on future
work will be also presented and discussed.

Abstract:
This presentation reports the work related to patent translation within MOLTO. The domain has been specially choosen for the opening of the main translator (GF) to non-restricted language. Here we show how we build a hybrid translation engine between GF and SMT and its applications. Also the main characteristics of the on-line patent retrieval and translation prototype are outlined.

Abstract:
Rule-based translation systems are usually adequate for close languages and/or restricted domains. MOLTO aims to widen this scope by hybridising a GF system with a statistical one. The domain of application, patents, can be considered a quasi-open domain, that is, a GF grammar cannot cover the whole language and statistical methods must go for coverage and robustness.
The first part of this talk introduces the Patents Case Study, the nature of the data, and its use within a GF translation system and a SMT system. Some preliminary results for these systems are shown. The second part describes various hybridisation methods that will be applied within the project and also similar approaches that we are carrying out for other language pairs.

Abstract:
In MOLTO we have exploited Grammatical Framework as means for generation of controlled query languages for interactive systems. The latter include information retrieval systems over semantic repositories and a tool for replying mathematical queries in natural language(WP4, WP6, WP7 and WP8). The focus of the presentation is the grammar-ontology interoperability, where we share our experience with generation of SPARQL and verbalizing RDF replies with the help of GF. We summarize the use cases of query technologies in MOLTO and demonstrate the prototype of the patents use case, where GF grammars are used for query language and for means of generation of SPARQL queries.

Abstract:
Translation systems in MOLTO are based on multilingual grammars written in GF (Grammatical Framework). The traditional environment available to GF grammar developers consist of the GF command shell, a generic text editor and the GF documentation. This is a simple and effective environment for the experienced grammar developer. To better support less experienced grammar developers, one of the goals of the MOLTO project is to create an IDE (Integrated Development Environment) for GF grammar development.
In this talk we will show a prototype of a web-based grammar development environment that provides the same core functionalities as the traditional environment: grammar editing, grammar compilation, error detection, testing and visualization. It enables the creation of web-based translation systems without installation of any software and thus gives quick access to GF to novice and occasional users.

Abstract:
Translation systems in MOLTO are based on multilingual grammars written in GF
(Grammatical Framework). The traditional environment available to GF grammar
developers consist of the GF command shell, a generic text editor and the GF
documentation. This is a simple and effective environment for the experienced
grammar developer. To better support less experienced grammar developers, one of
the goals of the MOLTO project is to create an IDE (Integrated Development
Environment) for GF grammar development, and an appropriate GF compiler API to
support the IDE.
In this talk we will first give a summary of the main functionalities of the
traditional GF tools: grammar compilation, error detection, testing, and
visualization. Then we will show a prototype of a web-based grammar development
environment and briefly discuss the wide range of options available when
building a GF IDE, and how this might affect what you require from the GF
compiler API.
The web-based environment will enable the creation of web-based translation
systems without installation of any software. It gives the quickest conceivable
access to GF to novice and occasional users. But the API is general enough even
to serve power users, who run the tools on their own favourite desktop
environment but want to profit from IDE facilities such as project management
and library browsing.

Abstract:
A 2-day GF course was given at Be Informed (Apeldoorn) 11-12 Dec 2012 by Kaarel Kaljurand from UZH. The course covered all the material of the existing GF tutorials and additionally presented some GF-based applications (multilingual CNL-based semantic wiki system developed in MOLTO, and well as GF-based speech recognition grammars and smart phone applications).

Abstract:
The goal of the MOLTO project is to produce automatic tools that will offer high quality, real-time translation in multiple languages with the intended target user of these tools being a producer of information wishing to offer online information in multiple languages. This scenario calls for a high standard of quality: a translation with both high accuracy of content and fluency of form, ready for publication. Furthermore, the MOLTO project aims to provide tools that are extensible to new domains within months or even days rather than the years normally required for building such a system, and without deep technological knowledge. These goals point to the specific aspects of quality that are central to the evaluation of the MOLTO tools: text quality in terms of fidelity and fluency of the translation, coverage of each domain in question and usability in terms of work needed to build and extend the system.
This presentation will discuss both manual and automated evaluation approaches and technology. The first part of the presentation illustrates manual approaches with a specific focus on a machine translation post-editing task carried out by the University of Helsinki and University of Turku. The second part of the presentation discusses automated evaluation technology and its implementation within The Asiya Open Toolkit for Automatic MT (Meta-)Evaluation developed at the Universitat Politècnica de Catalunya. Some preliminary evaluation results using this software on patent translations are given as well.

Abstract:
We present methods of lexicon extraction from different sources, for different languages and use cases.
Lexicon in GF is a part of the grammar, and it includes information of the inflection and the syntactical behaviour of the words. For that reason, lexicon extraction needs sources that provide the base form, the inflectional paradigm and the valency of the word. In addition to different sources, we consider different types of lexicons: monolingual and multilingual, as well as uni-sense (one lemma, one meaning) and multi-sense (entry for each sense of the lemma). The sources include WordNet, morphological lexicons, Wiktionary, domain ontologies and phrase tables for SMT systems.

Abstract:
In this presentation we describe our work on building an ontology-based grammar application for communication of museum content on the Semantic Web. We demonstrate the cross-language retrieval and representation system which we made available in 15 languages using Semantic Web technologies.

Abstract:
This demo is a short work-in-progress presentation of importing RDF-based term ontologies in the R statistical software. I introduce an interface toolkit which allows R scripts to read term ontologies for statistical analysis of term entries.
Second, I present an interactive prototype for statistical exploration of parallel term ontologies. For evaluation purposes, I show a sample of term hits in a text corpus. For disambiguation of these corpus hits, I explain an attempt to categorize the hits with syntactic dependency structures.
Finally, I discuss how these attempts benefit importing term ontologies into machine translation and affect developing language models.

Abstract:
This is a presentation of the main results from MOLTO's Work package 2, Grammar Tools. It covers the basic ideas of the GF programming language, its standard libraries (the Resource Grammar Library, available for 28 languages), development environments, documentation, and community.

Abstract:
Abstract from: http://www.flov.gu.se/forskning/forskningsomraden/logik/logikseminariet/
That languages have a common semantical foundation in logic is an old idea. In a letter to Mersenne in 1629, Descartes proposed a universal formal language that would serve as the basis of translation between languages; this idea was developed further by Leibniz in his "characteristica universalis". In modern times, systems like UNL and SUMO are used as interlinguas for machine translation, much in the same spirit. On the other hand, critics like Sapir and Whorf have rejected the idea of a common semantical foundation of languages, and the main stream of machine translation is based on statistics on co-occurrences of words and phrases, with no reference to semantics; Google translate (http://translate.google.com) is an extremely successful example of this approach. In the talk, I will summarize the main arguments underlying the different approaches to translation. Then I will present an approach which, in a sense, builds on Descartes's idea of a universal formal language, but takes into account the criticisms. The key idea is to distinguish between different levels of abstraction in language: on some levels, translation is clearly impossible, whereas on some other levels, it is clearly possible and the problems are technical rather than philosophical. The approach has been proven in several applications, and it is currently developed further in the European project MOLTO (Multilingual On-Line Translation, http://www.molto-project.eu).

Abstract:
We present the results of an evaluation of the translation quality in the MOLTO use cases, namely, the tourist phrasebook, ACE-in-GF and AceWiki, patents and mathematics. The results for cultural heritage are still pending. All evaluations were
made by native or near-native level speakers of each language, 34 evaluators in all. The MOLTO translations were compared to Google, Bing and Systran, and the evaluators were to choose the best, either to accept as it is or for post-editing. Both automatic evaluation metrics and the percentage of the evaluators' preferred translations suggest that MOLTO method fares significantly better for the task; however, there is variation between languages. The time and effort needed to fix the languages that get the poorest results is a factor which we continue to evaluate.

Abstract:
We will present a grammar library to express mathematics in diferent languages, both natural or artificial.
The latter case happens when interfacing mathematical software like computer algebra systems or automatic reasoners.
Two applications of such interfacing will be presented and another one on collaborative multilingual editing of mathematical content.

Software

Abstract:
The GF Eclipse Plugin provides an integrated development environment (IDE) for developing grammars in the Grammatical Framework (GF).
Built on top of the Eclipse Platform, it aids grammar writing by providing instant syntax checking, semantic warnings and crossreference resolution.
Inline documentation and a library browser facilitate the use of existing resource libraries, and compilation and testing
of grammars is greatly improved through single-click launch conﬁgurations and an in-built test case manager for running treebank regression tests.
This IDE promotes grammar-based systems by making the tasks of writing grammars and using resource libraries more efficient, and provides powerful tools to reduce the barrier to entry to GF and encourage new users of the framework.

Abstract:
This grammar has been developed originally for the semantic multilingual wiki system AceWiki-GF, as documented in Deliverable D11.3. The grammar can be used online at http://attempto.ifi.uzh.ch/acewiki-gf/.
It currently supports 3 languages: ACE, German and Spanish, where ACE is a formal language used for automated reasoning. A 500-word geography domain vocabulary has been created to describe Europe.
ACE is represented by two languages, Ace and Ape. Ape linearizations contain explicit lexical entries so that the ACE parser (APE) can be used to map the sentences of this grammar to OWL. The wiki shows how this mapping works.
The source for the grammar is distributed at Github at: https://github.com/Attempto/ACE-in-GF.

Abstract:
This software is part of the MOLTO Translators' Tools.
The demo installation can be tested at the address http://tfs.cc/pootle/
This version of Pootle includes proof-of-concept modifications to support the Grammatical Framework as a machine translation backend. To enable the support, add the tuple `('GF', '')` to the `MT_BACKENDS` configuration variable.
The GF backend requires a pgf-http server that provides translations for some GF grammars. The pgf-http server is included in the GF distribution. In the ADMIN->Grammars tab, add the service URLs for all the grammars that are needed in the translation projects. For example,<http://cloud.grammaticalframework.org/grammars/Foods.pgf> (without the brackets).
Once Pootle has a list of available grammars, they can be allocated to projects in Pootle's web administration interface. In the ADMIN->Projects tab, simply edit the "GF Grammars" column for each project.
When translating for a project with enabled GF grammars, a bug icon should appear among the other machine translation icons. When this icon is clicked, Pootle will call the GF web service to translate the current sentence with any of the grammars allocated for the current project. If the translation succeeds, the text will be replaced by its translation.
NOTE: Language codes are used to associate Pootle project languages to GF concrete grammars. Hence it is essential that both the projects and the grammars have proper ISO 639-1 language codes. A following hyphen and region code are allowed but ignored.
`git clone http://tfs.cc/git/pootle.git`

Abstract:
The MOLTO Phrasebook is a sample modular GF grammar for a multilingual traveller's phrasebook. It currently supports, with varying degree of precision, the languages: Eng, Bul, Cat, Dan, Dut, Fin, Fre, Ger, Hin, Ita, Lav, Nor, Pes, Pol, Ron, Rus, Spa, Swe, Tha, Urd. It has a module that handles disambiguation in Eng and in Ron.
The sources for this version are available from svn at:
URL: svn://molto-project.eu/wp10/phrasebook
Repository Root: svn://molto-project.eu
Repository UUID: 54d65b75-f25a-4862-968f-dc0a3298bc6b
Revision: 2438
It has been compiled on Grammatical Framework (GF) version 3.4-darcs.

Abstract:
The MathBar Grammar is a compiled PGF version of the Mathematical Grammar Library.
It supports the following languages: Fre, Cat, Spa, Eng and Fin. More languages are available but have not been checked against quality.
The source files are distributed via svn at
URL: svn://molto-project.eu/mgl
Repository Root: svn://molto-project.eu
Repository UUID: 54d65b75-f25a-4862-968f-dc0a3298bc6b
Revision: 2432

Abstract:
This little application generates gf files providing multilingual lexicon for a given list of terms. Files are designed as an extension to Phrasebook, but this may be easily modified. The running example is fish names, README file explains how to obtain vocabulary from any other domain.

Abstract:
The MOLTO Google+ Community began when we decided to hold online seminars to present and coordinate the work on the final flagships. The Google hangouts allowed us to stream live our presentations and show live the demo to those in the team interested. In future, this can serve as a vehicle to still keep in touch, even informally. On the MOLTO Community stream we publish items that we feel are related to the work done during the project.