Tools

"... We show that it is possible to automati-cally detect machine translated text at sen-tence level from monolingual corpora, us-ing text classification methods. We show further that the accuracy with which a learned classifier can detect text as ma-chine translated is strongly correlated with the trans ..."

with the translationquality of the machine translation system that generated it. Fi-nally, we offer a generic machine transla-tionquality estimation technique based on this approach, which does not require ref-erence sentences. 1

"... Statistical Machine Translation (SMT) has been successfully employed to support translation of film subtitles. We explore the integration of Constraint Grammar corpus annotations into a Swedish–Danish subtitle SMT system in the framework of factored SMT. While the usefulness of the annotations is li ..."

is limited with large amounts of parallel data, we show that linguistic an-notations can increase the gains in transla-tionquality when monolingual data in the target language is added to an SMT system based on a small parallel corpus. 1

"... Unknown words are a well-known hindrance to natural language applications. In particu-lar, they drastically impact machine transla-tion quality. An easy way out commercial translation systems usually offer their users is the possibility to add unknown words and their translations into a dedicated le ..."

Unknown words are a well-known hindrance to natural language applications. In particu-lar, they drastically impact machine transla-tionquality. An easy way out commercial translation systems usually offer their users is the possibility to add unknown words and their translations into a dedicated

-ing is more efficient than CKY decoding, it is unable to capture some hierarchical phrase alignments reachable using CKY decoding and suffers from lower transla-tionquality as a result. This paper in-troduces two improvements to LR decod-ing that make it comparable in translationquality to CKY

such information. We demonstrate that our system based on multi bottom-up tree transducers, which can natively handle discontinuities, can avoid the large transla-tionquality deterioration, achieve the best performance of all classical syntax-based translation systems, and close the gap to phrase

by
George Foster, Pierre Isabelle, Roland Kuhn
- In Proceedings of AMTA, 2010

"... Machine Translation traditionally treats doc-uments as sets of independent sentences. In many genres, however, documents are highly structured, and their structure contains infor-mation that can be used to improve transla-tion quality. We present a preliminary ap-proach to document translation that ..."

Machine Translation traditionally treats doc-uments as sets of independent sentences. In many genres, however, documents are highly structured, and their structure contains infor-mation that can be used to improve transla-tionquality. We present a preliminary ap-proach to document translation

"... In this paper, we propose an example-based decoder for a statistical machine translation (SMT) system, which is used for spoken language machine translation. In this way, it will help to solve the re-ordering problem and other problems for spoken language MT, such as lots of omissions, idioms etc. T ..."

In this paper, we propose an example-based decoder for a statistical machine translation (SMT) system, which is used for spoken language machine translation. In this way, it will help to solve the re-ordering problem and other problems for spoken language MT, such as lots of omissions, idioms etc

"... We demonstrate the feasibility of using unsupervised morphological segmentation for dialects of Arabic, which are poor in linguistics resources. Our experiments us-ing a Qatari Arabic to English machine translation system show that unsupervised segmentation helps to improve the transla-tion quality ..."

We demonstrate the feasibility of using unsupervised morphological segmentation for dialects of Arabic, which are poor in linguistics resources. Our experiments us-ing a Qatari Arabic to English machine translation system show that unsupervised segmentation helps to improve the transla-tionquality

"... Given multiple translations of the same source sentence, how to combine them to produce a translation that is better than any single system output? We propose a hier-archical system combination framework for machine translation. This framework inte-grates multiple MT systems ’ output at the word-, p ..."

multiple systems are addi-tionally selected based on N-gram language models trained on word/word-POS mixed stream, which further improves the transla-tionquality. We consistently observed sig-nificant improvements on several test sets in multiple languages covering different gen-res. 1

by
Mikel L. Forcada
- In Proceedings of the 11th Conference on Theoretical and Methodological Issues in Machine Translation (TMI 2007, 2007

"... This paper focuses on the infer-ence of structural transfer rules for shallow-transfer machine translation (MT). Transfer rules are generated from alignment templates, like those used in statistical MT, that have been extracted from parallel cor-pora and extended with a set of re-strictions that con ..."

-strictions that control their applica-tion. The experiments conducted using the open-source MT platform Apertium show that there is a clear improvement in translationquality as compared to word-for-word trans-lation (when no transfer rules are used), and that the resulting transla-tionquality is very close to the one