Proceedings of the Third International Workshop on Free/Open-Source Rule-Based Machine Translation

Cristina España-Bonet, Aarne Ranta (Editors)

June 13-15
Gothenburg, Sweden

The new machine translation: getting blood from a stone
Martin Kay

An IDE for the Grammatical Framework
John J. Camilleri

The GF Eclipse Plugin provides an integrated development environment (IDE) for developing grammars in the Grammatical Framework (GF). Built on top of the Eclipse Platform, it aids grammar writing by providing instant syntax checking, semantic warnings and cross-reference resolution. Inline documentation and a library browser facilitate the use of existing resource libraries, and compilation and testing of grammars are greatly improved through single-click launch configurations and an in-built test case manager for running treebank regression tests. This IDE promotes grammar-based systems by making the tasks of writing grammars and using resource libraries more efficient, and provides powerful tools to reduce the barrier to entry to GF and encourage new users of the framework.

Evaluating North Sámi to Norwegian assimilation RBMT
Trond Trosterud | Kevin Brubeck Unhammer

Choosing the correct paradigm for unknown words in rule-based machine translation systems
V. M. Sánchez-Cartagena | M. Esplà-Gomis | F. Sánchez-Martínez | J. A. Pérez-Ortiz

In previous work, an interactive system aimed at helping non-expert users enlarge the monolingual dictionaries of rule-based machine translation (MT) systems discarded those inflection paradigms that cannot generate a set of inflected word forms validated by the user. This method, however, cannot deal with the common case where several different paradigms generate exactly the same set of inflected word forms, although with different inflection information attached. In this paper, we propose the use of an n-gram-based model of lexical categories and inflection information to select a single paradigm in cases where more than one paradigm generates the same set of word forms. Results obtained with a Spanish monolingual dictionary show that the correct paradigm is chosen for around 75% of the unknown words, thus making the resulting system (available under an open-source license) a valuable aid for enlarging the monolingual dictionaries used in MT with the help of non-expert users who lack technical linguistic knowledge.

An open-source toolkit for integrating shallow-transfer rules into phrase-based statistical machine translation
V. M. Sánchez-Cartagena | F. Sánchez-Martínez | J. A. Pérez-Ortiz

In this paper, we present an open-source toolkit to enrich a phrase-based statistical machine translation system (Moses) with phrase pairs generated from the linguistic resources of a shallow-transfer rule-based machine translation system (Apertium). A system built with this toolkit was not outperformed by any other participant in the shared translation task of the Sixth Workshop on Statistical Machine Translation (WMT 11) for the Spanish–English language pair.

A rule-based machine translation system from Serbo-Croatian to Macedonian
Hrvoje Peradin | Francis Tyers

This paper describes the development of a one-way machine translation system from Serbo-Croatian to Macedonian on the Apertium platform. Details of resources and development methods are given, as well as an evaluation, and general directives for future work.

Deep evaluation of hybrid architectures: use of different metrics in MERT weight optimization
Cristina España-Bonet | Gorka Labaka | Arantza Díaz de Ilarranza | Lluís Màrquez | Kepa Sarasola