Machine Translation Summit (2007)



up

bib (full) Proceedings of Machine Translation Summit XI: Papers

The paper presents and evaluates a wide coverage, rule-governed machine translation system for Danish-English. Analysis and polysemy resolution are based on Constraint Grammar dependency trees. In its 85.000 lexeme lexicon, Dan2eng uses context-sensitive lexical transfer rules linking dependencies to semantic prototype conditions, syntactic function, definiteness etc. Dependency is further exploited instead of constituent bracketing to support syntactic movement rules. A robust derivational and compound analysis, as well as a separate NER module permit the handling of unrestricted text from a wide range of genres. The system averaged TER scores of 7 (BLEU 0.55-0.6) on student tasks, but performance varied widely against raw and edited Europarl references, respectively.




up

bib (full) Proceedings of the Workshop on Using corpora for natural language generation