ACL 2010

 

       Joint Fifth Workshop on

Statistical Machine Translation

        and MetricsMATR

 

          Proceedings of the Workshop

 

                         15-16 July 2010

                       Uppsala University

                       Uppsala, Sweden

           Programme and Table of Contents

Introduction

Chris Callison-Burch, Philipp Koehn, Christof Monz, Kay Peterson, Omar Zaidan ……………….. iii [PDF]

 

Thursday, July 15, 2010

 

Full Paper Session 1

A Semi-Supervised Word Alignment Algorithm with Partial Manual Alignments

Qin Gao, Nguyen Bach and Stephan Vogel . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1  [PDF]

Fast Consensus Hypothesis Regeneration for Machine Translation

Boxing Chen, George Foster and Roland Kuhn . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11 [PDF]

 

Shared Translation Task

Findings of the 2010 Joint Workshop on Statistical Machine Translation and Metrics for Machine Translation

Chris Callison-Burch, Philipp Koehn, Christof Monz, Kay Peterson, Mark Przybocki and Omar

Zaidan . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .  17 [PDF]

 

Poster Session 1: Translation Task

LIMSI’s Statistical Translation Systems for WMT’10

Alexandre Allauzen, Josep M. Crego, ˙Ilknur Durgar El-Kahlout and François Yvon . . . . . . . . . . . 54 [PDF]

2010 Failures in English-Czech Phrase-Based MT

Ondřej Bojar and Kamil Kos . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 60 [PDF]

An Empirical Study on Development Set Selection Strategy for Machine Translation Learning

Hui Cong, Zhao Hai, Lu Bao-Liang and Song Yan . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 67 [PDF]

The University of Maryland Statistical Machine Translation System for the Fifth Workshop on Machine Translation

Vladimir Eidelman, Chris Dyer and Philip Resnik . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .72 [PDF]

Further Experiments with Shallow Hybrid MT Systems

Christian Federmann, Andreas Eisele, Yu Chen, Sabine Hunsicker, Jia Xu and Hans Uszkoreit . 77 [PDF]

Improved Features and Grammar Selection for Syntax-Based MT

Greg Hanneman, Jonathan Clark and Alon Lavie . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .82 [PDF]

FBK at WMT 2010: Word Lattices for Morphological Reduction and Chunk-Based Reordering

Christian Hardmeier, Arianna Bisazza and Marcello Federico . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 88 [PDF]

The RWTH Aachen Machine Translation System for WMT 2010

Carmen Heger, Joern Wuebker, Matthias Huck, Gregor Leusch, Saab Mansour, Daniel Stein

and Hermann Ney . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .93 [PDF]

Using Collocation Segmentation to Augment the Phrase Table

Carlos A. Henríquez Q., Marta Ruiz Costa-jussà, Vidas Daudaravicius, Rafael E. Banchs

and José B. Mariño . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .. 98 [PDF]

The RALI Machine Translation System for WMT 2010

Stéphane Huet, Julien Bourdaillet, Alexandre Patry and Philippe Langlais . . . . . . . . . . . . . . . . . . . 103 [PDF]

Exodus - Exploring SMT for EU Institutions

Michael Jellinghaus, Alexandros Poulis and David Kolovratnik . . . . . . . . . . . . . . . . . . . . . . . . . . . . 110 [PDF]

More Linguistic Annotation for Statistical Machine Translation

Philipp Koehn, Barry Haddow, Philip Williams and Hieu Hoang. . . . . . . . . . . . . . . . . . . . . . . . . . . .115 [PDF]

LIUM SMT Machine Translation System for WMT 2010

Patrik Lambert, Sadaf Abdul-Rauf and Holger Schwenk . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 121 [PDF]

Lessons from NRC’s Portage System at WMT 2010

Samuel Larkin, Boxing Chen, George Foster, Ulrich Germann, Eric Joanis, Howard Johnson

and Roland Kuhn . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .  . 127 [PDF]

Joshua 2.0: A Toolkit for Parsing-Based Machine Translation with Syntax, Semirings, Discriminative

Training and Other Goodies

Zhifei Li, Chris Callison-Burch, Chris Dyer, Juri Ganitkevitch, Ann Irvine, Sanjeev Khudanpur,

Lane Schwartz, Wren Thornton, Ziyuan Wang, Jonathan Weese and Omar Zaidan . . . . . . . . . . . .133 [PDF]

The Karlsruhe Institute for Technology Translation System for the ACL-WMT 2010

Jan Niehues, Teresa Herrmann, Mohammed Mediani and Alex Waibel . . . . . . . . . . . . . . . . . . . . . . 138 [PDF]

MATREX: The DCU MT System for WMT 2010

Sergio Penkale, Rejwanul Haque, Sandipan Dandapat, Pratyush Banerjee, Ankit K. Srivastava,

Jinhua Du, Pavel Pecina, Sudip Kumar Naskar, Mikel L. Forcada and Andy Way . . . . . . . . . . . .. . 143 [PDF]

The Cunei Machine Translation Platform for WMT ’10

Aaron Phillips . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 149 [PDF]

The CUED HiFST System for the WMT10 Translation Shared Task

Juan Pino, Gonzalo Iglesias, Adrià de Gispert, Graeme Blackwood, Jamie Brunning

and William Byrne . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 155 [PDF]

The LIG Machine Translation System for WMT 2010

Marion Potet, Laurent Besacier and Hervé Blanchon . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 161 [PDF]

Linear Inversion Transduction Grammar Alignments as a Second Translation Path

Markus Saers, Joakim Nivre and Dekai Wu . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 167 [PDF]

UPV-PRHLT English–Spanish System for WMT10

Germán Sanchis-Trilles, Jesús Andrés-Ferrer, Guillem Gascó, Jesús González-Rubio, Pascual Martínez-Gómez,

Martha-Alicia Rocha, Joan-Andreu Sánchez and Francisco Casacuberta . . . . . . . . . . . . . …. . . . . 172 [PDF]

Reproducible Results in Parsing-Based Machine Translation: The JHU Shared Task Submission

Lane Schwartz . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 177 [PDF]

Vs and OOVs: Two Problems for Translation between German and English

Sara Stymne, Maria Holmqvist and Lars Ahrenberg . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 183 [PDF]

To Cache or Not To Cache? Experiments with Adaptive Models in Statistical Machine Translation

Jörg Tiedemann . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 189 [PDF]

Applying Morphological Decompositions to Statistical Machine Translation

Sami Virpioja, Jaakko Väyrynen, André Mansikkaniemi and Mikko Kurimo . . . . . . . . . . . . . . . . . 195 [PDF]

Maximum Entropy Translation Model in Dependency-Based MT Framework

Zdeněk Žabokrtský, Martin Popel and David Mareček . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .201 [PDF]

UCH-UPV English–Spanish System for WMT10

Francisco Zamora-Martínez and Germán Sanchis-Trilles . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 207 [PDF]

Hierarchical Phrase-Based MT at the Charles University for the WMT 2010 Shared Task

Daniel Zeman . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 212 [PDF]

 

Invited Talk – Hermann Ney [not available]

 

Full Paper Session 2

Incremental Decoding for Phrase-Based Statistical Machine Translation

Baskaran Sankaran, Ajeet Grewal and Anoop Sarkar . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 216 [PDF]

 

Full Paper Session 3

How to Avoid Burning Ducks: Combining Linguistic Analysis and Corpus Statistics for German Compound Processing

Fabienne Fritzinger and Alexander Fraser . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 224 [PDF]

Chunk-Based Verb Reordering in VSO Sentences for Arabic-English Statistical Machine Translation

Arianna Bisazza and Marcello Federico . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 235 [PDF]

Head Finalization: A Simple Reordering Rule for SOV Languages

Hideki Isozaki, Katsuhito Sudoh, Hajime Tsukada and Kevin Duh . . . . . . . . . . . . . . . . . . . . . . . . . . 244 [PDF]

Aiding Pronoun Translation with Co-Reference Resolution

Ronan Le Nagard and Philipp Koehn . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 252 [PDF]

 

Friday, July 16, 2010

 

Shared Task Presentations

Overview: MetricsMATR [not available]

 

Poster Session: Full Paper

Jane: Open Source Hierarchical Translation, Extended with Reordering and Lexicon Models

David Vilar, Daniel Stein, Matthias Huck and Hermann Ney . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 262 [PDF]

 

Poster Session: System Combination Task

MANY: Open Source MT System Combination at WMT’10

Loïc Barrault . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 271 [PDF]

Adaptive Model Weighting and Transductive Regression for Predicting Best System Combinations

Ergun Biçici and S. Serdar Kozat . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 276 [PDF]

L1 Regularized Regression for Reranking and System Combination in Machine Translation

Ergun Biçici and Deniz Yuret . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 282 [PDF]

An Augmented Three-Pass System Combination Framework: DCU Combination System for WMT 2010

Jinhua Du, Pavel Pecina and Andy Way . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 290 [PDF]

The UPV-PRHLT Combination System for WMT 2010

Jesús González-Rubio, Germán Sanchis-Trilles, Joan-Andreu Sánchez, Jesús Andrés-Ferrer, Guillem

Gascó, Pascual Martínez-Gómez, Martha-Alicia Rocha and Francisco Casacuberta . . . . . . . . . . . 296 [PDF]

CMU Multi-Engine Machine Translation for WMT 2010

Kenneth Heafield and Alon Lavie . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 301 [PDF]

CMU System Combination via Hypothesis Selection for WMT’10

Almut Silja Hildebrand and Stephan Vogel . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 307 [PDF]

JHU System Combination Scheme for WMT 2010

Sushant Narsale . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 311 [PDF]

The RWTH System Combination System for WMT 2010

Gregor Leusch and Hermann Ney . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 315 [PDF]

BBN System Description for WMT10 System Combination Task

Antti-Veikko Rosti, Bing Zhang, Spyros Matsoukas and Richard Schwartz . . . . . . . . . . . . . . . . . . 321 [PDF]

LRscore for Evaluating Lexical and Reordering Quality in MT

Alexandra Birch and Miles Osborne . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 327 [PDF]

Document-Level Automatic MT Evaluation based on Discourse Representations

Elisabet Comelles, Jesus Giménez, Lluís Màrquez, Irene Castellón and Victoria Arranz . . . . . . . 333 [PDF]

METEOR-NEXT and the METEOR Paraphrase Tables: Improved Evaluation Support for Five Target Languages

Michael Denkowski and Alon Lavie . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 339 [PDF]

Normalized Compression Distance Based Measures for MetricsMATR 2010

Marcus Dobrinkat, Tero Tapiovaara, Jaakko Väyrynen and Kimmo Kettunen . . . . . . . . . . . . . . . . 343 [PDF]

The DCU Dependency-Based Metric in WMT-MetricsMATR 2010

Yifan He, Jinhua Du, Andy Way and Josef van Genabith. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .349 [PDF]

TESLA: Translation Evaluation of Sentences with Linear-Programming-Based Analysis

Chang Liu, Daniel Dahlmeier and Hwee Tou Ng. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 354 [PDF]

The Parameter-Optimized ATEC Metric for MT Evaluation

Billy Wong and Chunyu Kit . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 360 [PDF]

 

Full Paper Session 4

A Unified Approach to Minimum Risk Training and Decoding

Abhishek Arun, Barry Haddow and Philipp Koehn . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 365 [PDF]

N-Best Reranking by Multitask Learning

Kevin Duh, Katsuhito Sudoh, Hajime Tsukada, Hideki Isozaki and Masaaki Nagata . . . . . . . . . . 375 [PDF]

Taming Structured Perceptrons on Wild Feature Vectors

Ralf Brown . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 384 [PDF]

Translation Model Adaptation by Resampling

Kashif Shah, Loïc Barrault and Holger Schwenk . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 392 [PDF]

Full Paper Session 5

Integration of Multiple Bilingually-Learned Segmentation Schemes into Statistical Machine Translation

Michael Paul, Andrew Finch and Eiichiro Sumita . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 400 [PDF]

Improved Translation with Source Syntax Labels

Hieu Hoang and Philipp Koehn . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 409 [PDF]

Divide and Translate: Improving Long Distance Reordering in Statistical Machine Translation

Katsuhito Sudoh, Kevin Duh, Hajime Tsukada, Tsutomu Hirao and Masaaki Nagata . . . . . . . . . 418 [PDF]

Decision Trees for Lexical Smoothing in Statistical Machine Translation

Rabih Zbib, Spyros Matsoukas, Richard Schwartz and John Makhoul . . . . . . . . . . . . . . . . . . . . . . . 428 [PDF]