ACL 2010
Joint Fifth Workshop on
Statistical
Machine Translation
and MetricsMATR
Proceedings of the Workshop
15-16 July 2010
Programme and Table of Contents
Introduction
Chris Callison-Burch, Philipp
Koehn, Christof Monz, Kay Peterson, Omar Zaidan ……………….. iii [PDF]
Thursday, July 15, 2010
Full Paper Session 1
A Semi-Supervised Word Alignment Algorithm
with Partial Manual Alignments
Qin Gao, Nguyen Bach and Stephan Vogel . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . 1 [PDF]
Fast Consensus Hypothesis Regeneration for
Machine Translation
Boxing Chen, George Foster and Roland Kuhn
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . 11 [PDF]
Shared Translation Task
Findings of the 2010 Joint Workshop on
Statistical Machine Translation and Metrics for Machine Translation
Chris Callison-Burch, Philipp Koehn,
Christof Monz, Kay Peterson, Mark Przybocki and Omar
Zaidan . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . .
17 [PDF]
Poster Session 1: Translation Task
LIMSI’s Statistical Translation Systems for
WMT’10
Alexandre Allauzen, Josep M. Crego,
˙Ilknur Durgar El-Kahlout and François Yvon . . . . . . . . . . . 54 [PDF]
2010 Failures in English-Czech Phrase-Based
MT
Ondřej
Bojar and Kamil Kos . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 60 [PDF]
An Empirical Study on Development Set
Selection Strategy for Machine Translation Learning
Hui Cong, Zhao Hai, Lu Bao-Liang and Song
Yan . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . 67 [PDF]
The
Vladimir Eidelman, Chris Dyer and Philip
Resnik . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . .72 [PDF]
Further Experiments with Shallow
Christian Federmann, Andreas Eisele, Yu
Chen, Sabine Hunsicker, Jia Xu and Hans Uszkoreit . 77 [PDF]
Improved Features and Grammar Selection for
Syntax-Based MT
Greg Hanneman, Jonathan Clark and Alon Lavie
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . .82 [PDF]
FBK at WMT 2010: Word Lattices for
Morphological Reduction and Chunk-Based Reordering
Christian Hardmeier, Arianna Bisazza and
Marcello Federico . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
88 [PDF]
The RWTH
Carmen Heger, Joern Wuebker, Matthias
Huck, Gregor Leusch, Saab Mansour, Daniel Stein
and Hermann Ney . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . .93 [PDF]
Using Collocation Segmentation to Augment
the Phrase Table
Carlos A. Henríquez Q., Marta Ruiz Costa-jussà, Vidas Daudaravicius, Rafael E. Banchs
and José B. Mariño . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .. 98 [PDF]
The RALI Machine Translation System for WMT
2010
Stéphane Huet, Julien Bourdaillet,
Alexandre Patry and Philippe Langlais . . . . . . . . . . . . . . . . . . . 103 [PDF]
Exodus - Exploring SMT for EU Institutions
Michael Jellinghaus, Alexandros Poulis and
David Kolovratnik . . . . . . . . . . . . . . . . . . . . . . . . . . . . 110 [PDF]
More Linguistic Annotation for Statistical
Machine Translation
Philipp Koehn, Barry Haddow, Philip
Williams and Hieu Hoang. . . . . . . . . . . . . . . . . . . . . . . . . . . .115 [PDF]
LIUM SMT Machine Translation System for WMT
2010
Patrik Lambert,
Sadaf Abdul-Rauf and Holger Schwenk . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . 121 [PDF]
Lessons from NRC’s Portage System at WMT
2010
Samuel Larkin, Boxing Chen, George Foster,
Ulrich Germann, Eric Joanis, Howard Johnson
and Roland Kuhn . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . 127 [PDF]
Joshua 2.0: A Toolkit for Parsing-Based
Machine Translation with Syntax, Semirings, Discriminative
Training and Other Goodies
Zhifei Li, Chris Callison-Burch, Chris
Dyer, Juri Ganitkevitch, Ann Irvine, Sanjeev Khudanpur,
Lane Schwartz, Wren Thornton, Ziyuan Wang,
Jonathan Weese and Omar Zaidan . . . . . . . . . . . .133 [PDF]
The Karlsruhe Institute for Technology
Translation System for the ACL-WMT 2010
Jan Niehues, Teresa Herrmann, Mohammed
Mediani and Alex Waibel . . . . . . . . . . . . . . . . . . . . . . 138 [PDF]
MATREX: The DCU MT System for WMT 2010
Sergio Penkale, Rejwanul Haque, Sandipan
Dandapat, Pratyush Banerjee, Ankit K. Srivastava,
Jinhua Du, Pavel Pecina, Sudip Kumar
Naskar, Mikel L. Forcada and
The Cunei Machine Translation Platform for
WMT ’10
Aaron Phillips . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . 149 [PDF]
The CUED HiFST System for the WMT10
Translation Shared Task
Juan Pino, Gonzalo Iglesias, Adrià de Gispert, Graeme Blackwood, Jamie Brunning
and William Byrne . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . 155 [PDF]
The LIG Machine Translation System for WMT
2010
Marion Potet, Laurent Besacier and Hervé
Blanchon . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . 161 [PDF]
Linear Inversion Transduction Grammar
Alignments as a Second Translation Path
Markus Saers, Joakim Nivre and Dekai Wu .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . 167 [PDF]
UPV-PRHLT English–Spanish System for WMT10
Germán
Sanchis-Trilles, Jesús Andrés-Ferrer, Guillem
Gascó, Jesús González-Rubio, Pascual Martínez-Gómez,
Martha-Alicia Rocha, Joan-Andreu Sánchez and Francisco Casacuberta . . . . . . . . . . . . .
…. . . . . 172 [PDF]
Reproducible Results in Parsing-Based
Machine Translation: The JHU Shared Task Submission
Lane Schwartz . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . 177 [PDF]
Vs and OOVs: Two Problems for Translation
between German and English
Sara Stymne, Maria Holmqvist and Lars
Ahrenberg . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . 183 [PDF]
To Cache or Not To Cache? Experiments with
Adaptive Models in Statistical Machine Translation
Jörg Tiedemann . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . 189 [PDF]
Applying Morphological Decompositions to
Statistical Machine Translation
Sami Virpioja, Jaakko Väyrynen, André
Mansikkaniemi and Mikko Kurimo . . . . . . . . . . . . . . . . . 195 [PDF]
Maximum Entropy Translation Model in
Dependency-Based MT Framework
Zdeněk
Žabokrtský,
Martin Popel and David Mareček . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . .201 [PDF]
UCH-UPV English–Spanish System for WMT10
Francisco Zamora-Martínez and Germán
Sanchis-Trilles . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . 207 [PDF]
Hierarchical Phrase-Based MT at the
Daniel Zeman . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . 212 [PDF]
Invited
Talk – Hermann Ney [not available]
Full Paper Session 2
Incremental Decoding for Phrase-Based
Statistical Machine Translation
Baskaran
Sankaran, Ajeet Grewal and Anoop Sarkar . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . 216 [PDF]
Full Paper Session 3
How to Avoid
Burning Ducks: Combining Linguistic Analysis and Corpus Statistics for German
Compound Processing
Fabienne Fritzinger and Alexander Fraser .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . 224 [PDF]
Chunk-Based Verb Reordering in VSO Sentences
for Arabic-English Statistical Machine Translation
Arianna Bisazza and Marcello Federico . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . 235 [PDF]
Head Finalization: A Simple Reordering Rule
for SOV Languages
Hideki Isozaki, Katsuhito Sudoh, Hajime
Tsukada and Kevin Duh . . . . . . . . . . . . . . . . . . . . . . . . . . 244 [PDF]
Aiding Pronoun Translation with Co-Reference
Resolution
Ronan Le Nagard and Philipp Koehn . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . 252 [PDF]
Friday, July 16, 2010
Shared Task Presentations
Overview: MetricsMATR [not available]
Poster Session: Full Paper
Jane: Open Source Hierarchical Translation,
Extended with Reordering and Lexicon Models
David Vilar, Daniel Stein, Matthias Huck
and Hermann Ney . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 262 [PDF]
Poster Session: System Combination Task
MANY: Open
Loïc
Barrault . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 271 [PDF]
Adaptive Model Weighting and Transductive
Regression for Predicting Best System Combinations
Ergun Biçici
and S. Serdar Kozat . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . 276 [PDF]
L1 Regularized Regression for Reranking and
System Combination in Machine Translation
Ergun Biçici
and Deniz Yuret . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . 282 [PDF]
An Augmented Three-Pass System Combination
Framework: DCU Combination System for WMT 2010
Jinhua Du, Pavel Pecina and
The UPV-PRHLT Combination System for WMT
2010
Jesús González-Rubio, Germán
Sanchis-Trilles, Joan-Andreu Sánchez, Jesús Andrés-Ferrer, Guillem
Gascó,
Pascual Martínez-Gómez,
Martha-Alicia Rocha and Francisco Casacuberta . . . . . . . . . . . 296 [PDF]
CMU Multi-Engine Machine Translation for WMT
2010
Kenneth Heafield and Alon Lavie . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . 301 [PDF]
CMU System Combination via Hypothesis
Selection for WMT’10
Almut Silja Hildebrand and Stephan Vogel .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . 307 [PDF]
JHU System Combination Scheme for WMT 2010
Sushant Narsale . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . 311 [PDF]
The RWTH System Combination System for WMT
2010
Gregor Leusch and Hermann Ney . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . 315 [PDF]
BBN System Description for WMT10 System
Combination Task
Antti-Veikko Rosti, Bing Zhang, Spyros
Matsoukas and Richard Schwartz . . . . . . . . . . . . . . . . . . 321 [PDF]
LRscore for Evaluating Lexical and
Reordering Quality in MT
Alexandra Birch and Miles Osborne . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . 327 [PDF]
Document-Level Automatic MT Evaluation based
on Discourse Representations
Elisabet Comelles, Jesus Giménez, Lluís Màrquez, Irene Castellón and Victoria Arranz . . . . . . . 333 [PDF]
METEOR-NEXT and the METEOR Paraphrase
Tables: Improved Evaluation Support for Five Target Languages
Michael Denkowski
and Alon Lavie . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . 339 [PDF]
Normalized Compression Distance Based
Measures for MetricsMATR 2010
Marcus Dobrinkat, Tero Tapiovaara, Jaakko
Väyrynen and Kimmo Kettunen . . . . . . . . . . . . . . . . 343 [PDF]
The DCU Dependency-Based Metric in
WMT-MetricsMATR 2010
Yifan He, Jinhua Du,
TESLA: Translation Evaluation of Sentences
with Linear-Programming-Based Analysis
Chang Liu, Daniel Dahlmeier and Hwee Tou
Ng. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . 354 [PDF]
The Parameter-Optimized ATEC Metric for MT
Evaluation
Billy Wong and Chunyu Kit . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . 360 [PDF]
Full Paper Session 4
A Unified Approach to Minimum Risk Training
and Decoding
Abhishek Arun, Barry Haddow and Philipp
Koehn . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . 365 [PDF]
N-Best Reranking by Multitask Learning
Kevin Duh, Katsuhito Sudoh, Hajime
Tsukada, Hideki Isozaki and Masaaki Nagata . . . . . . . . . . 375 [PDF]
Taming Structured Perceptrons on Wild
Feature Vectors
Ralf Brown . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . 384 [PDF]
Translation Model Adaptation by Resampling
Kashif Shah, Loïc Barrault and Holger Schwenk . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . 392 [PDF]
Full Paper Session 5
Integration of Multiple Bilingually-Learned Segmentation
Schemes into Statistical Machine Translation
Michael Paul, Andrew Finch and Eiichiro
Sumita . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . 400 [PDF]
Improved Translation with Source Syntax
Labels
Hieu Hoang and Philipp Koehn . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . 409 [PDF]
Divide and Translate: Improving Long
Distance Reordering in Statistical Machine Translation
Katsuhito Sudoh, Kevin Duh, Hajime
Tsukada, Tsutomu Hirao and Masaaki Nagata . . . . . . . . . 418 [PDF]
Decision Trees for Lexical Smoothing in
Statistical Machine Translation
Rabih Zbib,
Spyros Matsoukas, Richard Schwartz and John Makhoul . . . . . . . . . . . . . .
. . . . . . . . . 428 [PDF]