Multiword Expressions in Machine Translation

Valia Kordoni, Iliana Simova


Abstract
This work describes an experimental evaluation of how much the treatment of phrasal verbs contributes to better-quality statistical machine translation (SMT) output. The importance of detecting and specially treating phrasal verbs is measured in the context of SMT, where word-for-word translation of these units often produces incoherent results. Two ways of integrating phrasal verb information into a phrase-based SMT system are presented. Automatic and manual evaluations of the results reveal improvements in translation quality in both experiments.
Anthology ID:
L14-1567
Volume:
Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)
Month:
May
Year:
2014
Address:
Reykjavik, Iceland
Editors:
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Hrafn Loftsson, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
Publisher:
European Language Resources Association (ELRA)
Pages:
1208–1211
URL:
http://www.lrec-conf.org/proceedings/lrec2014/pdf/723_Paper.pdf
Cite (ACL):
Valia Kordoni and Iliana Simova. 2014. Multiword Expressions in Machine Translation. In Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14), pages 1208–1211, Reykjavik, Iceland. European Language Resources Association (ELRA).
Cite (Informal):
Multiword Expressions in Machine Translation (Kordoni & Simova, LREC 2014)