A Test Set for Discourse Translation from Japanese to English

Masaaki Nagata, Makoto Morishita


Abstract
We built a test set for Japanese-to-English discourse translation to evaluate how well context-aware machine translation performs. For each discourse phenomenon, we systematically collected examples in which the translation of the second sentence depends on the first sentence. Compared with a previous study on test sets for English-to-French discourse translation (CITATION), we needed different approaches to construct the data because Japanese has zero pronouns and writes different word senses with different characters. Context-aware neural machine translation improved translation accuracy on this test set, and the improvement mainly reflects better translation of zero pronouns.
Anthology ID:
2020.lrec-1.457
Volume:
Proceedings of the Twelfth Language Resources and Evaluation Conference
Month:
May
Year:
2020
Address:
Marseille, France
Editors:
Nicoletta Calzolari, Frédéric Béchet, Philippe Blache, Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, Hélène Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
Publisher:
European Language Resources Association
Pages:
3704–3709
Language:
English
URL:
https://aclanthology.org/2020.lrec-1.457
Cite (ACL):
Masaaki Nagata and Makoto Morishita. 2020. A Test Set for Discourse Translation from Japanese to English. In Proceedings of the Twelfth Language Resources and Evaluation Conference, pages 3704–3709, Marseille, France. European Language Resources Association.
Cite (Informal):
A Test Set for Discourse Translation from Japanese to English (Nagata & Morishita, LREC 2020)
PDF:
https://aclanthology.org/2020.lrec-1.457.pdf