Attainable Text-to-Text Machine Translation vs. Translation: Issues Beyond Linguistic Processing

Atsushi Fujita


Abstract
Existing approaches for machine translation (MT) mostly translate given text in the source language into the target language and without explicitly referring to information indispensable for producing proper translation. This includes not only information in other textual elements and modalities than texts in the same document and but also extra-document and non-linguistic information and such as norms and skopos. To design better translation production work-flows and we need to distinguish translation issues that could be resolved by the existing text-to-text approaches and those beyond them. To this end and we conducted an analytic assessment of MT outputs and taking an English-to-Japanese news translation task as a case study. First and examples of translation issues and their revisions were collected by a two-stage post-editing (PE) method: performing minimal PE to obtain translation attainable based on the given textual information and further performing full PE to obtain truly acceptable translation referring to any information if necessary. Then and the collected revision examples were manually analyzed. We revealed dominant issues and information indispensable for resolving them and such as fine-grained style specifications and terminology and domain-specific knowledge and and reference documents and delineating a clear distinction between translation and what text-to-text MT can ultimately attain.
Anthology ID:
2021.mtsummit-research.18
Volume:
Proceedings of Machine Translation Summit XVIII: Research Track
Month:
August
Year:
2021
Address:
Virtual
Editors:
Kevin Duh, Francisco Guzmán
Venue:
MTSummit
SIG:
Publisher:
Association for Machine Translation in the Americas
Note:
Pages:
215–230
Language:
URL:
https://aclanthology.org/2021.mtsummit-research.18
DOI:
Bibkey:
Cite (ACL):
Atsushi Fujita. 2021. Attainable Text-to-Text Machine Translation vs. Translation: Issues Beyond Linguistic Processing. In Proceedings of Machine Translation Summit XVIII: Research Track, pages 215–230, Virtual. Association for Machine Translation in the Americas.
Cite (Informal):
Attainable Text-to-Text Machine Translation vs. Translation: Issues Beyond Linguistic Processing (Fujita, MTSummit 2021)
Copy Citation:
PDF:
https://aclanthology.org/2021.mtsummit-research.18.pdf
Code
 akfujita/staged-pe