From Simultaneous to Streaming Machine Translation by Leveraging Streaming History

Javier Iranzo-Sánchez, Jorge Civera, Alfons Juan


Abstract
Simultaneous Machine Translation is the task of incrementally translating an input sentence before it is fully available. Currently, simultaneous translation is carried out by translating each sentence independently of the previously translated text. More generally, Streaming MT can be understood as an extension of Simultaneous MT to the incremental translation of a continuous input text stream. In this work, a state-of-the-art simultaneous sentence-level MT system is extended to the streaming setup by leveraging the streaming history. Extensive empirical results are reported on IWSLT Translation Tasks, showing that leveraging the streaming history leads to significant quality gains. In particular, the proposed system proves to compare favorably to the best performing systems.
Anthology ID:
2022.acl-long.480
Volume:
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:
May
Year:
2022
Address:
Dublin, Ireland
Editors:
Smaranda Muresan, Preslav Nakov, Aline Villavicencio
Venue:
ACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
6972–6985
Language:
URL:
https://aclanthology.org/2022.acl-long.480
DOI:
10.18653/v1/2022.acl-long.480
Bibkey:
Cite (ACL):
Javier Iranzo-Sánchez, Jorge Civera, and Alfons Juan. 2022. From Simultaneous to Streaming Machine Translation by Leveraging Streaming History. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 6972–6985, Dublin, Ireland. Association for Computational Linguistics.
Cite (Informal):
From Simultaneous to Streaming Machine Translation by Leveraging Streaming History (Iranzo-Sánchez et al., ACL 2022)
Copy Citation:
PDF:
https://aclanthology.org/2022.acl-long.480.pdf
Software:
 2022.acl-long.480.software.zip
Video:
 https://aclanthology.org/2022.acl-long.480.mp4
Data
MuST-C