The HW-TSC’s Simultaneous Speech-to-Text Translation System for IWSLT 2023 Evaluation

Jiaxin Guo; Daimeng Wei; Zhanglin Wu; Zongyao Li; Zhiqiang Rao; Minghan Wang; Hengchao Shang; Xiaoyu Chen; Zhengzhe Yu; Shaojun Li; Yuhao Xie; Lizhi Lei; Hao Yang

doi:10.18653/v1/2023.iwslt-1.35

Abstract

This paper presents our submission to the IWSLT 2023 Simultaneous Speech-to-Text Translation competition. We participate in three language directions: English-German, English-Chinese, and English-Japanese. Our solution is a cascaded incremental decoding system comprising an ASR model and an MT model. The ASR model is based on the U2++ architecture and handles both streaming and offline speech. The MT model adopts the Deep-Transformer architecture. To improve performance, we explore methods for generating a confident partial target text that guides the next MT incremental decoding step. Our experiments show that these simultaneous strategies achieve low latency while losing no more than 2 BLEU points relative to offline systems.
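The idea of committing a confident partial target and feeding it back into the next incremental decoding step can be illustrated with a minimal sketch. The abstract does not say how confidence is determined, so this sketch substitutes a simple stand-in: a target token is committed once two consecutive hypotheses agree on it. The names `common_prefix`, `simultaneous_translate`, and `toy_translate` are hypothetical, and `toy_translate` is a placeholder for a real MT model that supports prefix-constrained decoding; none of this is taken from the paper.

```python
from typing import Callable, List

Tokens = List[str]
# Hypothetical MT interface: (source so far, forced target prefix, is_final) -> hypothesis
Translator = Callable[[Tokens, Tokens, bool], Tokens]


def common_prefix(a: Tokens, b: Tokens) -> Tokens:
    """Return the longest shared prefix of two token sequences."""
    shared: Tokens = []
    for x, y in zip(a, b):
        if x != y:
            break
        shared.append(x)
    return shared


def simultaneous_translate(source_chunks: List[Tokens], translate: Translator) -> List[Tokens]:
    """Return the committed target prefix after each source chunk, plus the final output.

    Assumption (not from the paper): a token counts as "confident" once two
    consecutive hypotheses agree on it. Committed tokens are never retracted
    and are passed back to the translator as a forced prefix.
    """
    source: Tokens = []
    committed: Tokens = []
    previous: Tokens = []
    snapshots: List[Tokens] = []
    for chunk in source_chunks:
        source.extend(chunk)  # e.g. newly recognized ASR tokens
        hypothesis = translate(source, committed, False)
        stable = common_prefix(previous, hypothesis)
        if len(stable) > len(committed):
            committed = stable
        previous = hypothesis
        snapshots.append(list(committed))
    # Once the input has ended, decode one last time and commit everything.
    committed = translate(source, committed, True)
    snapshots.append(list(committed))
    return snapshots


def toy_translate(source: Tokens, forced_prefix: Tokens, final: bool) -> Tokens:
    """Placeholder "MT model": upper-cases tokens and marks the newest one as tentative."""
    words = [tok.upper() for tok in source]
    if not final and words:
        words[-1] += "?"  # the last token may still change with more context
    return list(forced_prefix) + words[len(forced_prefix):]


if __name__ == "__main__":
    for step in simultaneous_translate([["a"], ["b"], ["c"]], toy_translate):
        print(step)
```

In a cascade, `source_chunks` would be the incremental ASR output. Requiring agreement across more than two hypotheses would make commits more conservative at the cost of higher latency.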

Anthology ID: 2023.iwslt-1.35
Volume: Proceedings of the 20th International Conference on Spoken Language Translation (IWSLT 2023)
Month: July
Year: 2023
Address: Toronto, Canada (in-person and online)
Editors: Elizabeth Salesky, Marcello Federico, Marine Carpuat
Venue: IWSLT
SIG: SIGSLT
Publisher: Association for Computational Linguistics
Pages: 376–382
URL: https://aclanthology.org/2023.iwslt-1.35/
DOI: 10.18653/v1/2023.iwslt-1.35
Cite (ACL): Jiaxin Guo, Daimeng Wei, Zhanglin Wu, Zongyao Li, Zhiqiang Rao, Minghan Wang, Hengchao Shang, Xiaoyu Chen, Zhengzhe Yu, Shaojun Li, Yuhao Xie, Lizhi Lei, and Hao Yang. 2023. The HW-TSC’s Simultaneous Speech-to-Text Translation System for IWSLT 2023 Evaluation. In Proceedings of the 20th International Conference on Spoken Language Translation (IWSLT 2023), pages 376–382, Toronto, Canada (in-person and online). Association for Computational Linguistics.
Cite (Informal): The HW-TSC’s Simultaneous Speech-to-Text Translation System for IWSLT 2023 Evaluation (Guo et al., IWSLT 2023)
PDF: https://aclanthology.org/2023.iwslt-1.35.pdf
