HW-TSC’s Submissions To the IWSLT2024 Low-resource Speech Translation Tasks

Zheng Jiawei, Hengchao Shang, Zongyao Li, Zhanglin Wu, Daimeng Wei, Zhiqiang Rao, Shaojun Li, Jiaxin Guo, Bin Wei, Yuanchang Luo, Hao Yang


Abstract
In this work, we submitted our systems to the low-resource track of the IWSLT 2024 Speech Translation Campaign. Our systems tackled the unconstrained condition of the Dialectal Arabic North Levantine (ISO-3 code: apc) to English language pair. We proposed a cascaded solution consisting of an automatic speech recognition (ASR) model and a machine translation (MT) model. It was noted that the ASR model employed the pre-trained Whisper-large-v3 model to process the speech data, while the MT model adopted the Transformer architecture. To improve the quality of the MT model, it was stated that our system utilized not only the data provided by the competition but also an additional 54 million parallel sentences. Ultimately, we reported that our final system achieved a BLEU score of 24.7 for apc-to-English translation.
Anthology ID:
2024.iwslt-1.21
Volume:
Proceedings of the 21st International Conference on Spoken Language Translation (IWSLT 2024)
Month:
August
Year:
2024
Address:
Bangkok, Thailand (in-person and online)
Editors:
Elizabeth Salesky, Marcello Federico, Marine Carpuat
Venue:
IWSLT
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
160–163
Language:
URL:
https://aclanthology.org/2024.iwslt-1.21
DOI:
10.18653/v1/2024.iwslt-1.21
Bibkey:
Cite (ACL):
Zheng Jiawei, Hengchao Shang, Zongyao Li, Zhanglin Wu, Daimeng Wei, Zhiqiang Rao, Shaojun Li, Jiaxin Guo, Bin Wei, Yuanchang Luo, and Hao Yang. 2024. HW-TSC’s Submissions To the IWSLT2024 Low-resource Speech Translation Tasks. In Proceedings of the 21st International Conference on Spoken Language Translation (IWSLT 2024), pages 160–163, Bangkok, Thailand (in-person and online). Association for Computational Linguistics.
Cite (Informal):
HW-TSC’s Submissions To the IWSLT2024 Low-resource Speech Translation Tasks (Jiawei et al., IWSLT 2024)
Copy Citation:
PDF:
https://aclanthology.org/2024.iwslt-1.21.pdf