HW-TSC’s Submissions to the IWSLT 2026 Offline Speech Translation Task

Boqi Huang; Daimeng Wei; Jiaxin Guo; Yuanchang Luo; Hengchao Shang; Zongyao Li; Zhiqiang Rao; Jinlong Yang; Zhanglin Wu; Yu He; Xiaoqing Lan

Abstract

This paper describes HW-TSC’s submission to the IWSLT 2026 Offline Speech Translation Task, covering the English-to-Chinese and English-to-German unconstrained tracks. Our system adopts a cascade architecture optimized for long-form, unsegmented audio. To mitigate the hallucination and inconsistency issues common in long-sequence processing, we propose a two-pass transcription strategy: an initial streaming ASR pass with a 12-second context buffer for sentence-level coherence, followed by Qwen3-ForcedAligner for precise timestamping. Based on these alignments, the audio is re-segmented into 30-second chunks and a second-pass refinement with Qwen3-Omni produces high-fidelity transcriptions. For the translation module, we employ a context-aware segment merging strategy (up to 150 tokens) to give the Qwen3 LLM sufficient semantic context. Experimental results on the tst-2022 benchmark demonstrate the effectiveness of our pipeline, which achieves COMET scores of 0.8462 (En-Zh) and 0.7854 (En-De), significantly outperforming the standard cascade baselines.
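The abstract names two grouping steps: re-segmenting aligned speech into 30-second chunks and merging segments up to 150 tokens before translation. The following is a minimal Python sketch of how such grouping could work, not the authors' implementation. The greedy strategy, the `Sentence` type, the function names, and the whitespace token count are all assumptions made for illustration; the abstract does not specify the tokenizer or the merging rule.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Sentence:
    """Hypothetical container for one aligned sentence."""
    text: str
    start: float  # start time in seconds (e.g. from a forced aligner)
    end: float    # end time in seconds


def chunk_by_duration(sentences: List[Sentence],
                      max_seconds: float = 30.0) -> List[List[Sentence]]:
    """Greedily group consecutive sentences so that each chunk spans at
    most max_seconds. A single sentence longer than the limit is kept
    as its own chunk (assumption: it is not split further here)."""
    chunks: List[List[Sentence]] = []
    current: List[Sentence] = []
    for sent in sentences:
        # Close the chunk if adding this sentence would exceed the limit.
        if current and sent.end - current[0].start > max_seconds:
            chunks.append(current)
            current = []
        current.append(sent)
    if current:
        chunks.append(current)
    return chunks


def merge_by_tokens(segments: List[str], max_tokens: int = 150) -> List[str]:
    """Greedily merge consecutive text segments up to max_tokens.
    Tokens are counted by whitespace splitting (assumption: a real
    system would likely count tokens with the LLM's own tokenizer)."""
    merged: List[str] = []
    current: List[str] = []
    count = 0
    for seg in segments:
        n = len(seg.split())
        # Start a new merged segment if the budget would be exceeded.
        if current and count + n > max_tokens:
            merged.append(" ".join(current))
            current, count = [], 0
        current.append(seg)
        count += n
    if current:
        merged.append(" ".join(current))
    return merged
```

Under these assumptions, `chunk_by_duration` would feed the second transcription pass and `merge_by_tokens` would prepare the translation inputs; both keep segments in their original order, so downstream output can be mapped back to the source timeline.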

Anthology ID: 2026.iwslt-1.9
Volume: Proceedings of the 23rd International Conference on Spoken Language Translation (IWSLT 2026)
Month: July
Year: 2026
Address: San Diego, USA (in-person and online)
Editors: Elizabeth Salesky, Antonios Anastasopoulos, Matteo Negri, Marcello Federico
Venues: IWSLT | WS
SIG: SIGSLT
Publisher: Association for Computational Linguistics
Pages: 84–90
URL: https://aclanthology.org/2026.iwslt-1.9/
Cite (ACL): Boqi Huang, Daimeng Wei, Jiaxin GUO, Yuanchang Luo, Hengchao Shang, Zongyao Li, Zhiqiang Rao, Jinlong Yang, Zhanglin Wu, Yu He, and Xiaoqing Lan. 2026. HW-TSC’s Submissions to the IWSLT 2026 Offline Speech Translation Task. In Proceedings of the 23rd International Conference on Spoken Language Translation (IWSLT 2026), pages 84–90, San Diego, USA (in-person and online). Association for Computational Linguistics.
Cite (Informal): HW-TSC’s Submissions to the IWSLT 2026 Offline Speech Translation Task (Huang et al., IWSLT 2026)
PDF: https://aclanthology.org/2026.iwslt-1.9.pdf
