The CUHKSZ System for the IWSLT 2026 Low-Resource Speech-to-Text Task

ruiyan SUN, Qingming Li, Satoshi Nakamura


Abstract
This paper describes the CUHKSZ system for the IWSLT 2026 Low-Resource Speech-to-Text task. We propose Gradient-Driven Parameter Sharing (GDPS), a framework that analyzes inter-language gradient behaviors to automatically determine optimal language groupings and shared-private parameter ratios. Built upon SeamlessM4T-Medium, GDPS reduces negative transfer by specializing Layer 11 FFN2 while maintaining shared encoder representations across languages. Additionally, we incorporate curriculum distillation with progressive pseudo-label mixing and test-time reranking combining prior-BLEU weighting and self-consistency scoring. Evaluation on eight low-resource languages (bem, ckb, gle, hau, ibo, yor, aeb, est) demonstrates strongest gains on bem (+2.07 BLEU), hau (+1.50), and ibo (+0.38) compared to unified fine-tuning, while ckb and yor benefit more from prior-based reranking at inference.
Anthology ID:
2026.iwslt-1.33
Volume:
Proceedings of the 23rd International Conference on Spoken Language Translation (IWSLT 2026)
Month:
July
Year:
2026
Address:
San Diego, USA (in-person and online)
Editors:
Elizabeth Salesky, Antonios Anastasopoulos, Matteo Negri, Marcello Federico
Venues:
IWSLT | WS
SIG:
SIGSLT
Publisher:
Association for Computational Linguistics
Note:
Pages:
296–304
Language:
URL:
https://aclanthology.org/2026.iwslt-1.33/
DOI:
Bibkey:
Cite (ACL):
ruiyan SUN, Qingming Li, and Satoshi Nakamura. 2026. The CUHKSZ System for the IWSLT 2026 Low-Resource Speech-to-Text Task. In Proceedings of the 23rd International Conference on Spoken Language Translation (IWSLT 2026), pages 296–304, San Diego, USA (in-person and online). Association for Computational Linguistics.
Cite (Informal):
The CUHKSZ System for the IWSLT 2026 Low-Resource Speech-to-Text Task (SUN et al., IWSLT 2026)
Copy Citation:
PDF:
https://aclanthology.org/2026.iwslt-1.33.pdf