LIUM Submission for IWSLT 2026 Low-resource Speech Translation Track

Mohammad Mohammadamini, Marie Tahon


Abstract
This paper describes the LIUM submission to the IWSLT 2026 low-resource speech translation track. It proposes different data augmentation methods for low-resource speech-to-text translation, including two main pipelines: pseudo-labeling and speech synthesis. The goal is to generate parallel speech data in low-resource scenarios without relying on human-annotated speech translation data. Our submission focuses on Central Kurdish–English language pairs. The objective of this work is to explore the advantages and limitations of each data augmentation method. Our best results are obtained using the pseudo-labeling pipeline, achieving a BLEU score of 25.73 on the development set and 21.09 on the test set for Central Kurdish–English translation.
Anthology ID:
2026.iwslt-1.15
Volume:
Proceedings of the 23rd International Conference on Spoken Language Translation (IWSLT 2026)
Month:
July
Year:
2026
Address:
San Diego, USA (in-person and online)
Editors:
Elizabeth Salesky, Antonios Anastasopoulos, Matteo Negri, Marcello Federico
Venues:
IWSLT | WS
SIG:
SIGSLT
Publisher:
Association for Computational Linguistics
Note:
Pages:
124–131
Language:
URL:
https://aclanthology.org/2026.iwslt-1.15/
DOI:
Bibkey:
Cite (ACL):
Mohammad Mohammadamini and Marie Tahon. 2026. LIUM Submission for IWSLT 2026 Low-resource Speech Translation Track. In Proceedings of the 23rd International Conference on Spoken Language Translation (IWSLT 2026), pages 124–131, San Diego, USA (in-person and online). Association for Computational Linguistics.
Cite (Informal):
LIUM Submission for IWSLT 2026 Low-resource Speech Translation Track (Mohammadamini & Tahon, IWSLT 2026)
Copy Citation:
PDF:
https://aclanthology.org/2026.iwslt-1.15.pdf