One Voice, Many Tongues: Cross-Lingual Voice Cloning for Scientific Speech

Amanuel Gizachew Abebe, Yasmin Moslem


Abstract
Preserving a speaker’s voice identity while generating speech in a different language remains a fundamental challenge in spoken language technology, particularly in specialized domains such as scientific communication. In this paper, we address this challenge through our system submission to the International Conference on Spoken Language Translation (IWSLT 2026), the Cross-Lingual Voice Cloning shared task. First, we evaluate several state-of-the-art voice cloning models for cross-lingual speech generation of scientific texts in Arabic, Chinese, and French. Then, we build voice cloning systems based on the OmniVoice foundation model. We employ data augmentation via multi-model ensemble distillation from the ACL 60/60 corpus. We investigate the effect of using this synthetic data for fine-tuning, demonstrating improvements in intelligibility (WER and CER) and speaker similarity (SIM), with gains varying across languages.
Anthology ID:
2026.iwslt-1.25
Volume:
Proceedings of the 23rd International Conference on Spoken Language Translation (IWSLT 2026)
Month:
July
Year:
2026
Address:
San Diego, USA (in-person and online)
Editors:
Elizabeth Salesky, Antonios Anastasopoulos, Matteo Negri, Marcello Federico
Venues:
IWSLT | WS
SIG:
SIGSLT
Publisher:
Association for Computational Linguistics
Note:
Pages:
227–231
Language:
URL:
https://aclanthology.org/2026.iwslt-1.25/
DOI:
Bibkey:
Cite (ACL):
Amanuel Gizachew Abebe and Yasmin Moslem. 2026. One Voice, Many Tongues: Cross-Lingual Voice Cloning for Scientific Speech. In Proceedings of the 23rd International Conference on Spoken Language Translation (IWSLT 2026), pages 227–231, San Diego, USA (in-person and online). Association for Computational Linguistics.
Cite (Informal):
One Voice, Many Tongues: Cross-Lingual Voice Cloning for Scientific Speech (Abebe & Moslem, IWSLT 2026)
Copy Citation:
PDF:
https://aclanthology.org/2026.iwslt-1.25.pdf