Alexander Waibel
Other people with similar names: Alex Waibel
Unverified author pages with similar names: Alexander Waibel
2026
KIT’s Submission to Cross-Lingual Voice Cloning in IWSLT 2026
Seymanur Akti | Alexander Waibel
Proceedings of the 23rd International Conference on Spoken Language Translation (IWSLT 2026)
Seymanur Akti | Alexander Waibel
Proceedings of the 23rd International Conference on Spoken Language Translation (IWSLT 2026)
Cross-lingual voice cloning aims to generate speech in a target language while preserving speaker identity from a source-language reference. This task is central to speech translation and is the focus of the IWSLT 2026 Cross-Lingual Voice Cloning track. A key challenge is maintaining intelligibility and naturalness in the presence of accent variation and domain-specific vocabulary. We build on a multilingual text-to-speech model, FishAudio-S2-Pro, and introduce language tag prompting to improve language control and reduce accent leakage. We further apply reinforcement learning (RL) fine-tuning for task adaptation and observe improvements in intelligibility. Finally, we propose a reference-conditioned lexical matching method that improves pronunciation of domain-specific terms when lexical overlap is present. Results show that language prompting provides the largest gains, while lexical matching yields consistent improvements on matched subsets.
Speech Translation and Metrics in 2026: Findings of the IWSLT Campaign
David Ifeoluwa Adelani | Victor Agostinelli | Antonios Anastasopoulos | Luisa Bentivogli | Ondřej Bojar | Sébastien Bratières | Marine Carpuat | Fabrício Carraro | Roldano Cattoni | Mauro Cettolo | Lizhong Chen | Marcello Federico | Marco Gaido | Mahendra Gupta | HyoJung Han | Ali Hatami | Lewis C. Howe | Dávid Javorský | Yejin Jeon | Marek Kasztelnik | Antoine Laurent | Danni Liu | Nam Luu | Min Ma | Dominik Macháček | Marie Maltais | Evgeny Matusov | John McCrae | Chutong Meng | Chandresh Kumar Maurya | Mohammad Mohammadamini | Yasmin Moslem | Kenton Murray | Satoshi Nakamura | Matteo Negri | Jan Niehues | Atul Kr. Ojha | John E. Ortega | Siqi Ouyang | Sara Papi | Peter Polák | Fabian Retkowski | Stephanny Sánchez | Beatrice Savoldi | Claytone Sikasote | Matthias Sperber | Sebastian Stüker | Katsuhito Sudoh | Marie Tahon | Marco Turchi | Alexander Waibel | Patrick Wilken | Rodolfo Joel Zevallos | Vilem Zouhar | Maike Züfle
Proceedings of the 23rd International Conference on Spoken Language Translation (IWSLT 2026)
David Ifeoluwa Adelani | Victor Agostinelli | Antonios Anastasopoulos | Luisa Bentivogli | Ondřej Bojar | Sébastien Bratières | Marine Carpuat | Fabrício Carraro | Roldano Cattoni | Mauro Cettolo | Lizhong Chen | Marcello Federico | Marco Gaido | Mahendra Gupta | HyoJung Han | Ali Hatami | Lewis C. Howe | Dávid Javorský | Yejin Jeon | Marek Kasztelnik | Antoine Laurent | Danni Liu | Nam Luu | Min Ma | Dominik Macháček | Marie Maltais | Evgeny Matusov | John McCrae | Chutong Meng | Chandresh Kumar Maurya | Mohammad Mohammadamini | Yasmin Moslem | Kenton Murray | Satoshi Nakamura | Matteo Negri | Jan Niehues | Atul Kr. Ojha | John E. Ortega | Siqi Ouyang | Sara Papi | Peter Polák | Fabian Retkowski | Stephanny Sánchez | Beatrice Savoldi | Claytone Sikasote | Matthias Sperber | Sebastian Stüker | Katsuhito Sudoh | Marie Tahon | Marco Turchi | Alexander Waibel | Patrick Wilken | Rodolfo Joel Zevallos | Vilem Zouhar | Maike Züfle
Proceedings of the 23rd International Conference on Spoken Language Translation (IWSLT 2026)
This paper reports on the outcomes of the shared tasks organized as part of the 23rd International Workshop on Spoken Language Translation (IWSLT). The workshop covered ten major challenges in spoken language translation, including speech-to-text translation for both high-resource and low-resource language pairs, customized speech translation, speech generation, instruction-following speech processing, and the evaluation of speech translation systems. The shared tasks received strong participation, with more than 30 teams submitting runs. This year’s edition broadened the range of tasks, placing particular emphasis on speech generation and evaluation metrics.
Multilingual Long-Form Speech Instruction Following: KIT’s Submission to IWSLT 2026
Enes Yavuz Ugan | Maike Züfle | Yuka Ko | Supriti Sinhamahapatra | Fabian Retkowski | Seymanur Akti | Jan Niehues | Alexander Waibel
Proceedings of the 23rd International Conference on Spoken Language Translation (IWSLT 2026)
Enes Yavuz Ugan | Maike Züfle | Yuka Ko | Supriti Sinhamahapatra | Fabian Retkowski | Seymanur Akti | Jan Niehues | Alexander Waibel
Proceedings of the 23rd International Conference on Spoken Language Translation (IWSLT 2026)
With the advent of Large Language Models, single-task and token-based multi-task models have evolved into instruction-based systems that infer task and target language implicitly from natural language prompts. This trend is reflected in IWSLT’s Instruction Following Track, which this year introduced new tasks including an unknown surprise task, posing a genuine challenge against overfitting to known tasks. We present KIT’s submission to the Long and Short Instruction Following tracks in the unconstrained setting. Our approach combines a general data augmentation pipeline that converts short-form corpora into long-form training data through segment concatenation, LLM-based label generation, and cross-lingual translation, yielding over 1M instances across six tasks and four languages. We further show that likelihood-based re-ranking, while highly effective for ASR, systematically degrades semantic tasks by spuriously selecting candidates generated from segmented audio processing rather than holistic long-form inference, a failure mode resolved by combining likelihood with Minimum Bayes Risk decoding.
Search
Fix author
Co-authors
- Seymanur Akti 2
- Jan Niehues 2
- Fabian Retkowski 2
- Maike Züfle 2
- David Ifeoluwa Adelani 1
- Victor Agostinelli 1
- Antonios Anastasopoulos 1
- Luisa Bentivogli 1
- Ondřej Bojar 1
- Sébastien Bratières 1
- Marine Carpuat 1
- Fabrício Carraro 1
- Roldano Cattoni 1
- Mauro Cettolo 1
- Lizhong Chen 1
- Marcello Federico 1
- Marco Gaido 1
- Mahendra Gupta 1
- HyoJung Han 1
- Ali Hatami 1
- Lewis C. Howe 1
- Dávid Javorský 1
- Yejin Jeon 1
- Marek Kasztelnik 1
- Yuka Ko 1
- Antoine Laurent 1
- Danni Liu 1
- Nam Luu 1
- Min Ma 1
- Dominik Macháček 1
- Marie Maltais 1
- Evgeny Matusov 1
- Chandresh Kumar Maurya 1
- John Philip McCrae 1
- Chutong Meng 1
- Mohammad Mohammadamini 1
- Yasmin Moslem 1
- Kenton Murray 1
- Satoshi Nakamura 1
- Matteo Negri 1
- Atul Kr. Ojha 1
- John E. Ortega 1
- Siqi Ouyang 1
- Sara Papi 1
- Peter Polák 1
- Beatrice Savoldi 1
- Claytone Sikasote 1
- Supriti Sinhamahapatra 1
- Matthias Sperber 1
- Sebastian Stüker 1
- Katsuhito Sudoh 1
- Stephanny Sánchez 1
- Marie Tahon 1
- Marco Turchi 1
- Enes Yavuz Ugan 1
- Patrick Wilken 1
- Rodolfo Zevallos 1
- Vilém Zouhar 1