Mohammad Mohammadamini
2026
Speech Translation and Metrics in 2026: Findings of the IWSLT Campaign
David Ifeoluwa Adelani | Victor Agostinelli | Antonios Anastasopoulos | Luisa Bentivogli | Ondřej Bojar | Sébastien Bratières | Marine Carpuat | Fabrício Carraro | Roldano Cattoni | Mauro Cettolo | Lizhong Chen | Marcello Federico | Marco Gaido | Mahendra Gupta | HyoJung Han | Ali Hatami | Lewis C. Howe | Dávid Javorský | Yejin Jeon | Marek Kasztelnik | Antoine Laurent | Danni Liu | Nam Luu | Min Ma | Dominik Macháček | Marie Maltais | Evgeny Matusov | John McCrae | Chutong Meng | Chandresh Kumar Maurya | Mohammad Mohammadamini | Yasmin Moslem | Kenton Murray | Satoshi Nakamura | Matteo Negri | Jan Niehues | Atul Kr. Ojha | John E. Ortega | Siqi Ouyang | Sara Papi | Peter Polák | Fabian Retkowski | Stephanny Sánchez | Beatrice Savoldi | Claytone Sikasote | Matthias Sperber | Sebastian Stüker | Katsuhito Sudoh | Marie Tahon | Marco Turchi | Alexander Waibel | Patrick Wilken | Rodolfo Joel Zevallos | Vilem Zouhar | Maike Züfle
Proceedings of the 23rd International Conference on Spoken Language Translation (IWSLT 2026)
David Ifeoluwa Adelani | Victor Agostinelli | Antonios Anastasopoulos | Luisa Bentivogli | Ondřej Bojar | Sébastien Bratières | Marine Carpuat | Fabrício Carraro | Roldano Cattoni | Mauro Cettolo | Lizhong Chen | Marcello Federico | Marco Gaido | Mahendra Gupta | HyoJung Han | Ali Hatami | Lewis C. Howe | Dávid Javorský | Yejin Jeon | Marek Kasztelnik | Antoine Laurent | Danni Liu | Nam Luu | Min Ma | Dominik Macháček | Marie Maltais | Evgeny Matusov | John McCrae | Chutong Meng | Chandresh Kumar Maurya | Mohammad Mohammadamini | Yasmin Moslem | Kenton Murray | Satoshi Nakamura | Matteo Negri | Jan Niehues | Atul Kr. Ojha | John E. Ortega | Siqi Ouyang | Sara Papi | Peter Polák | Fabian Retkowski | Stephanny Sánchez | Beatrice Savoldi | Claytone Sikasote | Matthias Sperber | Sebastian Stüker | Katsuhito Sudoh | Marie Tahon | Marco Turchi | Alexander Waibel | Patrick Wilken | Rodolfo Joel Zevallos | Vilem Zouhar | Maike Züfle
Proceedings of the 23rd International Conference on Spoken Language Translation (IWSLT 2026)
This paper reports on the outcomes of the shared tasks organized as part of the 23rd International Workshop on Spoken Language Translation (IWSLT). The workshop covered ten major challenges in spoken language translation, including speech-to-text translation for both high-resource and low-resource language pairs, customized speech translation, speech generation, instruction-following speech processing, and the evaluation of speech translation systems. The shared tasks received strong participation, with more than 30 teams submitting runs. This year’s edition broadened the range of tasks, placing particular emphasis on speech generation and evaluation metrics.
LIUM Submission for IWSLT 2026 Low-resource Speech Translation Track
Mohammad Mohammadamini | Marie Tahon
Proceedings of the 23rd International Conference on Spoken Language Translation (IWSLT 2026)
Mohammad Mohammadamini | Marie Tahon
Proceedings of the 23rd International Conference on Spoken Language Translation (IWSLT 2026)
This paper describes the LIUM submission to the IWSLT 2026 low-resource speech translation track. It proposes different data augmentation methods for low-resource speech-to-text translation, including two main pipelines: pseudo-labeling and speech synthesis. The goal is to generate parallel speech data in low-resource scenarios without relying on human-annotated speech translation data. Our submission focuses on Central Kurdish–English language pairs. The objective of this work is to explore the advantages and limitations of each data augmentation method. Our best results are obtained using the pseudo-labeling pipeline, achieving a BLEU score of 25.73 on the development set and 21.09 on the test set for Central Kurdish–English translation.
Fleurs-Badini: Translation and Recording Fleurs Dataset for Badini Variant of Northern Kurdish
Mohammad Mohammadamini | Dilgash Mohammed Salih Tayib | Dezheen H. Abdulazeez | Barzan Hussein Mohammed | Imad Saeed Sadeeq | Aveen Jalal Mohammed | Amera Ismail Melhum | Abuobaida Abdullah Dheyab
Proceedings of the 23rd International Conference on Spoken Language Translation (IWSLT 2026)
Mohammad Mohammadamini | Dilgash Mohammed Salih Tayib | Dezheen H. Abdulazeez | Barzan Hussein Mohammed | Imad Saeed Sadeeq | Aveen Jalal Mohammed | Amera Ismail Melhum | Abuobaida Abdullah Dheyab
Proceedings of the 23rd International Conference on Spoken Language Translation (IWSLT 2026)
Multilingual speech benchmarks such as the FLEURS benchmark have significantly advanced research across a wide range of languages. However, important dialects, including Badini Kurdish, remain underrepresented, limiting bechmarking in automatic speech recognition (ASR) and speech-to-text translation (S2TT). To address this limitation, this study introduces FLEURS-Badini, a dialect-focused extension designed to support research on Northern Kurdish (Badini). The dataset is constructed through a structured process of translation, recording, and validation, resulting in 5,224 utterances paired with their corresponding translated text. The data were collected from 45 speakers. To evaluate the dataset, baseline experiments are conducted using state-of-the-art models for both ASR and S2TT. The results indicate that ASR remains challenging, with the best performance achieved by the W2V-BERT CTC model, reaching a Word Error Rate (WER) of approximately 55% on the test set. Similarly, speech-to-text translation performance is limited, with BLEU scores 6.13 and 5.24 on dev and test sets. Overall, FLEURS-Badini expands multilingual coverage and provides a standardized foundation for evaluating ASR and speech translation systems in the Badini dialect.
2025
Kuvost: A Large-Scale Human-Annotated English to Central Kurdish Speech Translation Dataset Driven from English Common Voice
Mohammad Mohammadamini | Daban Jaff | Sara Jamal | Ibrahim Ahmed | Hawkar Omar | Darya Sabr | Marie Tahon | Antoine Laurent
Proceedings of the 22nd International Conference on Spoken Language Translation (IWSLT 2025)
Mohammad Mohammadamini | Daban Jaff | Sara Jamal | Ibrahim Ahmed | Hawkar Omar | Darya Sabr | Marie Tahon | Antoine Laurent
Proceedings of the 22nd International Conference on Spoken Language Translation (IWSLT 2025)
In this paper, we introduce the Kuvost, a large-scale English to Central Kurdish speech-to-text-translation (S2TT) dataset. This dataset includes 786k utterances derived from Common Voice 18, translated and revised by 230 volunteers into Central Kurdish. Encompassing 1,003 hours of translated speech, this dataset can play a groundbreaking role for Central Kurdish, which severely lacks public-domain resources for speech translation. Following the dataset division in Common Voice, there are 298k, 6,226, and 7,253 samples in the train, development, and test sets, respectively. The dataset is evaluated on end-to-end English-to-Kurdish S2TT using Whisper V3 Large and SeamlessM4T V2 Large models. The dataset is available under the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License https://huggingface.co/datasets/aranemini/kuvost.
2024
RoboVox: A Single/Multi-channel Far-field Speaker Recognition Benchmark for a Mobile Robot
Mohammad Mohammadamini | Driss Matrouf | Michael Rouvier | Jean-Francois Bonastre | Romain Serizel | Theophile Gonos
Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)
Mohammad Mohammadamini | Driss Matrouf | Michael Rouvier | Jean-Francois Bonastre | Romain Serizel | Theophile Gonos
Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)
In this paper, we introduce a new far-field speaker recognition benchmark called RoboVox. RoboVox is a French corpus recorded by a mobile robot. The files are recorded from different distances under severe acoustical conditions with the presence of several types of noise and reverberation. In addition to noise and reverberation, the robot’s internal noise acts as an extra additive noise. RoboVox can be used for both single-channel and multi-channel speaker recognition. In the evaluation protocols, we are considering both cases. The obtained results demonstrate a significant decline in performance in far-filed speaker recognition and urge the community to further research in this domain
2022
Far-Field Speaker Recognition Benchmark Derived From The DiPCo Corpus
Mickael Rouvier | Mohammad Mohammadamini
Proceedings of the Thirteenth Language Resources and Evaluation Conference
Mickael Rouvier | Mohammad Mohammadamini
Proceedings of the Thirteenth Language Resources and Evaluation Conference
In this paper, we present a far-field speaker verification benchmark derived from the publicly-available DiPCo corpus. This corpus comprise three different tasks that involve enrollment and test conditions with single- and/or multi-channels recordings. The main goal of this corpus is to foster research in far-field and multi-channel text-independent speaker verification. Also, it can be used for other speaker recognition tasks such as dereverberation, denoising and speech enhancement. In addition, we release a Kaldi and SpeechBrain system to facilitate further research. And we validate the evaluation design with a single-microphone state-of-the-art speaker recognition system (i.e. ResNet-101). The results show that the proposed tasks are very challenging. And we hope these resources will inspire the speech community to develop new methods and systems for this challenging domain.
Search
Fix author
Co-authors
- Marie Tahon 3
- Antoine Laurent 2
- Dezheen H. Abdulazeez 1
- David Ifeoluwa Adelani 1
- Victor Agostinelli 1
- Ibrahim Ahmed 1
- Antonios Anastasopoulos 1
- Luisa Bentivogli 1
- Ondřej Bojar 1
- Jean-François Bonastre 1
- Sébastien Bratières 1
- Marine Carpuat 1
- Fabrício Carraro 1
- Roldano Cattoni 1
- Mauro Cettolo 1
- Lizhong Chen 1
- Abuobaida Abdullah Dheyab 1
- Marcello Federico 1
- Marco Gaido 1
- Theophile Gonos 1
- Mahendra Gupta 1
- HyoJung Han 1
- Ali Hatami 1
- Lewis C. Howe 1
- Daban Jaff 1
- Sara Jamal 1
- Dávid Javorský 1
- Yejin Jeon 1
- Marek Kasztelnik 1
- Danni Liu 1
- Nam Luu 1
- Min Ma 1
- Dominik Macháček 1
- Marie Maltais 1
- Driss Matrouf 1
- Evgeny Matusov 1
- Chandresh Kumar Maurya 1
- John Philip McCrae 1
- Amera Ismail Melhum 1
- Chutong Meng 1
- Aveen Jalal Mohammed 1
- Barzan Hussein Mohammed 1
- Yasmin Moslem 1
- Kenton Murray 1
- Satoshi Nakamura 1
- Matteo Negri 1
- Jan Niehues 1
- Atul Kr. Ojha 1
- Hawkar Omar 1
- John E. Ortega 1
- Siqi Ouyang 1
- Sara Papi 1
- Peter Polák 1
- Fabian Retkowski 1
- Michael Rouvier 1
- Mickael Rouvier 1
- Darya Sabr 1
- Imad Saeed Sadeeq 1
- Beatrice Savoldi 1
- Romain Serizel 1
- Claytone Sikasote 1
- Matthias Sperber 1
- Sebastian Stüker 1
- Katsuhito Sudoh 1
- Stephanny Sánchez 1
- Dilgash Mohammed Salih Tayib 1
- Marco Turchi 1
- Alexander Waibel 1
- Patrick Wilken 1
- Rodolfo Zevallos 1
- Vilém Zouhar 1
- Maike Züfle 1