Speech-to-Speech Translation Pipelines for Conversations in Low-Resource Languages

Andrei Popescu-Belis, Alexis Allemann, Teo Ferrari, Gopal Krishnamani


Abstract
The popularity of automatic speech-to-speech translation for human conversations is growing, but the quality varies significantly depending on the language pair. In a context of community interpreting for low-resource languages, namely Turkish and Pashto to/from French, we collected fine-tuning and testing data, and compared systems using several automatic metrics (BLEU, COMET, and BLASER) and human assessments. The pipelines consist of automatic speech recognition, machine translation, and speech synthesis, with local models and cloud-based commercial ones. Some components have been fine-tuned on our data. We evaluated over 60 pipelines and determined the best one for each direction. We also found that the ranks of components are generally independent of the rest of the pipeline.
Anthology ID:
2025.mtsummit-2.3
Volume:
Proceedings of Machine Translation Summit XX: Volume 2
Month:
June
Year:
2025
Address:
Geneva, Switzerland
Editors:
Pierrette Bouillon, Johanna Gerlach, Sabrina Girletti, Lise Volkart, Raphael Rubino, Rico Sennrich, Samuel Läubli, Martin Volk, Miquel Esplà-Gomis, Vincent Vandeghinste, Helena Moniz, Sara Szoc
Venue:
MTSummit
SIG:
Publisher:
European Association for Machine Translation
Note:
Pages:
18–27
Language:
URL:
https://aclanthology.org/2025.mtsummit-2.3/
DOI:
Bibkey:
Cite (ACL):
Andrei Popescu-Belis, Alexis Allemann, Teo Ferrari, and Gopal Krishnamani. 2025. Speech-to-Speech Translation Pipelines for Conversations in Low-Resource Languages. In Proceedings of Machine Translation Summit XX: Volume 2, pages 18–27, Geneva, Switzerland. European Association for Machine Translation.
Cite (Informal):
Speech-to-Speech Translation Pipelines for Conversations in Low-Resource Languages (Popescu-Belis et al., MTSummit 2025)
Copy Citation:
PDF:
https://aclanthology.org/2025.mtsummit-2.3.pdf