Different Speech Translation Models Encode and Translate Speaker Gender Differently

Dennis Fucci; Marco Gaido; Matteo Negri; Luisa Bentivogli; André F. T. Martins; Giuseppe Attanasio

doi:10.18653/v1/2025.acl-short.78

Abstract

Recent studies on interpreting the hidden states of speech models have shown their ability to capture speaker-specific features, including gender. Does this finding also hold for speech translation (ST) models? If so, what are the implications for the speaker’s gender assignment in translation? We address these questions from an interpretability perspective, using probing methods to assess gender encoding across diverse ST models. Results on three language directions (English → French/Italian/Spanish) indicate that while traditional encoder-decoder models capture gender information, newer architectures—integrating a speech encoder with a machine translation system via adapters—do not. We also demonstrate that low gender encoding capabilities result in systems’ tendency toward a masculine default, a translation bias that is more pronounced in newer architectures.

Anthology ID: 2025.acl-short.78
Volume: Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)
Month: July
Year: 2025
Address: Vienna, Austria
Editors: Wanxiang Che, Joyce Nabende, Ekaterina Shutova, Mohammad Taher Pilehvar
Venue: ACL
Publisher: Association for Computational Linguistics
Pages: 1005–1019
URL: https://aclanthology.org/2025.acl-short.78/
DOI: 10.18653/v1/2025.acl-short.78
Cite (ACL): Dennis Fucci, Marco Gaido, Matteo Negri, Luisa Bentivogli, Andre Martins, and Giuseppe Attanasio. 2025. Different Speech Translation Models Encode and Translate Speaker Gender Differently. In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), pages 1005–1019, Vienna, Austria. Association for Computational Linguistics.
Cite (Informal): Different Speech Translation Models Encode and Translate Speaker Gender Differently (Fucci et al., ACL 2025)
PDF: https://aclanthology.org/2025.acl-short.78.pdf
