Evaluating the Impact of Stereotypes and Language Combinations on Gender Bias Occurrence in NMT Generic Systems

Bertille Triboulet, Pierrette Bouillon


Abstract
Machine translation, and more specifically neural machine translation (NMT), has been shown in recent years to be subject to gender bias. Many studies have focused on evaluating and reducing this phenomenon, mainly through the analysis of the translation of occupational nouns within the same type of language combinations. In this paper, we build a test set similar to those used in previous studies to investigate how stereotypes and the nature of the language combination (formed with English, French and Italian) influence the occurrence of gender bias in NMT. In line with previous studies, we confirm that stereotypes are a major source of gender bias, especially in female contexts, and we observe bias even in language combinations that have traditionally been less examined.
Anthology ID:
2023.ltedi-1.9
Volume:
Proceedings of the Third Workshop on Language Technology for Equality, Diversity and Inclusion
Month:
September
Year:
2023
Address:
Varna, Bulgaria
Editors:
Bharathi R. Chakravarthi, B. Bharathi, Josephine Griffith, Kalika Bali, Paul Buitelaar
Venues:
LTEDI | WS
Publisher:
INCOMA Ltd., Shoumen, Bulgaria
Pages:
62–70
URL:
https://aclanthology.org/2023.ltedi-1.9
Cite (ACL):
Bertille Triboulet and Pierrette Bouillon. 2023. Evaluating the Impact of Stereotypes and Language Combinations on Gender Bias Occurrence in NMT Generic Systems. In Proceedings of the Third Workshop on Language Technology for Equality, Diversity and Inclusion, pages 62–70, Varna, Bulgaria. INCOMA Ltd., Shoumen, Bulgaria.
Cite (Informal):
Evaluating the Impact of Stereotypes and Language Combinations on Gender Bias Occurrence in NMT Generic Systems (Triboulet & Bouillon, LTEDI-WS 2023)
PDF:
https://aclanthology.org/2023.ltedi-1.9.pdf