Gender, names and other mysteries: Towards the ambiguous for gender-inclusive translation

Danielle Saunders, Katrina Olsen


Abstract
The vast majority of work on gender in MT focuses on ‘unambiguous’ inputs, where gender markers in the source language are expected to be resolved in the output. Conversely, this paper explores the widespread case where the source sentence lacks explicit gender markers, but the target sentence contains them due to richer grammatical gender. We particularly focus on inputs containing person names. Investigating such sentence pairs casts a new light on research into MT gender bias and its mitigation. We find that many name-gender co-occurrences in MT data are not resolvable with ‘unambiguous gender’ in the source language, and that gender-ambiguous examples can make up a large proportion of training examples. From this, we discuss potential steps toward gender-inclusive translation which accepts the ambiguity in both gender and translation.
Anthology ID:
2023.gitt-1.8
Volume:
Proceedings of the First Workshop on Gender-Inclusive Translation Technologies
Month:
June
Year:
2023
Address:
Tampere, Finland
Editors:
Eva Vanmassenhove, Beatrice Savoldi, Luisa Bentivogli, Joke Daems, Janiça Hackenbuchner
Venue:
GITT
Publisher:
European Association for Machine Translation
Pages:
85–93
URL:
https://aclanthology.org/2023.gitt-1.8
Cite (ACL):
Danielle Saunders and Katrina Olsen. 2023. Gender, names and other mysteries: Towards the ambiguous for gender-inclusive translation. In Proceedings of the First Workshop on Gender-Inclusive Translation Technologies, pages 85–93, Tampere, Finland. European Association for Machine Translation.
Cite (Informal):
Gender, names and other mysteries: Towards the ambiguous for gender-inclusive translation (Saunders & Olsen, GITT 2023)
PDF:
https://aclanthology.org/2023.gitt-1.8.pdf