Michal Měchura


2022

pdf bib
A Taxonomy of Bias-Causing Ambiguities in Machine Translation
Michal Měchura
Proceedings of the 4th Workshop on Gender Bias in Natural Language Processing (GeBNLP)

This paper introduces a taxonomy of phenomena which cause bias in machine translation, covering gender bias (people being male and/or female), number bias (singular you versus plural you) and formality bias (informal you versus formal you). Our taxonomy is a formalism for describing situations in machine translation when the source text leaves some of these properties unspecified (eg. does not say whether doctor is male or female) but the target language requires the property to be specified (eg. because it does not have a gender-neutral word for doctor). The formalism described here is used internally by a web-based tool we have built for detecting and correcting bias in the output of any machine translator.
Search
Co-authors
    Venues
    Fix data