Exploring the Impact of Training Data Distribution and Subword Tokenization on Gender Bias in Machine Translation Bar Iluz author Tomasz Limisiewicz author Gabriel Stanovsky author David Mareček author 2023-11 text Proceedings of the 13th International Joint Conference on Natural Language Processing and the 3rd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics (Volume 1: Long Papers) Jong C Park editor Yuki Arase editor Baotian Hu editor Wei Lu editor Derry Wijaya editor Ayu Purwarianti editor Adila Alfa Krisnadhi editor Association for Computational Linguistics Nusa Dua, Bali conference publication iluz-etal-2023-exploring 10.18653/v1/2023.ijcnlp-main.57 https://aclanthology.org/2023.ijcnlp-main.57/ 2023-11 885 896