FairQE: Multi-Agent Framework for Mitigating Gender Bias in Translation Quality Estimation

Jinhee Jang; Juhwan Choi; Dongjin Lee; Seunguk Yu; Youngbin Kim

FairQE: Multi-Agent Framework for Mitigating Gender Bias in Translation Quality Estimation

Jinhee Jang, Juhwan Choi, Dongjin Lee, Seunguk Yu, YoungBin Kim

Abstract

Quality Estimation (QE) aims to assess machine translation quality without reference translations, but recent studies have shown that existing QE models exhibit systematic gender bias. In particular, they tend to favor masculine realizations in gender-ambiguous contexts and may assign higher scores to gender-misaligned translations even when gender is explicitly specified. To address these issues, we propose FairQE, a multi-agent-based, fairness-aware QE framework that mitigates gender bias in both gender-ambiguous and gender-explicit scenarios. FairQE detects gender cues, generates gender-flipped translation variants, and combines conventional QE scores with LLM-based unbiased reasoning through a dynamic bias-aware aggregation mechanism. This design preserves the strengths of existing QE models while calibrating their gender-related biases in a plug-and-play manner. Extensive experiments across multiple gender bias evaluation settings demonstrate that FairQE consistently improves gender fairness over strong QE baselines. Moreover, under MQM-based meta-evaluation following the WMT 2023 Metrics Shared Task, FairQE achieves competitive or improved general QE performance. These results show that gender bias in QE can be effectively mitigated without sacrificing evaluation accuracy, enabling fairer and more reliable translation evaluation.

Anthology ID:: 2026.acl-long.1757
Volume:: Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:: ACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 37891–37911
Language:
URL:: https://aclanthology.org/2026.acl-long.1757/
DOI:
Bibkey:
Cite (ACL):: Jinhee Jang, Juhwan Choi, Dongjin Lee, Seunguk Yu, and YoungBin Kim. 2026. FairQE: Multi-Agent Framework for Mitigating Gender Bias in Translation Quality Estimation. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 37891–37911, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: FairQE: Multi-Agent Framework for Mitigating Gender Bias in Translation Quality Estimation (Jang et al., ACL 2026)
Copy Citation:
PDF:: https://aclanthology.org/2026.acl-long.1757.pdf
Checklist:: 2026.acl-long.1757.checklist.pdf

PDF Cite Search Checklist Fix data