BibTeX
@inproceedings{zalkikar-chandra-2025-measuring,
title = "Measuring Social Biases in Masked Language Models by Proxy of Prediction Quality",
author = "Zalkikar, Rahul and
Chandra, Kanchan",
editor = "Che, Wanxiang and
Nabende, Joyce and
Shutova, Ekaterina and
Pilehvar, Mohammad Taher",
booktitle = "Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)",
month = jul,
year = "2025",
address = "Vienna, Austria",
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/2025.acl-long.68/",
doi = "10.18653/v1/2025.acl-long.68",
pages = "1337--1361",
ISBN = "979-8-89176-251-0",
abstract = "Innovative transformer-based language models produce contextually-aware token embeddings and have achieved state-of-the-art performance for a variety of natural language tasks, but have been shown to encode unwanted biases for downstream applications. In this paper, we evaluate the social biases encoded by transformers trained with the masked language modeling objective using proposed proxy functions within an iterative masking experiment to measure the quality of transformer models' predictions and assess the preference of MLMs towards disadvantaged and advantaged groups. We find that all models encode concerning social biases. We compare bias estimations with those produced by other evaluation methods using benchmark datasets and assess their alignment with human annotated biases. We extend previous work by evaluating social biases introduced after retraining an MLM under the masked language modeling objective and find proposed measures produce more accurate and sensitive estimations of biases based on relative preference for biased sentences between models, while other methods tend to underestimate biases after retraining on sentences biased towards disadvantaged groups."
}

MODS XML
<?xml version="1.0" encoding="UTF-8"?>
<modsCollection xmlns="http://www.loc.gov/mods/v3">
  <mods ID="zalkikar-chandra-2025-measuring">
    <titleInfo>
      <title>Measuring Social Biases in Masked Language Models by Proxy of Prediction Quality</title>
    </titleInfo>
    <name type="personal">
      <namePart type="given">Rahul</namePart>
      <namePart type="family">Zalkikar</namePart>
      <role>
        <roleTerm authority="marcrelator" type="text">author</roleTerm>
      </role>
    </name>
    <name type="personal">
      <namePart type="given">Kanchan</namePart>
      <namePart type="family">Chandra</namePart>
      <role>
        <roleTerm authority="marcrelator" type="text">author</roleTerm>
      </role>
    </name>
    <originInfo>
      <dateIssued>2025-07</dateIssued>
    </originInfo>
    <typeOfResource>text</typeOfResource>
    <relatedItem type="host">
      <titleInfo>
        <title>Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)</title>
      </titleInfo>
      <name type="personal">
        <namePart type="given">Wanxiang</namePart>
        <namePart type="family">Che</namePart>
        <role>
          <roleTerm authority="marcrelator" type="text">editor</roleTerm>
        </role>
      </name>
      <name type="personal">
        <namePart type="given">Joyce</namePart>
        <namePart type="family">Nabende</namePart>
        <role>
          <roleTerm authority="marcrelator" type="text">editor</roleTerm>
        </role>
      </name>
      <name type="personal">
        <namePart type="given">Ekaterina</namePart>
        <namePart type="family">Shutova</namePart>
        <role>
          <roleTerm authority="marcrelator" type="text">editor</roleTerm>
        </role>
      </name>
      <name type="personal">
        <namePart type="given">Mohammad</namePart>
        <namePart type="given">Taher</namePart>
        <namePart type="family">Pilehvar</namePart>
        <role>
          <roleTerm authority="marcrelator" type="text">editor</roleTerm>
        </role>
      </name>
      <originInfo>
        <publisher>Association for Computational Linguistics</publisher>
        <place>
          <placeTerm type="text">Vienna, Austria</placeTerm>
        </place>
      </originInfo>
      <genre authority="marcgt">conference publication</genre>
      <identifier type="isbn">979-8-89176-251-0</identifier>
    </relatedItem>
    <abstract>Innovative transformer-based language models produce contextually-aware token embeddings and have achieved state-of-the-art performance for a variety of natural language tasks, but have been shown to encode unwanted biases for downstream applications. In this paper, we evaluate the social biases encoded by transformers trained with the masked language modeling objective using proposed proxy functions within an iterative masking experiment to measure the quality of transformer models’ predictions and assess the preference of MLMs towards disadvantaged and advantaged groups. We find that all models encode concerning social biases. We compare bias estimations with those produced by other evaluation methods using benchmark datasets and assess their alignment with human annotated biases. We extend previous work by evaluating social biases introduced after retraining an MLM under the masked language modeling objective and find proposed measures produce more accurate and sensitive estimations of biases based on relative preference for biased sentences between models, while other methods tend to underestimate biases after retraining on sentences biased towards disadvantaged groups.</abstract>
    <identifier type="citekey">zalkikar-chandra-2025-measuring</identifier>
    <identifier type="doi">10.18653/v1/2025.acl-long.68</identifier>
    <location>
      <url>https://aclanthology.org/2025.acl-long.68/</url>
    </location>
    <part>
      <date>2025-07</date>
      <extent unit="page">
        <start>1337</start>
        <end>1361</end>
      </extent>
    </part>
  </mods>
</modsCollection>

Endnote
%0 Conference Proceedings
%T Measuring Social Biases in Masked Language Models by Proxy of Prediction Quality
%A Zalkikar, Rahul
%A Chandra, Kanchan
%Y Che, Wanxiang
%Y Nabende, Joyce
%Y Shutova, Ekaterina
%Y Pilehvar, Mohammad Taher
%S Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
%D 2025
%8 July
%I Association for Computational Linguistics
%C Vienna, Austria
%@ 979-8-89176-251-0
%F zalkikar-chandra-2025-measuring
%X Innovative transformer-based language models produce contextually-aware token embeddings and have achieved state-of-the-art performance for a variety of natural language tasks, but have been shown to encode unwanted biases for downstream applications. In this paper, we evaluate the social biases encoded by transformers trained with the masked language modeling objective using proposed proxy functions within an iterative masking experiment to measure the quality of transformer models’ predictions and assess the preference of MLMs towards disadvantaged and advantaged groups. We find that all models encode concerning social biases. We compare bias estimations with those produced by other evaluation methods using benchmark datasets and assess their alignment with human annotated biases. We extend previous work by evaluating social biases introduced after retraining an MLM under the masked language modeling objective and find proposed measures produce more accurate and sensitive estimations of biases based on relative preference for biased sentences between models, while other methods tend to underestimate biases after retraining on sentences biased towards disadvantaged groups.
%R 10.18653/v1/2025.acl-long.68
%U https://aclanthology.org/2025.acl-long.68/
%U https://doi.org/10.18653/v1/2025.acl-long.68
%P 1337-1361

Markdown (Informal)
[Measuring Social Biases in Masked Language Models by Proxy of Prediction Quality](https://aclanthology.org/2025.acl-long.68/) (Zalkikar & Chandra, ACL 2025)
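
For readers who want a concrete starting point: the abstract describes measuring MLM prediction quality within an iterative masking experiment and comparing relative preference for paired biased sentences. Below is a minimal sketch of that general idea — pseudo-log-likelihood scoring by masking one token at a time — using the Hugging Face transformers API. The model choice, helper function, and example sentence pair are illustrative assumptions; this is not the paper's proposed proxy functions or evaluation protocol.

```python
import torch
from transformers import AutoTokenizer, AutoModelForMaskedLM

def iterative_masking_score(sentence, model, tokenizer):
    """Sum of log-probabilities the MLM assigns to each token of `sentence`
    when that token (and only that token) is masked. A higher score means
    the model 'prefers' the sentence; comparing scores for paired sentences
    gives a relative preference, loosely analogous to the abstract's setup."""
    enc = tokenizer(sentence, return_tensors="pt")
    input_ids = enc["input_ids"][0]
    total = 0.0
    model.eval()
    with torch.no_grad():
        for i in range(1, input_ids.size(0) - 1):  # skip [CLS] and [SEP]
            masked = input_ids.clone()
            true_id = masked[i].item()
            masked[i] = tokenizer.mask_token_id
            logits = model(input_ids=masked.unsqueeze(0)).logits[0, i]
            total += torch.log_softmax(logits, dim=-1)[true_id].item()
    return total

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModelForMaskedLM.from_pretrained("bert-base-uncased")

# Relative preference between a paired sentence set (illustrative sentences,
# not drawn from the paper's benchmark data):
for s in ["The doctor said he was busy.", "The doctor said she was busy."]:
    print(f"{iterative_masking_score(s, model, tokenizer):8.3f}  {s}")
```

Running the same comparison on two checkpoints of one model (e.g. before and after retraining under the MLM objective) would mirror, in spirit, the between-model relative-preference comparison the abstract describes.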