Martin Wessel
2024
Beyond the Surface: Spurious Cues in Automatic Media Bias Detection
Martin Wessel
|
Tomáš Horych
Proceedings of the Fourth Workshop on Language Technology for Equality, Diversity, Inclusion
This study investigates the robustness and generalization of transformer-based models for automatic media bias detection. We explore the behavior of current bias classifiers by analyzing feature attributions and stress-testing with adversarial datasets. The findings reveal a disproportionate focus on rare but strongly connotated words, suggesting a rather superficial understanding of linguistic bias and challenges in contextual interpretation. This problem is further highlighted by inconsistent bias assessment when stress-tested with different entities and minorities. Enhancing automatic media bias detection models is critical to improving inclusivity in media, ensuring balanced and fair representation of diverse perspectives.
Search