The Superficiality Bias: Community Votes and Answer Utility in Portuguese Health Question Answering

Carlos Henrique Santos Barros; Gustavo Figueredo Rodrigues de Sousa; Rogério Figueredo de Sousa

The Superficiality Bias: Community Votes and Answer Utility in Portuguese Health Question Answering

Carlos Henrique Santos Barros, Gustavo Figueredo Rodrigues de Sousa, Rogério Figueredo de Sousa

Abstract

Supervised models trained on community-labeled data have shown promise in Health Question Answering (HQA), but relying on “likes” as a proxy for clinical usefulness remains controversial. This work investigates the alignment between automated predictions and human perception in Portuguese HQA. Using a subset of the SaudeBR-QA corpus, we compare a Random Forest classifier against a controlled evaluation conducted by laypeople and healthcare professionals. Our results reveal a recurring divergence that we term Superficiality Bias: human evaluators frequently validate very brief answers, whereas the classifier often labels these cases as non-useful under its learned criteria. Rather than indicating that the model is inherently more clinically accurate, this pattern suggests a misalignment between community feedback and feature-driven utility judgments. We argue that crowd-based labels in medical domains should be treated cautiously and complemented with more rigorous annotation protocols.

Anthology ID:: 2026.propor-1.113
Volume:: Proceedings of the 17th International Conference on Computational Processing of Portuguese (PROPOR 2026) - Vol. 1
Month:: April
Year:: 2026
Address:: Salvador, Brazil
Editors:: Marlo Souza, Iria de-Dios-Flores, Diana Santos, Larissa Freitas, Jackson Wilke da Cruz Souza, Eugénio Ribeiro
Venue:: PROPOR
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 1074–1078
Language:
URL:: https://aclanthology.org/2026.propor-1.113/
DOI:
Bibkey:
Cite (ACL):: Carlos Henrique Santos Barros, Gustavo Figueredo Rodrigues de Sousa, and Rogério Figueredo de Sousa. 2026. The Superficiality Bias: Community Votes and Answer Utility in Portuguese Health Question Answering. In Proceedings of the 17th International Conference on Computational Processing of Portuguese (PROPOR 2026) - Vol. 1, pages 1074–1078, Salvador, Brazil. Association for Computational Linguistics.
Cite (Informal):: The Superficiality Bias: Community Votes and Answer Utility in Portuguese Health Question Answering (Barros et al., PROPOR 2026)
Copy Citation:
PDF:: https://aclanthology.org/2026.propor-1.113.pdf

PDF Cite Search Fix data