Gendered Stylistic Variation in Brazilian Portuguese Google Play Reviews: A Large-Scale Study

Tiago de Melo


Abstract
We study gender-associated stylistic variation in Brazilian Portuguese Google Play reviews. Using IBGE name frequencies, we infer binary gender from first names in 76.7M reviews (96 apps, 2011–2025), obtaining 22.25M high-confidence labels. Women-associated reviews show markedly higher paralinguistic expressivity (about 60% higher emoji density and more lengthening/punctuation), while lexical diversity (MTLD) is nearly identical across groups. Ratings are mostly positive, with men contributing relatively more 1-star reviews and women more 5-star reviews. These findings contribute to a deeper understanding of digital sociolinguistic behavior within the Brazilian context. We discuss limitations of name-based gender inference and future demographic extensions.
Anthology ID:
2026.propor-2.32
Volume:
Proceedings of the 17th International Conference on Computational Processing of Portuguese (PROPOR 2026) - Vol. 2
Month:
April
Year:
2026
Address:
Salvador, Brazil
Editors:
Marlo Souza, Iria de-Dios-Flores, Diana Santos, Larissa Freitas, Jackson Wilke da Cruz Souza, Eugénio Ribeiro
Venue:
PROPOR
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
238–246
Language:
URL:
https://aclanthology.org/2026.propor-2.32/
DOI:
Bibkey:
Cite (ACL):
Tiago de Melo. 2026. Gendered Stylistic Variation in Brazilian Portuguese Google Play Reviews: A Large-Scale Study. In Proceedings of the 17th International Conference on Computational Processing of Portuguese (PROPOR 2026) - Vol. 2, pages 238–246, Salvador, Brazil. Association for Computational Linguistics.
Cite (Informal):
Gendered Stylistic Variation in Brazilian Portuguese Google Play Reviews: A Large-Scale Study (Melo, PROPOR 2026)
Copy Citation:
PDF:
https://aclanthology.org/2026.propor-2.32.pdf