The Balancing Act: Unmasking and Alleviating ASR Biases in Portuguese

Ajinkya Kulkarni; Anna Tokareva; Rameez Qureshi; Miguel Couceiro

The Balancing Act: Unmasking and Alleviating ASR Biases in Portuguese

Ajinkya Kulkarni, Anna Tokareva, Rameez Qureshi, Miguel Couceiro

Abstract

In the field of spoken language understanding, systems like Whisper and Multilingual Massive Speech (MMS) have shown state-of-the-art performances. This study is dedicated to a comprehensive exploration of the Whisper and MMS systems, with a focus on assessing biases in automatic speech recognition (ASR) inherent to casual conversation speech specific to the Portuguese language. Our investigation encompasses various categories, including gender, age, skin tone color, and geo-location. Alongside traditional ASR evaluation metrics such as Word Error Rate (WER), we have incorporated p-value statistical significance for gender bias analysis. Furthermore, we extensively examine the impact of data distribution and empirically show that oversampling techniques alleviate such stereotypical biases. This research represents a pioneering effort in quantifying biases in the Portuguese language context through the application of MMS and Whisper, contributing to a better understanding of ASR systems’ performance in multilingual settings.

Anthology ID:: 2024.ltedi-1.4
Volume:: Proceedings of the Fourth Workshop on Language Technology for Equality, Diversity, Inclusion
Month:: March
Year:: 2024
Address:: St. Julian's, Malta
Editors:: Bharathi Raja Chakravarthi, Bharathi B, Paul Buitelaar, Thenmozhi Durairaj, György Kovács, Miguel Ángel García Cumbreras
Venues:: LTEDI | WS
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 31–40
Language:
URL:: https://aclanthology.org/2024.ltedi-1.4
DOI:
Bibkey:
Cite (ACL):: Ajinkya Kulkarni, Anna Tokareva, Rameez Qureshi, and Miguel Couceiro. 2024. The Balancing Act: Unmasking and Alleviating ASR Biases in Portuguese. In Proceedings of the Fourth Workshop on Language Technology for Equality, Diversity, Inclusion, pages 31–40, St. Julian's, Malta. Association for Computational Linguistics.
Cite (Informal):: The Balancing Act: Unmasking and Alleviating ASR Biases in Portuguese (Kulkarni et al., LTEDI-WS 2024)
Copy Citation:
PDF:: https://aclanthology.org/2024.ltedi-1.4.pdf

PDF Cite Search