Danielle Bragg
2024
Studying and Mitigating Biases in Sign Language Understanding Models
Katherine Atwell
|
Danielle Bragg
|
Malihe Alikhani
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing
Ensuring that the benefits of sign language technologies are distributed equitably among all community members is crucial. Thus, it is important to address potential biases and inequities that may arise from the design or use of these resources. Crowd-sourced sign language datasets, such as the ASL Citizen dataset, are great resources for improving accessibility and preserving linguistic diversity, but they must be used thoughtfully to avoid reinforcing existing biases.In this work, we utilize the rich information about participant demographics and lexical features present in the ASL Citizen dataset to study and document the biases that may result from models trained on crowd-sourced sign datasets. Further, we apply several bias mitigation techniques during model training, and find that these techniques reduce performance disparities without decreasing accuracy. With the publication of this work, we release the demographic information about the participants in the ASL Citizen dataset to encourage future bias mitigation work in this space.
ASL STEM Wiki: Dataset and Benchmark for Interpreting STEM Articles
Kayo Yin
|
Chinmay Singh
|
Fyodor Minakov
|
Vanessa Milan
|
Hal Daumé Iii
|
Cyril Zhang
|
Alex Lu
|
Danielle Bragg
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing
Deaf and hard-of-hearing (DHH) students face significant barriers in accessing science, technology, engineering, and mathematics (STEM) education, notably due to the scarcity of STEM resources in signed languages. To help address this, we introduce ASL STEM Wiki: a parallel corpus of 254 Wikipedia articles on STEM topics in English, interpreted into over 300 hours of American Sign Language (ASL). ASL STEM Wiki is the first continuous signing dataset focused on STEM, facilitating the development of AI resources for STEM education in ASL.We identify several use cases of ASL STEM Wiki with human-centered applications. For example, because this dataset highlights the frequent use of fingerspelling for technical concepts, which inhibits DHH students’ ability to learn,we develop models to identify fingerspelled words—which can later be used to query for appropriate ASL signs to suggest to interpreters.
Search
Co-authors
- Katherine Atwell 1
- Malihe Alikhani 1
- Kayo Yin 1
- Chinmay Singh 1
- Fyodor Minakov 1
- show all...