Leveraging Weak Segment Labels for Robust Automated Speaking Assessment in Read-Aloud Tasks

Yue-Yang He; Berlin Chen

Leveraging Weak Segment Labels for Robust Automated Speaking Assessment in Read-Aloud Tasks

Abstract

Automated speaking assessment (ASA) has become a crucial component in computer-assisted language learning, providing scalable, objective, and timely feedback to second-language learners. While early ASA systems relied on hand-crafted features and shallow classifiers, recent advances in self-supervised learning (SSL) have enabled richer representations for both text and speech, improving assessment accuracy. Despite these advances, challenges remain in evaluating long speech responses, due to limited labeled data, class imbalance, and the importance of pronunciation clarity and fluency, especially for read-aloud tasks. In this work, we propose a segment-based ASA framework leveraging WhisperX to split long responses into shorter fragments, generate weak labels from holistic scores, and aggregate segment-level predictions to obtain final proficiency scores. Experiments on the GEPT corpus demonstrate that our framework outperforms baseline holistic models, generalizes robustly to unseen prompts and speakers, and provides diagnostic insights at both segment and response levels.

Anthology ID:: 2025.rocling-main.18
Volume:: Proceedings of the 37th Conference on Computational Linguistics and Speech Processing (ROCLING 2025)
Month:: November
Year:: 2025
Address:: National Taiwan University, Taipei City, Taiwan
Editors:: Kai-Wei Chang, Ke-Han Lu, Chih-Kai Yang, Zhi-Rui Tam, Wen-Yu Chang, Chung-Che Wang
Venue:: ROCLING
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 146–152
Language:
URL:: https://aclanthology.org/2025.rocling-main.18/
DOI:
Bibkey:
Cite (ACL):: Yue-Yang He and Berlin Chen. 2025. Leveraging Weak Segment Labels for Robust Automated Speaking Assessment in Read-Aloud Tasks. In Proceedings of the 37th Conference on Computational Linguistics and Speech Processing (ROCLING 2025), pages 146–152, National Taiwan University, Taipei City, Taiwan. Association for Computational Linguistics.
Cite (Informal):: Leveraging Weak Segment Labels for Robust Automated Speaking Assessment in Read-Aloud Tasks (He & Chen, ROCLING 2025)
Copy Citation:
PDF:: https://aclanthology.org/2025.rocling-main.18.pdf

PDF Cite Search Fix data