Whisper Finetuning For Hakka Recognition in Low Resource

Min Han Teng, Ci Dao Chen, You Ting Lin, Bing Jhih Huang


Abstract
We study automatic speech recognition (ASR) for Hakka, a low-resource language with substantial dialectal variation. Focusing on Zhaoan and Dapu, we fine-tune Whisper using Low-Rank Adaptation (LoRA) and apply data augmentation to mitigate data scarcity. Experiments show that LoRA combined with augmentation substantially improves cross-dialect recognition while maintaining parameter efficiency. Our results demonstrate the potential of lightweight adaptation to extend large-scale ASR systems to underrepresented languages, supporting the preservation of Hakka speech and orthography.
Anthology ID:
2025.rocling-main.52
Volume:
Proceedings of the 37th Conference on Computational Linguistics and Speech Processing (ROCLING 2025)
Month:
November
Year:
2025
Address:
National Taiwan University, Taipei City, Taiwan
Editors:
Kai-Wei Chang, Ke-Han Lu, Chih-Kai Yang, Zhi-Rui Tam, Wen-Yu Chang, Chung-Che Wang
Venue:
ROCLING
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
450–453
Language:
URL:
https://aclanthology.org/2025.rocling-main.52/
DOI:
Bibkey:
Cite (ACL):
Min Han Teng, Ci Dao Chen, You Ting Lin, and Bing Jhih Huang. 2025. Whisper Finetuning For Hakka Recognition in Low Resource. In Proceedings of the 37th Conference on Computational Linguistics and Speech Processing (ROCLING 2025), pages 450–453, National Taiwan University, Taipei City, Taiwan. Association for Computational Linguistics.
Cite (Informal):
Whisper Finetuning For Hakka Recognition in Low Resource (Teng et al., ROCLING 2025)
Copy Citation:
PDF:
https://aclanthology.org/2025.rocling-main.52.pdf