Speech Recognition for Low-resource Languages: A Comparative Study on Hakka Han Characters and Romanization

Yu-Hsiang Cheng, Yi-Syuan Wu


Abstract
This study focuses on speech recognition for low-resource languages, with Hakka as the case study. Since there is currently a lack of dedicated speech models for Taiwanese Southern Min, Hakka, and indigenous languages, we adopt OpenAI Whisper-Medium as the base model and apply Low-Rank Adaptation (LoRA) for fine-tuning. Two models with different output forms were developed: a Hakka character-based model and a Hakka phonetic-based model. The experimental dataset contains approximately 80 hours of speech, covering the Dapu and Zhao’an dialects, and the models were evaluated using Character Error Rate (CER) and Word Error Rate (WER).
Anthology ID:
2025.rocling-main.49
Volume:
Proceedings of the 37th Conference on Computational Linguistics and Speech Processing (ROCLING 2025)
Month:
November
Year:
2025
Address:
National Taiwan University, Taipei City, Taiwan
Editors:
Kai-Wei Chang, Ke-Han Lu, Chih-Kai Yang, Zhi-Rui Tam, Wen-Yu Chang, Chung-Che Wang
Venue:
ROCLING
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
435–440
Language:
URL:
https://aclanthology.org/2025.rocling-main.49/
DOI:
Bibkey:
Cite (ACL):
Yu-Hsiang Cheng and Yi-Syuan Wu. 2025. Speech Recognition for Low-resource Languages: A Comparative Study on Hakka Han Characters and Romanization. In Proceedings of the 37th Conference on Computational Linguistics and Speech Processing (ROCLING 2025), pages 435–440, National Taiwan University, Taipei City, Taiwan. Association for Computational Linguistics.
Cite (Informal):
Speech Recognition for Low-resource Languages: A Comparative Study on Hakka Han Characters and Romanization (Cheng & Wu, ROCLING 2025)
Copy Citation:
PDF:
https://aclanthology.org/2025.rocling-main.49.pdf