Wenzhou Dialect Speech to Mandarin Text Conversion

Zhipeng Gao; Akihiro Tamura; Tsuneo Kato

doi:10.18653/v1/2025.loresmt-1.5

Wenzhou Dialect Speech to Mandarin Text Conversion

Zhipeng Gao, Akihiro Tamura, Tsuneo Kato

Abstract

The Wenzhou dialect is a Chinese dialect that is significantly distinct from Mandarin, the official language of China. It is among the most complex Chinese dialects and is nearly incomprehensible to people from regions such as Northern China, thereby creating substantial communication barriers. Therefore, the conversion between the Wenzhou dialect and Mandarin is essential to facilitate communication between Wenzhou dialect speakers and those from other Chinese regions. However, as a low-resource language, the Wenzhou dialect lacks publicly available datasets, and such conversion technologies have not been extensively researched. Thus, in this study, we create a parallel dataset containing Wenzhou dialect speech and the corresponding Mandarin text and build benchmark models for Wenzhou dialect speech-to-Mandarin text conversion. In particular, we fine-tune two self-supervised learning-based pretrained models, that is, TeleSpeech-ASR1.0 and Wav2Vec2-XLS-R, with our training dataset and report their performance on our test dataset as baselines for future research.

Anthology ID:: 2025.loresmt-1.5
Volume:: Proceedings of the Eighth Workshop on Technologies for Machine Translation of Low-Resource Languages (LoResMT 2025)
Month:: May
Year:: 2025
Address:: Albuquerque, New Mexico, U.S.A.
Editors:: Atul Kr. Ojha, Chao-hong Liu, Ekaterina Vylomova, Flammie Pirinen, Jonathan Washington, Nathaniel Oco, Xiaobing Zhao
Venues:: LoResMT | WS
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 36–43
Language:
URL:: https://aclanthology.org/2025.loresmt-1.5/
DOI:: 10.18653/v1/2025.loresmt-1.5
Bibkey:
Cite (ACL):: Zhipeng Gao, Akihiro Tamura, and Tsuneo Kato. 2025. Wenzhou Dialect Speech to Mandarin Text Conversion. In Proceedings of the Eighth Workshop on Technologies for Machine Translation of Low-Resource Languages (LoResMT 2025), pages 36–43, Albuquerque, New Mexico, U.S.A.. Association for Computational Linguistics.
Cite (Informal):: Wenzhou Dialect Speech to Mandarin Text Conversion (Gao et al., LoResMT 2025)
Copy Citation:
PDF:: https://aclanthology.org/2025.loresmt-1.5.pdf

PDF Cite Search Fix data