Yi-Hsiang Hung
2022
A Preliminary Study on Mandarin-Hakka neural machine translation using small-sized data
Yi-Hsiang Hung
|
Yi-Chin Huang
Proceedings of the 34th Conference on Computational Linguistics and Speech Processing (ROCLING 2022)
In this study, we implemented a machine translation system using the Convolutional Neural Network with Attention mechanism for translating Mandarin to Sixan-accent Hakka. Specifically, to cope with the different idioms or terms used between Northern and Southern Sixan-accent, we analyzed the corpus differences and lexicon definition, and then separated the various word usages for training exclusive models for each accent. Besides, since the collected Hakka corpora are relatively limited, the unseen words frequently occurred during real-world translation. In our system, we selected suitable thresholds for each model based on the model verification to reject non-suitable translated words. Then, by applying the proposed algorithm, which adopted the forced Hakka idioms/terms segmentation and the common Mandarin word substitution, the resultant translation sentences become more intelligible. Therefore, the proposed system achieved promising results using small-sized data. This system could be used for Hakka language teaching and also the front-end of Mandarin and Hakka code-switching speech synthesis systems.
2019
應用文脈分析於中英夾雜語音合成系統(Linguistic Analysis for English/Mandarin Speech Synthesis System)
Yi-Hsiang Hung
|
Yi-Chin Huang
|
Guang-Feng Deng
Proceedings of the 31st Conference on Computational Linguistics and Speech Processing (ROCLING 2019)
Search