Chun Hsin Wu
Also published as: Chun-Hsin Wu
2022
Taiwanese-Accented Mandarin and English Multi-Speaker Talking-Face Synthesis System
Chia-Hsuan Lin
|
Jian-Peng Liao
|
Cho-Chun Hsieh
|
Kai-Chun Liao
|
Chun-Hsin Wu
Proceedings of the 34th Conference on Computational Linguistics and Speech Processing (ROCLING 2022)
This paper proposes a multi-speaker talking-face synthesis system. The system incorporates voice cloning and lip-syncing technology to achieve text-to-talking-face generation by acquiring audio and video clips of any speaker and using zero-shot transfer learning. In addition, we used open-source corpora to train several Taiwanese-accented models and proposed using Mandarin Phonetic Symbols (Bopomofo) as the character embedding of the synthesizer to improve the system’s ability to synthesize Chinese-English code-switched sentences. Through our system, users can create rich applications. Also, the research on this technology is novel in the audiovisual speech synthesis field.
2004
仿趙氏音高尺度之基週軌跡正規化方法及其應用 (A Pitch-Contour Normalization Method Following Zhao’s Pitch Scale and Its Application) [In Chinese]
Hung-yan Gu
|
Hsiao-Fen Chang
|
Chun Hsin Wu
Proceedings of the 16th Conference on Computational Linguistics and Speech Processing
Search
Co-authors
- Hung-Yan Gu 1
- Hsiao-Fen Chang 1
- Chia-Hsuan Lin 1
- Jian-Peng Liao 1
- Cho-Chun Hsieh 1
- show all...