Title: SpeechT5: Unified-Modal Encoder-Decoder Pre-Training for Spoken Language Processing
Authors: Junyi Ao, Rui Wang, Long Zhou, Chengyi Wang, Shuo Ren, Yu Wu, Shujie Liu, Tom Ko, Qing Li, Yu Zhang, Zhihua Wei, Yao Qian, Jinyu Li, Furu Wei
Date: 2022-05
Resource type: text
Genre: conference publication
Published in: Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Editors: Smaranda Muresan, Preslav Nakov, Aline Villavicencio
Publisher: Association for Computational Linguistics
Location: Dublin, Ireland
Pages: 5723-5738
Identifier: ao-etal-2022-speecht5
DOI: 10.18653/v1/2022.acl-long.393
URL: https://aclanthology.org/2022.acl-long.393/