Speculative Decoding: Exploiting Speculative Execution for Accelerating Seq2seq Generation Heming Xia author Tao Ge author Peiyi Wang author Si-Qing Chen author Furu Wei author Zhifang Sui author 2023-12 text Findings of the Association for Computational Linguistics: EMNLP 2023 Houda Bouamor editor Juan Pino editor Kalika Bali editor Association for Computational Linguistics Singapore conference publication xia-etal-2023-speculative 10.18653/v1/2023.findings-emnlp.257 https://aclanthology.org/2023.findings-emnlp.257/ 2023-12 3909 3925