Decoding at the Speed of Thought: Harnessing Parallel Decoding of Lexical Units for LLMs
Authors: Chenxi Sun, Hongzhi Zhang, Zijia Lin, Jingyuan Zhang, Fuzheng Zhang, Zhongyuan Wang, Bin Chen, Chengru Song, Di Zhang, Kun Gai, Deyi Xiong
Published in: Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)
Editors: Nicoletta Calzolari, Min-Yen Kan, Veronique Hoste, Alessandro Lenci, Sakriani Sakti, Nianwen Xue
Publisher: ELRA and ICCL
Location: Torino, Italia
Date: 2024-05
Type: conference publication
Pages: 4476-4487
Citation key: sun-etal-2024-decoding
URL: https://aclanthology.org/2024.lrec-main.401/