Hanling Yi, Feng Lin, Hongbin Li, Ning Peiyang, Xiaotian Yu, and Rong Xiao. 2024. Generation Meets Verification: Accelerating Large Language Model Inference with Smart Parallel Auto-Correct Decoding. In Findings of the Association for Computational Linguistics: ACL 2024, edited by Lun-Wei Ku, Andre Martins, and Vivek Srikumar, pages 5285-5299, Bangkok, Thailand, August 2024. Association for Computational Linguistics. Conference publication. Citation key: yi-etal-2024-generation. DOI: 10.18653/v1/2024.findings-acl.313. URL: https://aclanthology.org/2024.findings-acl.313/