Self-Edit: Fault-Aware Code Editor for Code Generation

Kechi Zhang; Zhuo Li; Jia Li; Ge Li; Zhi Jin

doi:10.18653/v1/2023.acl-long.45

Self-Edit: Fault-Aware Code Editor for Code Generation

Kechi Zhang, Zhuo Li, Jia Li, Ge Li, Zhi Jin

Abstract

Large language models (LLMs) have demonstrated an impressive ability to generate codes on competitive programming tasks. However, with limited sample numbers, LLMs still suffer from poor accuracy. Inspired by the process of human programming, we propose a generate-and-edit approach named Self-Edit that utilizes execution results of the generated code from LLMs to improve the code quality on the competitive programming task. We execute the generated code on the example test case provided in the question and wrap execution results into a supplementary comment. Utilizing this comment as guidance, our fault-aware code editor is employed to correct errors in the generated code. We perform extensive evaluations across two competitive programming datasets with nine different LLMs. Compared to directly generating from LLMs, our approach can improve the average of pass@1 by 89% on APPS-dev, 31% on APPS-test, and 48% on HumanEval over nine popular code generation LLMs with parameter sizes ranging from 110M to 175B. Compared to other post-processing methods, our method demonstrates superior accuracy and efficiency.

Anthology ID:: 2023.acl-long.45
Original:: 2023.acl-long.45v1
Version 2:: 2023.acl-long.45v2
Version 3:: 2023.acl-long.45v3
Volume:: Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:: July
Year:: 2023
Address:: Toronto, Canada
Editors:: Anna Rogers, Jordan Boyd-Graber, Naoaki Okazaki
Venue:: ACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 769–787
Language:
URL:: https://aclanthology.org/2023.acl-long.45
DOI:: 10.18653/v1/2023.acl-long.45
Bibkey:
Cite (ACL):: Kechi Zhang, Zhuo Li, Jia Li, Ge Li, and Zhi Jin. 2023. Self-Edit: Fault-Aware Code Editor for Code Generation. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 769–787, Toronto, Canada. Association for Computational Linguistics.
Cite (Informal):: Self-Edit: Fault-Aware Code Editor for Code Generation (Zhang et al., ACL 2023)
Copy Citation:
PDF:: https://aclanthology.org/2023.acl-long.45.pdf
Video:: https://aclanthology.org/2023.acl-long.45.mp4

PDF (v3) PDF (v1) Cite Search Video