Intervention-Based Alignment of Code Search with Execution Feedback

Hojae Han; Minsoo Kim; Seung-won Hwang; Nan Duan; Shuai Lu

doi:10.18653/v1/2023.findings-emnlp.148

Intervention-Based Alignment of Code Search with Execution Feedback

Hojae Han, Minsoo Kim, Seung-won Hwang, Nan Duan, Shuai Lu

Abstract

One of the fundamental goals in code search is to retrieve a functionally correct code for a given natural language query. As annotating for correctness requires executing test cases (i.e. obtaining execution feedback), existing code search training datasets approximate text-code co-occurrences as positive execution feedback. However, this approximation may misalign models’ retrieval decisions from ground-truth correctness. To address such limitation, we propose Code Intervention-based Reinforcement Learning (CIRL) that perturbs training code to result in misalignment (i.e. code intervention), then tests models’ decisions and corrects them with the execution feedback by reinforcement learning. The first technical contribution of CIRL is to induce the execution feedback from perturbation, without actual execution. Secondly, CIRL introduces structural perturbations using abstract syntax trees, going beyond simple lexical changes. Experimental results on various datasets demonstrate the effectiveness of CIRL compared to conventional approaches.

Anthology ID:: 2023.findings-emnlp.148
Volume:: Findings of the Association for Computational Linguistics: EMNLP 2023
Month:: December
Year:: 2023
Address:: Singapore
Editors:: Houda Bouamor, Juan Pino, Kalika Bali
Venue:: Findings
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 2241–2263
Language:
URL:: https://aclanthology.org/2023.findings-emnlp.148
DOI:: 10.18653/v1/2023.findings-emnlp.148
Bibkey:
Cite (ACL):: Hojae Han, Minsoo Kim, Seung-won Hwang, Nan Duan, and Shuai Lu. 2023. Intervention-Based Alignment of Code Search with Execution Feedback. In Findings of the Association for Computational Linguistics: EMNLP 2023, pages 2241–2263, Singapore. Association for Computational Linguistics.
Cite (Informal):: Intervention-Based Alignment of Code Search with Execution Feedback (Han et al., Findings 2023)
Copy Citation:
PDF:: https://aclanthology.org/2023.findings-emnlp.148.pdf

PDF Cite Search