Retrieval-Based Neural Code Generation

Shirley Anugrah Hayati; Raphael Olivier; Pravalika Avvaru; Pengcheng Yin; Anthony Tomasic; Graham Neubig

doi:10.18653/v1/D18-1111

Retrieval-Based Neural Code Generation

Shirley Anugrah Hayati, Raphael Olivier, Pravalika Avvaru, Pengcheng Yin, Anthony Tomasic, Graham Neubig

Abstract

In models to generate program source code from natural language, representing this code in a tree structure has been a common approach. However, existing methods often fail to generate complex code correctly due to a lack of ability to memorize large and complex structures. We introduce RECODE, a method based on subtree retrieval that makes it possible to explicitly reference existing code examples within a neural code generation model. First, we retrieve sentences that are similar to input sentences using a dynamic-programming-based sentence similarity scoring method. Next, we extract n-grams of action sequences that build the associated abstract syntax tree. Finally, we increase the probability of actions that cause the retrieved n-gram action subtree to be in the predicted code. We show that our approach improves the performance on two code generation tasks by up to +2.6 BLEU.

Anthology ID:: D18-1111
Volume:: Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
Month:: October-November
Year:: 2018
Address:: Brussels, Belgium
Editors:: Ellen Riloff, David Chiang, Julia Hockenmaier, Jun’ichi Tsujii
Venue:: EMNLP
SIG:: SIGDAT
Publisher:: Association for Computational Linguistics
Note:
Pages:: 925–930
Language:
URL:: https://aclanthology.org/D18-1111/
DOI:: 10.18653/v1/D18-1111
Bibkey:
Cite (ACL):: Shirley Anugrah Hayati, Raphael Olivier, Pravalika Avvaru, Pengcheng Yin, Anthony Tomasic, and Graham Neubig. 2018. Retrieval-Based Neural Code Generation. In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, pages 925–930, Brussels, Belgium. Association for Computational Linguistics.
Cite (Informal):: Retrieval-Based Neural Code Generation (Hayati et al., EMNLP 2018)
Copy Citation:
PDF:: https://aclanthology.org/D18-1111.pdf
Video:: https://aclanthology.org/D18-1111.mp4
Code: sweetpeach/ReCode
Data: Django, Hearthstone

PDF Cite Search Code Video Fix data