LEMON: Language-Based Environment Manipulation via Execution-Guided Pre-training

Qi Shi, Qian Liu, Bei Chen, Yu Zhang, Ting Liu, Jian-Guang Lou


Abstract
Language-based environment manipulation requires agents to manipulate the environment following natural language instructions, which is challenging due to the huge space of the environments. To address this challenge, various approaches have been proposed in recent work. Although these approaches work well for their intended environments, they are difficult to generalize across environments. In this work, we propose LEMON, a general framework for language-based environment manipulation tasks. Specifically, we first specify a general approach for language-based environment manipulation tasks, which can deal with various environments using the same generative language model. Then we propose an execution-guided pre-training strategy to inject prior knowledge of environments to the language model with a pure synthetic pre-training corpus. Experimental results on tasks including Alchemy, Scene, Tangrams, ProPara and Recipes demonstrate the effectiveness of LEMON: it achieves new state-of-the-art results on four of the tasks, and the execution-guided pre-training strategy brings remarkable improvements on all experimental tasks.
Anthology ID:
2022.findings-emnlp.33
Volume:
Findings of the Association for Computational Linguistics: EMNLP 2022
Month:
December
Year:
2022
Address:
Abu Dhabi, United Arab Emirates
Editors:
Yoav Goldberg, Zornitsa Kozareva, Yue Zhang
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
471–485
Language:
URL:
https://aclanthology.org/2022.findings-emnlp.33
DOI:
10.18653/v1/2022.findings-emnlp.33
Bibkey:
Cite (ACL):
Qi Shi, Qian Liu, Bei Chen, Yu Zhang, Ting Liu, and Jian-Guang Lou. 2022. LEMON: Language-Based Environment Manipulation via Execution-Guided Pre-training. In Findings of the Association for Computational Linguistics: EMNLP 2022, pages 471–485, Abu Dhabi, United Arab Emirates. Association for Computational Linguistics.
Cite (Informal):
LEMON: Language-Based Environment Manipulation via Execution-Guided Pre-training (Shi et al., Findings 2022)
Copy Citation:
PDF:
https://aclanthology.org/2022.findings-emnlp.33.pdf
Video:
 https://aclanthology.org/2022.findings-emnlp.33.mp4