Boosting LLM Agents with Recursive Contemplation for Effective Deception Handling

Shenzhi Wang; Chang Liu (刘畅); Zilong Zheng; Siyuan Qi; Shuo Chen; Qisen Yang; Andrew Zhao; Chaofei Wang; Shiji Song; Gao Huang

doi:10.18653/v1/2024.findings-acl.591

Abstract

Recent advances in large language models (LLMs) have led to significant success in using LLMs as agents. Nevertheless, the common assumption that LLMs always process honest information neglects the widespread deceptive or misleading content in human and AI-generated material. This oversight might expose LLMs to malicious manipulation. To enhance LLMs’ ability to identify and counteract deceptive information, we introduce Recursive Contemplation (ReCon), a novel cognitive framework inspired by humans’ recursive thinking and perspective-taking. ReCon combines formulation and refinement contemplation processes: formulation contemplation produces initial thoughts and speech, while refinement contemplation further polishes them. Additionally, we incorporate first-order and second-order perspective transitions into these processes, respectively. Specifically, the first-order transition allows an LLM agent to infer others’ mental states, and the second-order transition involves understanding how others perceive the agent’s mental state. After integrating ReCon with various LLMs, extensive experimental results from the Avalon game and the BigTom benchmark indicate ReCon’s efficacy in helping LLMs discern and maneuver around deceptive information without extra fine-tuning or data. Finally, we demonstrate ReCon’s scaling trend with model parameters and explore the current limitations of LLMs in terms of safety and reasoning, potentially furnishing insights for subsequent research. Our project page can be found at https://shenzhi-wang.github.io/avalon_recon.
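
The two-stage structure described in the abstract can be sketched roughly as follows. This is only an illustrative outline based on the abstract's wording, not the authors' implementation: the `llm` callable, the `Draft` type, the function names, and all prompt strings are hypothetical placeholders, and the actual prompts and control flow in the paper may differ.

```python
from dataclasses import dataclass
from typing import Callable

# Hypothetical interface: any function that maps a prompt to generated text.
LLM = Callable[[str], str]


@dataclass
class Draft:
    thought: str  # private reasoning, not shown to other players
    speech: str   # what the agent would say out loud


def formulation_contemplation(llm: LLM, context: str) -> Draft:
    """Produce initial thoughts and speech (assumed prompt wording)."""
    # First-order perspective transition: infer others' mental states.
    others = llm(f"[first-order] Infer the other players' mental states.\n{context}")
    thought = llm(f"[think] Context:\n{context}\nInferred states:\n{others}")
    speech = llm(f"[speak] Draft a statement based on this thought:\n{thought}")
    return Draft(thought, speech)


def refinement_contemplation(llm: LLM, context: str, draft: Draft) -> Draft:
    """Polish the initial thoughts and speech (assumed prompt wording)."""
    # Second-order perspective transition: how would others perceive the agent?
    perceived = llm(
        f"[second-order] How would others read the agent's mental state "
        f"from this speech?\n{draft.speech}"
    )
    thought = llm(
        f"[refine-think] Context:\n{context}\nInitial thought:\n{draft.thought}\n"
        f"Others' likely perception:\n{perceived}"
    )
    speech = llm(f"[refine-speak] Revise the speech given:\n{thought}\n{draft.speech}")
    return Draft(thought, speech)


def recon(llm: LLM, context: str) -> Draft:
    """Run formulation, then refinement, and return the refined draft."""
    initial = formulation_contemplation(llm, context)
    return refinement_contemplation(llm, context, initial)
```

In this sketch the same model is called six times per turn with no fine-tuning, which matches the abstract's claim that ReCon works without extra training or data; how the paper actually splits or batches these calls is not stated here.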

Anthology ID: 2024.findings-acl.591
Volume: Findings of the Association for Computational Linguistics: ACL 2024
Month: August
Year: 2024
Address: Bangkok, Thailand
Editors: Lun-Wei Ku, Andre Martins, Vivek Srikumar
Venue: Findings
Publisher: Association for Computational Linguistics
Pages: 9909–9953
URL: https://aclanthology.org/2024.findings-acl.591/
DOI: 10.18653/v1/2024.findings-acl.591
Cite (ACL): Shenzhi Wang, Chang Liu, Zilong Zheng, Siyuan Qi, Shuo Chen, Qisen Yang, Andrew Zhao, Chaofei Wang, Shiji Song, and Gao Huang. 2024. Boosting LLM Agents with Recursive Contemplation for Effective Deception Handling. In Findings of the Association for Computational Linguistics: ACL 2024, pages 9909–9953, Bangkok, Thailand. Association for Computational Linguistics.
Cite (Informal): Boosting LLM Agents with Recursive Contemplation for Effective Deception Handling (Wang et al., Findings 2024)
PDF: https://aclanthology.org/2024.findings-acl.591.pdf
