EXPLORER: Exploration-guided Reasoning for Textual Reinforcement Learning

Kinjal Basu; Keerthiram Murugesan; Subhajit Chaudhury; Murray Campbell; Kartik Talamadupula; Tim Klinger

doi:10.18653/v1/2024.eacl-long.24

EXPLORER: Exploration-guided Reasoning for Textual Reinforcement Learning

Kinjal Basu, Keerthiram Murugesan, Subhajit Chaudhury, Murray Campbell, Kartik Talamadupula, Tim Klinger

Abstract

Text-based games (TBGs) have emerged as an important collection of NLP tasks, requiring reinforcement learning (RL) agents to combine natural language understanding with reasoning. A key challenge for agents attempting to solve such tasks is to generalize across multiple games and demonstrate good performance on both seen and unseen objects. Purely deep-RL-based approaches may perform well on seen objects; however, they fail to showcase the same performance on unseen objects. Commonsense-infused deep-RL agents may work better on unseen data; unfortunately, their policies are often not interpretable or easily transferable. To tackle these issues, in this paper, we present EXPLORER which is an exploration-guided reasoning agent for textual reinforcement learning. EXPLORER is neuro-symbolic in nature, as it relies on a neural module for exploration and a symbolic module for exploitation. It can also learn generalized symbolic policies and perform well over unseen data. Our experiments show that EXPLORER outperforms the baseline agents on Text-World cooking (TW-Cooking) and Text-World Commonsense (TWC) games.

Anthology ID:: 2024.eacl-long.24
Volume:: Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:: March
Year:: 2024
Address:: St. Julian’s, Malta
Editors:: Yvette Graham, Matthew Purver
Venue:: EACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 394–405
Language:
URL:: https://aclanthology.org/2024.eacl-long.24/
DOI:: 10.18653/v1/2024.eacl-long.24
Bibkey:
Cite (ACL):: Kinjal Basu, Keerthiram Murugesan, Subhajit Chaudhury, Murray Campbell, Kartik Talamadupula, and Tim Klinger. 2024. EXPLORER: Exploration-guided Reasoning for Textual Reinforcement Learning. In Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics (Volume 1: Long Papers), pages 394–405, St. Julian’s, Malta. Association for Computational Linguistics.
Cite (Informal):: EXPLORER: Exploration-guided Reasoning for Textual Reinforcement Learning (Basu et al., EACL 2024)
Copy Citation:
PDF:: https://aclanthology.org/2024.eacl-long.24.pdf
Video:: https://aclanthology.org/2024.eacl-long.24.mp4

PDF Cite Search Video Fix data