Meta-Reinforcement Learning for Mastering Multiple Skills and Generalizing across Environments in Text-based Games

Zhenjie Zhao, Mingfei Sun, Xiaojuan Ma


Abstract
Text-based games can be used to develop task-oriented text agents that accomplish tasks from high-level language instructions, with potential applications in domains such as human-robot interaction. Given a text instruction, reinforcement learning is commonly used to train agents to complete the intended task because it learns policies automatically. However, because the space of combinatorial text actions is large, learning a policy network that generates an action word by word with reinforcement learning is challenging. Recent work shows that imitation learning provides an effective way of training a generation-based policy network. However, agents trained with imitation learning struggle to master a wide spectrum of task types or skills, and they also have difficulty generalizing to new environments. In this paper, we propose a method based on meta-reinforcement learning that trains text agents through learning-to-explore. In particular, the text agent first explores the environment to gather task-specific information and then uses this information to adapt the execution policy for solving the task. On the publicly available testbed ALFWorld, we conducted a comparison study with imitation learning and show the superiority of our method.
Anthology ID:
2021.metanlp-1.1
Volume:
Proceedings of the 1st Workshop on Meta Learning and Its Applications to Natural Language Processing
Month:
August
Year:
2021
Address:
Online
Editors:
Hung-Yi Lee, Mitra Mohtarami, Shang-Wen Li, Di Jin, Mandy Korpusik, Shuyan Dong, Ngoc Thang Vu, Dilek Hakkani-Tur
Venue:
MetaNLP
Publisher:
Association for Computational Linguistics
Pages:
1–10
URL:
https://aclanthology.org/2021.metanlp-1.1
DOI:
10.18653/v1/2021.metanlp-1.1
Cite (ACL):
Zhenjie Zhao, Mingfei Sun, and Xiaojuan Ma. 2021. Meta-Reinforcement Learning for Mastering Multiple Skills and Generalizing across Environments in Text-based Games. In Proceedings of the 1st Workshop on Meta Learning and Its Applications to Natural Language Processing, pages 1–10, Online. Association for Computational Linguistics.
Cite (Informal):
Meta-Reinforcement Learning for Mastering Multiple Skills and Generalizing across Environments in Text-based Games (Zhao et al., MetaNLP 2021)
PDF:
https://aclanthology.org/2021.metanlp-1.1.pdf
Data
ALFRED, ALFWorld