@inproceedings{cheng-etal-2024-legent,
    title = "{LEGENT}: Open Platform for Embodied Agents",
    author = "Cheng, Zhili and
      Wang, Zhitong and
      Hu, Jinyi and
      Hu, Shengding and
      Liu, An and
      Tu, Yuge and
      Li, Pengkai and
      Shi, Lei and
      Liu, Zhiyuan and
      Sun, Maosong",
    editor = "Cao, Yixin and
      Feng, Yang and
      Xiong, Deyi",
    booktitle = "Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 3: System Demonstrations)",
    month = aug,
    year = "2024",
    address = "Bangkok, Thailand",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2024.acl-demos.32",
    doi = "10.18653/v1/2024.acl-demos.32",
    pages = "335--345",
    abstract = "Despite advancements in Large Language Models (LLMs) and Large Multimodal Models (LMMs), their integration into language-grounded, human-like embodied agents remains incomplete, hindering complex real-life task performance in 3D environments. Existing integrations often feature limited open-sourcing, challenging collective progress in this field. We introduce LEGENT, an open, scalable platform for developing embodied agents using LLMs and LMMs. LEGENT offers a dual approach: a rich 3D environment with interactive, communicable, and actionable agents, paired with a user-friendly interface, and a sophisticated data generation pipeline utilizing advanced algorithms to exploit supervision from simulated worlds at scale. In our experiments, an embryonic vision-language-action model trained on LEGENT-generated data surpasses GPT-4V in embodied tasks, showcasing promising generalization capabilities. The demo video is available at the following link https://video.legent.ai.",
}
Zhili Cheng, Zhitong Wang, Jinyi Hu, Shengding Hu, An Liu, Yuge Tu, Pengkai Li, Lei Shi, Zhiyuan Liu, and Maosong Sun. 2024. LEGENT: Open Platform for Embodied Agents. In Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 3: System Demonstrations), pages 335–345, Bangkok, Thailand. Association for Computational Linguistics.