On the Limit of Language Models as Planning Formalizers

Cassie Huang; Li Zhang

doi:10.18653/v1/2025.acl-long.242

Abstract

Large Language Models have been found to create plans that are neither executable nor verifiable in grounded environments. An emerging line of work demonstrates success in using the LLM as a formalizer to generate a formal representation of the planning domain in some language, such as the Planning Domain Definition Language (PDDL). This formal representation can then be solved deterministically to find a plan. We systematically evaluate this methodology while bridging some major gaps. Whereas previous work generates only a partial PDDL representation, given templated and therefore unrealistic environment descriptions, we generate the complete representation given descriptions of various naturalness levels. Among an array of observations critical to improving LLMs’ formal planning abilities, we note that most sufficiently large models can effectively formalize descriptions as PDDL, outperforming those that generate plans directly, while being robust to lexical perturbation. As the descriptions become more natural-sounding, we observe a decrease in performance and provide a detailed error analysis.

Anthology ID: 2025.acl-long.242
Volume: Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month: July
Year: 2025
Address: Vienna, Austria
Editors: Wanxiang Che, Joyce Nabende, Ekaterina Shutova, Mohammad Taher Pilehvar
Venue: ACL
Publisher: Association for Computational Linguistics
Pages: 4880–4904
URL: https://aclanthology.org/2025.acl-long.242/
DOI: 10.18653/v1/2025.acl-long.242
Cite (ACL): Cassie Huang and Li Zhang. 2025. On the Limit of Language Models as Planning Formalizers. In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 4880–4904, Vienna, Austria. Association for Computational Linguistics.
Cite (Informal): On the Limit of Language Models as Planning Formalizers (Huang & Zhang, ACL 2025)
PDF: https://aclanthology.org/2025.acl-long.242.pdf
