Learning to Generate Structured Output with Schema Reinforcement Learning

Yaxi Lu; Haolun Li; Xin Cong; Zhong Zhang; Yesai Wu; Yankai Lin (林衍凯); Zhiyuan Liu; Fangming Liu; Maosong Sun

doi:10.18653/v1/2025.acl-long.243

Abstract

This study investigates the structured generation capabilities of large language models (LLMs), focusing on producing JSON outputs that are valid against a given schema. Despite the widespread use of JSON in integrating language models with programs, there is a lack of comprehensive analysis and benchmarking of these capabilities. We explore various aspects of JSON generation, such as structure understanding, escaping, and natural language description, to determine how to assess and enable LLMs to generate valid responses. Building upon this, we propose SchemaBench, which features around 40K different JSON schemas, to obtain and assess models’ abilities in generating valid JSON. We find that the latest LLMs still struggle to generate a valid JSON string. Moreover, we demonstrate that incorporating reinforcement learning with a Fine-grained Schema Validator can further enhance models’ understanding of JSON schema, leading to improved performance. Our models demonstrate significant improvement in both generating JSON outputs and downstream tasks.
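To make the idea of checking a JSON string against a schema concrete, here is a minimal Python sketch. It is not the paper's Fine-grained Schema Validator, whose design the abstract does not describe. The function names `count_checks` and `schema_score`, the partial-credit scoring rule, and the small subset of schema keywords handled (`type` as a single string, `required`, `properties`, `items`) are all assumptions made for illustration.

```python
import json

# Assumed mapping from JSON Schema type names to Python types (illustrative subset).
_TYPES = {
    "object": dict,
    "array": list,
    "string": str,
    "integer": int,
    "number": (int, float),
    "boolean": bool,
    "null": type(None),
}


def count_checks(value, schema):
    """Return (passed, total) over the type/required/properties/items checks."""
    passed, total = 0, 0
    expected = schema.get("type")
    if expected is not None:
        total += 1
        ok = isinstance(value, _TYPES[expected])
        # bool is a subclass of int in Python, but JSON treats them as distinct.
        if expected in ("integer", "number") and isinstance(value, bool):
            ok = False
        if not ok:
            return passed, total  # wrong type: nested checks are skipped
        passed += 1
    if isinstance(value, dict):
        for key in schema.get("required", []):
            total += 1
            passed += key in value  # True counts as 1, False as 0
        for key, sub in schema.get("properties", {}).items():
            if key in value:
                p, t = count_checks(value[key], sub)
                passed, total = passed + p, total + t
    if isinstance(value, list) and "items" in schema:
        for item in value:
            p, t = count_checks(item, schema["items"])
            passed, total = passed + p, total + t
    return passed, total


def schema_score(text, schema):
    """Score in [0, 1]: 0.0 if text is not parseable JSON, else the fraction of checks passed."""
    try:
        value = json.loads(text)
    except json.JSONDecodeError:
        return 0.0
    passed, total = count_checks(value, schema)
    return passed / total if total else 1.0


person_schema = {
    "type": "object",
    "required": ["name", "age"],
    "properties": {"name": {"type": "string"}, "age": {"type": "integer"}},
}
```

Under these assumptions, a fully conforming string such as `{"name": "Ada", "age": 36}` passes every check, an unparseable string scores zero, and a string with one wrong field type receives partial credit. A graded score of this kind is one plausible way a validator could give a reinforcement learning loop a denser signal than a pass/fail check.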

Anthology ID: 2025.acl-long.243
Volume: Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month: July
Year: 2025
Address: Vienna, Austria
Editors: Wanxiang Che, Joyce Nabende, Ekaterina Shutova, Mohammad Taher Pilehvar
Venue: ACL
Publisher: Association for Computational Linguistics
Pages: 4905–4918
URL: https://aclanthology.org/2025.acl-long.243/
DOI: 10.18653/v1/2025.acl-long.243
Cite (ACL): Yaxi Lu, Haolun Li, Xin Cong, Zhong Zhang, Yesai Wu, Yankai Lin, Zhiyuan Liu, Fangming Liu, and Maosong Sun. 2025. Learning to Generate Structured Output with Schema Reinforcement Learning. In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 4905–4918, Vienna, Austria. Association for Computational Linguistics.
Cite (Informal): Learning to Generate Structured Output with Schema Reinforcement Learning (Lu et al., ACL 2025)
PDF: https://aclanthology.org/2025.acl-long.243.pdf
