CriticLean: Critic-Guided Reinforcement Learning for Mathematical Formalization

Zhongyuan Peng; Yifan Yao; Kaijing Ma; Shuyue Guo; Yizhe Li; Yichi Zhang; Chenchen Zhang; Yifan Zhang; Zhouliang Yu; Luming Li; Minghao Liu; Yihang Xia; Jiawei Shen; Yuchen Wu; Yixin Cao; Zhaoxiang Zhang; Wenhao Huang; Jiaheng Liu; Ge Zhang

CriticLean: Critic-Guided Reinforcement Learning for Mathematical Formalization

Zhongyuan Peng, Yifan Yao, Kaijing Ma, Shuyue Guo, Yizhe Li, Yichi Zhang, Chenchen Zhang, Yifan Zhang, Zhouliang Yu, Luming Li, Minghao Liu, Yihang Xia, Jiawei Shen, Yuchen Wu, Yixin Cao, Zhaoxiang Zhang, Wenhao Huang, Jiaheng Liu, Ge Zhang

Abstract

Translating natural language mathematical statements into formal, executable code is a fundamental challenge in automated theorem proving. While prior work has focused on generation and compilation success, little attention has been paid to the critic phase—the evaluation of whether generated formalizations truly capture the semantic intent of the original problem. In this paper, we introduce CriticLean, a novel critic-guided reinforcement learning framework that elevates the role of the critic from a passive validator to an active learning component. Specifically, first, we propose the CriticLeanGPT, trained via supervised fine-tuning and reinforcement learning, to rigorously assess the semantic fidelity of Lean 4 formalizations. Then, we introduce CriticLeanBench, a benchmark designed to measure models’ ability to distinguish semantically correct from incorrect formalizations, and demonstrate that our trained CriticLeanGPT models can significantly outperform strong open- and closed-source baselines. Building on the CriticLean framework, we construct FineLeanCorpus, a dataset comprising over 509K problems that exhibits rich domain diversity, broad difficulty coverage, and high correctness based on human evaluation.Overall, our findings highlight that optimizing the critic phase is essential for producing reliable formalizations and we hope our CriticLean will provide valuable insights for future advances in formal mathematical reasoning.

Anthology ID:: 2026.acl-long.139
Volume:: Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:: ACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 3049–3088
Language:
URL:: https://aclanthology.org/2026.acl-long.139/
DOI:
Bibkey:
Cite (ACL):: Zhongyuan Peng, Yifan Yao, Kaijing Ma, Shuyue Guo, Yizhe Li, Yichi Zhang, Chenchen Zhang, Yifan Zhang, Zhouliang Yu, Luming Li, Minghao Liu, Yihang Xia, Jiawei Shen, Yuchen Wu, Yixin Cao, Zhaoxiang Zhang, Wenhao Huang, Jiaheng Liu, and Ge Zhang. 2026. CriticLean: Critic-Guided Reinforcement Learning for Mathematical Formalization. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 3049–3088, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: CriticLean: Critic-Guided Reinforcement Learning for Mathematical Formalization (Peng et al., ACL 2026)
Copy Citation:
PDF:: https://aclanthology.org/2026.acl-long.139.pdf
Checklist:: 2026.acl-long.139.checklist.pdf

PDF Cite Search Checklist Fix data