NL2Lean: Translating Natural Language into Lean 4 through Multi-Aspect Reinforcement Learning

Yue Fang; Shaohan Huang; Xin Yu; Haizhen Huang; Zihan Zhang; Weiwei Deng; Furu Wei; Feng Sun; Qi Zhang (张琦); Zhi Jin

NL2Lean: Translating Natural Language into Lean 4 through Multi-Aspect Reinforcement Learning

Yue Fang, Shaohan Huang, Xin Yu, Haizhen Huang, Zihan Zhang, Weiwei Deng, Furu Wei, Feng Sun, Qi Zhang, Zhi Jin

Abstract

Translating natural language into formal language such as Lean 4 has gained attention for its potential to automate formal proof development. Automated methods provide a scalable and cost-effective alternative to manual formalization, driving increasing interest in this task. However, existing LLMs mainly rely on instruction tuning and lack fine-grained structural and semantic alignment, making it difficult to generate syntactically and logically sound formal proofs.To address this, we propose a reinforcement learning framework ReLean that enables LLMs to generate high-quality Lean 4 statements from natural language.We first fine-tune a LLaMA3-8B model on NL–Lean 4 data to obtain a base translator with basic translation ability. Then, we design a multi-aspect dense reward mechanism covering four key dimensions: semantic alignment, term-level alignment, global-level alignment, and compile-checking. Separate reward models are trained via preference modeling, and their normalized outputs are combined to guide optimization via PPO. Finally, a curriculum learning strategy based on multi-dimensional difficulty allows the model to learn progressively from simple to complex cases. Experiments on NL-to-Lean 4 tasks show that our method consistently outperforms baseline models. Further analysis on reward model and curriculum learning confirms their effectiveness in enhancing model performance.

Anthology ID:: 2025.emnlp-main.1586
Volume:: Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
Month:: November
Year:: 2025
Address:: Suzhou, China
Editors:: Christos Christodoulopoulos, Tanmoy Chakraborty, Carolyn Rose, Violet Peng
Venue:: EMNLP
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 31136–31146
Language:
URL:: https://aclanthology.org/2025.emnlp-main.1586/
DOI:
Bibkey:
Cite (ACL):: Yue Fang, Shaohan Huang, Xin Yu, Haizhen Huang, Zihan Zhang, Weiwei Deng, Furu Wei, Feng Sun, Qi Zhang, and Zhi Jin. 2025. NL2Lean: Translating Natural Language into Lean 4 through Multi-Aspect Reinforcement Learning. In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, pages 31136–31146, Suzhou, China. Association for Computational Linguistics.
Cite (Informal):: NL2Lean: Translating Natural Language into Lean 4 through Multi-Aspect Reinforcement Learning (Fang et al., EMNLP 2025)
Copy Citation:
PDF:: https://aclanthology.org/2025.emnlp-main.1586.pdf
Checklist:: 2025.emnlp-main.1586.checklist.pdf

PDF Cite Search Checklist Fix data