Theorem-Validated Reverse Chain-of-Thought Problem Generation for Geometric Reasoning

Deng Linger; Linghao Zhu; Yuliang Liu; Yu Wang (王昱, 王雨); Qunyi Xie; Jingjing Wu; Gang Zhang; Yingying Zhu; Xiang Bai

Theorem-Validated Reverse Chain-of-Thought Problem Generation for Geometric Reasoning

Deng Linger, Linghao Zhu, Yuliang Liu, Yu Wang, Qunyi Xie, Jingjing Wu, Gang Zhang, Yingying Zhu, Xiang Bai

Abstract

Large Multimodal Models (LMMs) face limitations in geometric reasoning due to insufficient Chain of Thought (CoT) image-text training data. While existing approaches leverage template-based or LLM-assisted methods for geometric CoT data creation, they often face challenges in achieving both diversity and precision. To bridge this gap, we introduce a two-stage Theorem-Validated Reverse Chain-of-Thought Reasoning Synthesis (TR-CoT) framework. The first stage, TR-Engine, synthesizes theorem-grounded geometric diagrams with structured descriptions and properties. The second stage, TR-Reasoner, employs reverse reasoning to iteratively refine question-answer pairs by cross-validating geometric properties and description fragments. Our approach expands theorem-type coverage, corrects long-standing misunderstandings, and enhances geometric reasoning. Fine-grained CoT improves theorem understanding and increases logical consistency by 24.5%. Our best models surpass the baselines in MathVista and GeoQA by 10.1% and 4.7%, outperforming advanced closed-source models like GPT-4o.

Anthology ID:: 2025.emnlp-main.38
Volume:: Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
Month:: November
Year:: 2025
Address:: Suzhou, China
Editors:: Christos Christodoulopoulos, Tanmoy Chakraborty, Carolyn Rose, Violet Peng
Venue:: EMNLP
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 718–735
Language:
URL:: https://aclanthology.org/2025.emnlp-main.38/
DOI:
Bibkey:
Cite (ACL):: Deng Linger, Linghao Zhu, Yuliang Liu, Yu Wang, Qunyi Xie, Jingjing Wu, Gang Zhang, Yingying Zhu, and Xiang Bai. 2025. Theorem-Validated Reverse Chain-of-Thought Problem Generation for Geometric Reasoning. In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, pages 718–735, Suzhou, China. Association for Computational Linguistics.
Cite (Informal):: Theorem-Validated Reverse Chain-of-Thought Problem Generation for Geometric Reasoning (Linger et al., EMNLP 2025)
Copy Citation:
PDF:: https://aclanthology.org/2025.emnlp-main.38.pdf
Checklist:: 2025.emnlp-main.38.checklist.pdf

PDF Cite Search Checklist Fix data