MuMath: Multi-perspective Data Augmentation for Mathematical Reasoning in Large Language Models

Weihao You, Shuo Yin, Xudong Zhao, Zhilong Ji, Guoqiang Zhong, Jinfeng Bai


Abstract
Recently, the tool-use Large Language Models (LLMs) that integrate with external Python interpreters have significantly enhanced mathematical reasoning capabilities for open-source LLMs. However, these models fall short in demonstrating the calculation process, which compromises user-friendliness and understanding of problem-solving steps. Conversely, while tool-free methods offer a clear display of the problem-solving process, their accuracy leaves room for improvement.These tool-free methods typically employ a somewhat narrow range of augmentation techniques such as rephrasing and difficulty enhancement to boost performance. In response to this issue, we have amalgamated and further refined these strengths while broadening the scope of augmentation methods to construct a **mu**lti-perspective augmentation dataset for **math**ematics—termed **MuMath** (𝜇-Math) Dataset.Subsequently, we finetune LLaMA-2 on the MuMath dataset to derive the MuMath model. Our experiments indicate that our MuMath-70B model achieves new state-of-the-art performance among tool-free methods—achieving 88.3% on GSM8K and 34.5% on MATH .We release the MuMath dataset along with its corresponding models and code for public use.
Anthology ID:
2024.findings-naacl.185
Volume:
Findings of the Association for Computational Linguistics: NAACL 2024
Month:
June
Year:
2024
Address:
Mexico City, Mexico
Editors:
Kevin Duh, Helena Gomez, Steven Bethard
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
2932–2958
Language:
URL:
https://aclanthology.org/2024.findings-naacl.185
DOI:
10.18653/v1/2024.findings-naacl.185
Bibkey:
Cite (ACL):
Weihao You, Shuo Yin, Xudong Zhao, Zhilong Ji, Guoqiang Zhong, and Jinfeng Bai. 2024. MuMath: Multi-perspective Data Augmentation for Mathematical Reasoning in Large Language Models. In Findings of the Association for Computational Linguistics: NAACL 2024, pages 2932–2958, Mexico City, Mexico. Association for Computational Linguistics.
Cite (Informal):
MuMath: Multi-perspective Data Augmentation for Mathematical Reasoning in Large Language Models (You et al., Findings 2024)
Copy Citation:
PDF:
https://aclanthology.org/2024.findings-naacl.185.pdf