OVM, Outcome-supervised Value Models for Planning in Mathematical Reasoning

Fei Yu, Anningzhe Gao, Benyou Wang


Anthology ID:
2024.findings-naacl.55
Volume:
Findings of the Association for Computational Linguistics: NAACL 2024
Month:
June
Year:
2024
Address:
Mexico City, Mexico
Editors:
Kevin Duh, Helena Gomez, Steven Bethard
Venue:
Findings
Publisher:
Association for Computational Linguistics
Pages:
858–875
URL:
https://aclanthology.org/2024.findings-naacl.55
DOI:
10.18653/v1/2024.findings-naacl.55
Cite (ACL):
Fei Yu, Anningzhe Gao, and Benyou Wang. 2024. OVM, Outcome-supervised Value Models for Planning in Mathematical Reasoning. In Findings of the Association for Computational Linguistics: NAACL 2024, pages 858–875, Mexico City, Mexico. Association for Computational Linguistics.
Cite (Informal):
OVM, Outcome-supervised Value Models for Planning in Mathematical Reasoning (Yu et al., Findings 2024)
PDF:
https://aclanthology.org/2024.findings-naacl.55.pdf