Learning to Solve Domain-Specific Calculation Problems with Knowledge-Intensive Programs Generator

Chengyuan Liu; Shihang Wang; Lizhi Qing; Jun Lin; Ji Zhang; Fei Wu; Kun Kuang

doi:10.18653/v1/2025.naacl-long.245

Learning to Solve Domain-Specific Calculation Problems with Knowledge-Intensive Programs Generator

Chengyuan Liu, Shihang Wang, Lizhi Qing, Jun Lin, Ji Zhang, Fei Wu, Kun Kuang

Abstract

Domain Large Language Models (LLMs) are developed for domain-specific tasks based on general LLMs. But it still requires professional knowledge to facilitate the expertise for some domain-specific tasks. In this paper, we investigate into knowledge-intensive calculation problems. We find that the math problems to be challenging for LLMs, when involving complex domain-specific rules and knowledge documents, rather than simple formulations of terminologies. Therefore, we propose a pipeline to solve the domain-specific calculation problems with Knowledge-Intensive Programs Generator more effectively, named as KIPG. It generates knowledge-intensive programs according to the domain-specific documents. For each query, key variables are extracted, then outcomes which are dependent on domain knowledge are calculated with the programs. By iterative preference alignment, the code generator learns to improve the logic consistency with the domain knowledge. Taking legal domain as an example, we have conducted experiments to prove the effectiveness of our pipeline, and extensive analysis on the modules. We also find that the code generator is also adaptable to other domains, without training on the new knowledge.

Anthology ID:: 2025.naacl-long.245
Volume:: Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers)
Month:: April
Year:: 2025
Address:: Albuquerque, New Mexico
Editors:: Luis Chiruzzo, Alan Ritter, Lu Wang
Venue:: NAACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 4776–4791
Language:
URL:: https://aclanthology.org/2025.naacl-long.245/
DOI:: 10.18653/v1/2025.naacl-long.245
Bibkey:
Cite (ACL):: Chengyuan Liu, Shihang Wang, Lizhi Qing, Jun Lin, Ji Zhang, Fei Wu, and Kun Kuang. 2025. Learning to Solve Domain-Specific Calculation Problems with Knowledge-Intensive Programs Generator. In Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), pages 4776–4791, Albuquerque, New Mexico. Association for Computational Linguistics.
Cite (Informal):: Learning to Solve Domain-Specific Calculation Problems with Knowledge-Intensive Programs Generator (Liu et al., NAACL 2025)
Copy Citation:
PDF:: https://aclanthology.org/2025.naacl-long.245.pdf

PDF Cite Search Fix data