InsLogicBench: An Argumentation Logic Grounded Benchmark for Complex Insurance Claims Adjudication

Jin Liu; Yunpeng Liu; Keyi Wang; Jie Shi; Xiao Xu; Wenkang Huang; Xingzhong Xu; Xin Liang; Yanghua Xiao

InsLogicBench: An Argumentation Logic Grounded Benchmark for Complex Insurance Claims Adjudication

Jin Liu, Yunpeng Liu, Keyi Wang, Jie Shi, Xiao Xu, Wenkang Huang, Xingzhong Xu, Xin Liang, Yanghua Xiao

Abstract

Insurance claims adjudication demands not only accurate decisions but also interpretable reasoning grounded in policy clauses. However, existing benchmarks are limited to information retrieval or simple multiple-choice setups, which fail to require step-by-step inferences from facts to conclusions. To address this gap, we introduce InsLogicBench, a benchmark providing complete reasoning traces that link factual inputs, relevant policy clauses, and final verdicts. We construct the dataset using a controllable synthesis framework based on the Nested Toulmin Model. By capturing the defeasible logic of insurance policies through hierarchical truth assignment and enforcing validity via consistency verification, we ensure interpretability and logical rigor across generated examples. We evaluate eight Large Language Models (LLMs) on InsLogicBench. Results show significant difficulties in handling exception clauses and verifying missing conditions. Notably, models often produce correct final decisions but fail to provide precise justifications, highlighting a critical discrepancy between their decision accuracy and logical reasoning capabilities.

Anthology ID:: 2026.acl-long.1035
Volume:: Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:: ACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 22592–22619
Language:
URL:: https://aclanthology.org/2026.acl-long.1035/
DOI:
Bibkey:
Cite (ACL):: Jin Liu, Yunpeng Liu, Keyi Wang, Jie Shi, Xiao Xu, Wenkang Huang, Xingzhong Xu, Xin Liang, and Yanghua Xiao. 2026. InsLogicBench: An Argumentation Logic Grounded Benchmark for Complex Insurance Claims Adjudication. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 22592–22619, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: InsLogicBench: An Argumentation Logic Grounded Benchmark for Complex Insurance Claims Adjudication (Liu et al., ACL 2026)
Copy Citation:
PDF:: https://aclanthology.org/2026.acl-long.1035.pdf
Checklist:: 2026.acl-long.1035.checklist.pdf

PDF Cite Search Checklist Fix data