Constructing a Japanese Verdict Prediction Dataset for Fact-Checking of LLM-Generated Texts

Miwa Masano; Hirokazu Kiyomaru; Atsushi Keyaki; Kaito Horio; Rei Minamoto; Ribeka Keyaki; Kouta Nakayama; Hideyuki Tachibana; Daisuke Kawahara

Constructing a Japanese Verdict Prediction Dataset for Fact-Checking of LLM-Generated Texts

Miwa Masano, Hirokazu Kiyomaru, Atsushi Keyaki, Kaito Horio, Rei Minamoto, Ribeka Keyaki, Kouta Nakayama, Hideyuki Tachibana, Daisuke Kawahara

Abstract

The development of fact-checking systems for verifying the factuality of text generated by large language models (LLMs) has been advancing.In the verdict prediction step of such systems, the system determines whether claims in the generated text are supported by retrieved evidence, formulated as a natural language inference (NLI) task.This study extends the label set for verdict prediction to capture claim-evidence relationships that humans would commonly interpret as supported or refuted, even in the absence of strict logical entailment or contradiction.It also constructs a Japanese dataset comprising 28,147 instances from two sources based on this extended label set.We analyze the causes of annotation disagreement and find that ambiguity in the boundary of acceptable inference, interpretive characteristics of negative cases, and incomplete information in the evidence affect annotation variability.Using this dataset, we evaluate the performance of prompt-based verdict prediction methods and show that prompts that explicitly elicit chain-of-thought reasoning improve F1 by 4 percentage points compared to baseline.

Anthology ID:: 2026.acl-srw.99
Volume:: Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (ACL 2026)
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Santosh T.Y.S.S., Juan Diego Rodriguez, Ona de Gibert
Venue:: ACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 1139–1151
Language:
URL:: https://aclanthology.org/2026.acl-srw.99/
DOI:
Bibkey:
Cite (ACL):: Miwa Masano, Hirokazu Kiyomaru, Atsushi Keyaki, Kaito Horio, Rei Minamoto, Ribeka Keyaki, Kouta Nakayama, Hideyuki Tachibana, and Daisuke Kawahara. 2026. Constructing a Japanese Verdict Prediction Dataset for Fact-Checking of LLM-Generated Texts. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (ACL 2026), pages 1139–1151, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: Constructing a Japanese Verdict Prediction Dataset for Fact-Checking of LLM-Generated Texts (Masano et al., ACL 2026)
Copy Citation:
PDF:: https://aclanthology.org/2026.acl-srw.99.pdf

PDF Cite Search Fix data