M3D: MultiModal MultiDocument Fine-Grained Inconsistency Detection

Chia-Wei Tang, Ting-Chih Chen, Kiet Nguyen, Kazi Sajeed Mehrab, Alvi Ishmam, Chris Thomas


Abstract
Fact-checking claims is a highly laborious task that involves understanding how each factual assertion within the claim relates to a set of trusted source materials. Existing approaches make sample-level predictions but fail to identify the specific aspects of the claim that are troublesome and the specific evidence relied upon. In this paper, we introduce a method and new benchmark for this challenging task. Our method predicts the fine-grained logical relationship of each aspect of the claim from a set of multimodal documents, which may include text, images, videos, and audio. We also introduce a new benchmark (M3DC) of claims requiring multimodal multidocument reasoning, which we construct using a novel claim synthesis technique. Experiments show that our approach outperforms other models on two benchmarks for this challenging task while providing finer-grained predictions, explanations, and evidence.
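To illustrate the task described in the abstract (this is not the authors' implementation), the hypothetical Python sketch below shows the kind of fine-grained, per-aspect output such a system produces: each aspect of a claim is mapped to a logical relationship together with pointers to supporting evidence in the multimodal document set. The label set, class names, and example claim are all illustrative assumptions; the paper's actual relation taxonomy and data format may differ.

```python
from dataclasses import dataclass, field
from enum import Enum
from typing import List

# Hypothetical label set; the paper's actual relation taxonomy may differ.
class Relation(Enum):
    SUPPORTED = "supported"
    CONTRADICTED = "contradicted"
    NOT_ENOUGH_INFO = "not_enough_info"

@dataclass
class Evidence:
    document_id: str      # which source document the evidence comes from
    modality: str         # "text", "image", "video", or "audio"
    locator: str          # text span, frame range, or image region identifier

@dataclass
class AspectPrediction:
    aspect: str                   # one factual assertion extracted from the claim
    relation: Relation            # fine-grained logical relationship for this aspect
    evidence: List[Evidence] = field(default_factory=list)

# Illustrative output a fine-grained checker might return for one claim.
claim = "The mayor opened the new bridge on Monday during a rainstorm."
predictions = [
    AspectPrediction(
        aspect="the mayor opened the new bridge",
        relation=Relation.SUPPORTED,
        evidence=[Evidence("doc_03", "image", "region: podium at bridge entrance")],
    ),
    AspectPrediction(
        aspect="the opening happened on Monday",
        relation=Relation.CONTRADICTED,
        evidence=[Evidence("doc_01", "text", "sentence 4: 'ceremony held Tuesday'")],
    ),
    AspectPrediction(
        aspect="it was raining during the event",
        relation=Relation.NOT_ENOUGH_INFO,
    ),
]

for p in predictions:
    print(f"{p.aspect!r} -> {p.relation.value} ({len(p.evidence)} evidence item(s))")
```

Compared with a single sample-level verdict, this per-aspect structure makes explicit which part of the claim is inconsistent and which document (and modality) the judgment rests on.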
Anthology ID: 2024.emnlp-main.1243
Volume: Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing
Month: November
Year: 2024
Address: Miami, Florida, USA
Editors: Yaser Al-Onaizan, Mohit Bansal, Yun-Nung Chen
Venue: EMNLP
Publisher: Association for Computational Linguistics
Pages: 22270–22293
URL: https://aclanthology.org/2024.emnlp-main.1243
Cite (ACL): Chia-Wei Tang, Ting-Chih Chen, Kiet Nguyen, Kazi Sajeed Mehrab, Alvi Ishmam, and Chris Thomas. 2024. M3D: MultiModal MultiDocument Fine-Grained Inconsistency Detection. In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, pages 22270–22293, Miami, Florida, USA. Association for Computational Linguistics.
Cite (Informal): M3D: MultiModal MultiDocument Fine-Grained Inconsistency Detection (Tang et al., EMNLP 2024)
PDF: https://aclanthology.org/2024.emnlp-main.1243.pdf