Probing Multimodal Large Language Models on Cognitive Biases in Chinese Short-Video Misinformation

Jen-tse Huang; Chang Chen; Shiyang Lai; Wenxuan Wang; Michelle R Kaufman; Mark Dredze

Probing Multimodal Large Language Models on Cognitive Biases in Chinese Short-Video Misinformation

Jen-tse Huang, Chang Chen, Shiyang Lai, Wenxuan Wang, Michelle R Kaufman, Mark Dredze

Abstract

Short-video platforms have become major channels for misinformation, where deceptive claims frequently leverage visual experiments and social cues. While Multimodal Large Language Models (MLLMs) have demonstrated impressive reasoning capabilities, their robustness against misinformation entangled with cognitive biases remains under-explored. In this paper, we introduce a comprehensive evaluation framework using a high-quality, manually annotated dataset of 200 short videos spanning four health domains. This dataset provides fine-grained annotations for three deceptive patterns—experimental errors, logical fallacies, and fabricated claims—each verified by evidence such as national standards and academic literature. We evaluate eight frontier MLLMs across five modality settings. Experimental results demonstrate that Gemini-2.5-Pro achieves the highest performance in the multimodal setting with a belief score of 71.5/100, while o3 performs the worst at 35.2. Furthermore, we investigate social cues that induce false beliefs in videos and find that models are susceptible to biases like authoritative channel IDs.

Anthology ID:: 2026.findings-acl.616
Volume:: Findings of the Association for Computational Linguistics: ACL 2026
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:: Findings
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 12675–12696
Language:
URL:: https://aclanthology.org/2026.findings-acl.616/
DOI:
Bibkey:
Cite (ACL):: Jen-tse Huang, Chang Chen, Shiyang Lai, Wenxuan Wang, Michelle R Kaufman, and Mark Dredze. 2026. Probing Multimodal Large Language Models on Cognitive Biases in Chinese Short-Video Misinformation. In Findings of the Association for Computational Linguistics: ACL 2026, pages 12675–12696, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: Probing Multimodal Large Language Models on Cognitive Biases in Chinese Short-Video Misinformation (Huang et al., Findings 2026)
Copy Citation:
PDF:: https://aclanthology.org/2026.findings-acl.616.pdf
Checklist:: 2026.findings-acl.616.checklist.pdf

PDF Cite Search Checklist Fix data