Protecting multimodal large language models against misleading visualizations

Jonathan Tonglet; Tinne Tuytelaars; Marie Francine Moens; Iryna Gurevych

Protecting multimodal large language models against misleading visualizations

Jonathan Tonglet, Tinne Tuytelaars, Marie-Francine Moens, Iryna Gurevych

Abstract

Visualizations play a pivotal role in daily communication in an increasingly data-driven world. Research on multimodal large language models (MLLMs) for automated chart understanding has accelerated massively, with steady improvements on standard benchmarks. However, for MLLMs to be reliable, they must be robust to misleading visualizations, i.e., charts that distort the underlying data, leading readers to draw inaccurate conclusions. Here, we uncover an important vulnerability: MLLM question-answering (QA) accuracy on misleading visualizations drops on average to the level of the random baseline. To address this, we provide the first comparison of six inference-time methods to improve QA performance on misleading visualizations, without compromising accuracy on non-misleading ones. We find that two methods, table-based QA and redrawing the visualization, are effective, with improvements of up to 19.6 percentage points. We make our code and data available.

Anthology ID:: 2026.acl-long.377
Volume:: Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:: ACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 8329–8349
Language:
URL:: https://aclanthology.org/2026.acl-long.377/
DOI:
Bibkey:
Cite (ACL):: Jonathan Tonglet, Tinne Tuytelaars, Marie-Francine Moens, and Iryna Gurevych. 2026. Protecting multimodal large language models against misleading visualizations. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 8329–8349, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: Protecting multimodal large language models against misleading visualizations (Tonglet et al., ACL 2026)
Copy Citation:
PDF:: https://aclanthology.org/2026.acl-long.377.pdf
Checklist:: 2026.acl-long.377.checklist.pdf

PDF Cite Search Checklist Fix data