RCI: A Score for Evaluating Global and Local Reasoning in Multimodal Benchmarks

Amit Agarwal; Hitesh Laxmichand Patel; Srikant Panda; Hansa Meghwani; Jyotika Singh; Karan Dua; Paul Li; Tao Sheng; Sujith Ravi; Dan Roth

RCI: A Score for Evaluating Global and Local Reasoning in Multimodal Benchmarks

Amit Agarwal, Hitesh Laxmichand Patel, Srikant Panda, Hansa Meghwani, Jyotika Singh, Karan Dua, Paul Li, Tao Sheng, Sujith Ravi, Dan Roth

Abstract

Multimodal Large Language Models (MLLMs) have achieved impressive results on vision-language benchmarks, yet it remains unclear whether these benchmarks assess genuine global reasoning or allow success via localized visual cues. Existing evaluation methods do not explicitly measure this distinction, hindering effective dataset curation and real-world focused model development.We introduce Region Comprehension Index (RCI), the first model-based score to directly quantify a dataset’s reliance on global versus local visual information. RCI systematically compares reference-model performance on image patches versus full images, revealing if tasks require holistic image understanding or can be solved with partial or localized visual cues.When applying RCI to 13 widely used multimodal benchmarks, we observed that most of them favor localized reasoning and exhibit significant spatial biases, indicating potential risks in real-world applications. RCI equips researchers & practitioners with an actionable tool for diagnosing & mitigating these biases, enabling the construction of datasets and benchmarks to foster the development of robust, enterprise-ready multimodal systems.

Anthology ID:: 2025.emnlp-industry.10
Volume:: Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing: Industry Track
Month:: November
Year:: 2025
Address:: Suzhou (China)
Editors:: Saloni Potdar, Lina Rojas-Barahona, Sebastien Montella
Venue:: EMNLP
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 138–157
Language:
URL:: https://aclanthology.org/2025.emnlp-industry.10/
DOI:
Bibkey:
Cite (ACL):: Amit Agarwal, Hitesh Laxmichand Patel, Srikant Panda, Hansa Meghwani, Jyotika Singh, Karan Dua, Paul Li, Tao Sheng, Sujith Ravi, and Dan Roth. 2025. RCI: A Score for Evaluating Global and Local Reasoning in Multimodal Benchmarks. In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing: Industry Track, pages 138–157, Suzhou (China). Association for Computational Linguistics.
Cite (Informal):: RCI: A Score for Evaluating Global and Local Reasoning in Multimodal Benchmarks (Agarwal et al., EMNLP 2025)
Copy Citation:
PDF:: https://aclanthology.org/2025.emnlp-industry.10.pdf

PDF Cite Search Fix data