From Behavioral Performance to Internal Competence: Interpreting Vision-Language Models with VLM-Lens

Hala Sheta; Eric Haoran Huang; Shuyu Wu; Ilia Alenabi; Jiajun Hong; Ryker Lin; Ruoxi Ning; Daniel Wei; Jialin Yang; Jiawei Zhou; Ziqiao Ma; Freda Shi

doi:10.18653/v1/2025.emnlp-demos.68

Abstract

We introduce VLM-Lens, a toolkit designed to enable systematic benchmarking, analysis, and interpretation of vision-language models (VLMs) by supporting the extraction of intermediate outputs from any layer during the forward pass of open-source VLMs. VLM-Lens provides a unified, YAML-configurable interface that abstracts away model-specific complexities and supports user-friendly operation across diverse VLMs. It currently supports 16 state-of-the-art base VLMs and more than 30 of their variants, and is extensible to accommodate new models without changing the core logic. The toolkit integrates easily with various interpretability and analysis methods. We demonstrate its usage with two simple analytical experiments, revealing systematic differences in the hidden representations of VLMs across layers and target concepts. VLM-Lens is released as an open-source project to accelerate community efforts in understanding and improving VLMs.
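
The layer-extraction idea described in the abstract can be illustrated with a small, dependency-free sketch of the forward-hook pattern (the same pattern that PyTorch exposes through `torch.nn.Module.register_forward_hook`). This is not the VLM-Lens API: the `Layer`, `ToyModel`, and `extract` names, the layer names, and the `config_targets` list (standing in for a list of target layers that a configuration file might supply) are all hypothetical and chosen only for illustration.

```python
from typing import Callable, Dict, List

Vector = List[float]
Hook = Callable[[str, Vector], None]


class Layer:
    """Toy layer: applies a function, then notifies any registered hooks."""

    def __init__(self, name: str, fn: Callable[[Vector], Vector]) -> None:
        self.name = name
        self.fn = fn
        self.hooks: List[Hook] = []

    def __call__(self, x: Vector) -> Vector:
        out = self.fn(x)
        for hook in self.hooks:
            hook(self.name, out)  # hooks observe the output without changing it
        return out


class ToyModel:
    """Hypothetical three-layer model; the layer names are made up."""

    def __init__(self) -> None:
        self.layers = [
            Layer("vision.proj", lambda x: [2.0 * v for v in x]),
            Layer("decoder.block0", lambda x: [v + 1.0 for v in x]),
            Layer("decoder.block1", lambda x: [max(v, 0.0) for v in x]),
        ]

    def forward(self, x: Vector) -> Vector:
        for layer in self.layers:
            x = layer(x)
        return x


def extract(model: ToyModel, x: Vector, targets: List[str]) -> Dict[str, Vector]:
    """Run one forward pass and return the outputs of the named layers."""
    captured: Dict[str, Vector] = {}

    def save(name: str, out: Vector) -> None:
        captured[name] = list(out)  # copy so later layers cannot mutate it

    wanted = [layer for layer in model.layers if layer.name in targets]
    for layer in wanted:
        layer.hooks.append(save)
    try:
        model.forward(x)
    finally:
        for layer in wanted:
            layer.hooks.remove(save)  # always detach hooks after the pass
    return captured


# Stand-in for target layers that a configuration file might list (assumption).
config_targets = ["vision.proj", "decoder.block0"]
model = ToyModel()
features = extract(model, [1.0, -3.0], config_targets)
```

Because the hooks are attached by layer name and detached after the pass, the model code itself stays untouched, which is the property that lets such an approach extend to new models without changing core logic.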

Anthology ID: 2025.emnlp-demos.68
Volume: Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing: System Demonstrations
Month: November
Year: 2025
Address: Suzhou, China
Editors: Ivan Habernal, Peter Schulam, Jörg Tiedemann
Venue: EMNLP
Publisher: Association for Computational Linguistics
Pages: 886–895
URL: https://aclanthology.org/2025.emnlp-demos.68/
DOI: 10.18653/v1/2025.emnlp-demos.68
Cite (ACL): Hala Sheta, Eric Haoran Huang, Shuyu Wu, Ilia Alenabi, Jiajun Hong, Ryker Lin, Ruoxi Ning, Daniel Wei, Jialin Yang, Jiawei Zhou, Ziqiao Ma, and Freda Shi. 2025. From Behavioral Performance to Internal Competence: Interpreting Vision-Language Models with VLM-Lens. In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, pages 886–895, Suzhou, China. Association for Computational Linguistics.
Cite (Informal): From Behavioral Performance to Internal Competence: Interpreting Vision-Language Models with VLM-Lens (Sheta et al., EMNLP 2025)
PDF: https://aclanthology.org/2025.emnlp-demos.68.pdf
