Diagnosing Vision Language Models’ Perception by Leveraging Human Methods for Color Vision Deficiencies

Kazuki Hayashi; Shintaro Ozaki; Yusuke Sakai; Hidetaka Kamigaito; Taro Watanabe

Diagnosing Vision Language Models’ Perception by Leveraging Human Methods for Color Vision Deficiencies

Kazuki Hayashi, Shintaro Ozaki, Yusuke Sakai, Hidetaka Kamigaito, Taro Watanabe

Abstract

Large-scale Vision-Language Models (LVLMs) are being deployed in real-world settings that require visual inference. As capabilities improve, applications in navigation, education, and accessibility are becoming practical. These settings require accommodation of perceptual variation rather than assuming a uniform visual experience. Color perception illustrates this requirement: it is central to visual understanding yet varies across individuals due to Color Vision Deficiencies, an aspect largely ignored in multimodal AI.In this work, we examine whether LVLMs can account for variation in color perception using the Ishihara Test. We evaluate model behavior through generation, confidence, and internal representation, using Ishihara plates as controlled stimuli that expose perceptual differences. Although models possess factual knowledge about color vision deficiencies and can describe the test, they fail to reproduce the perceptual outcomes experienced by affected individuals and instead default to normative color perception. These results indicate that current systems lack mechanisms for representing alternative perceptual experiences, raising concerns for accessibility and inclusive deployment in multimodal settings.

Anthology ID:: 2026.eacl-long.356
Volume:: Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:: March
Year:: 2026
Address:: Rabat, Morocco
Editors:: Vera Demberg, Kentaro Inui, Lluís Marquez
Venue:: EACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 7582–7605
Language:
URL:: https://aclanthology.org/2026.eacl-long.356/
DOI:
Bibkey:
Cite (ACL):: Kazuki Hayashi, Shintaro Ozaki, Yusuke Sakai, Hidetaka Kamigaito, and Taro Watanabe. 2026. Diagnosing Vision Language Models’ Perception by Leveraging Human Methods for Color Vision Deficiencies. In Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics (Volume 1: Long Papers), pages 7582–7605, Rabat, Morocco. Association for Computational Linguistics.
Cite (Informal):: Diagnosing Vision Language Models’ Perception by Leveraging Human Methods for Color Vision Deficiencies (Hayashi et al., EACL 2026)
Copy Citation:
PDF:: https://aclanthology.org/2026.eacl-long.356.pdf
Checklist:: 2026.eacl-long.356.checklist.pdf

PDF Cite Search Checklist Fix data