JanusMM: A Benchmark for Self-Deprecation Understanding in Real-World Multimodal Conversations

Xinyi Xu; Bingguang Hao; Yongyi Xiong; Zimo Chen; Xinchen Liu; Hongxin Guo; Xuelong Wang; Silin Zhou; Shihan Dou

JanusMM: A Benchmark for Self-Deprecation Understanding in Real-World Multimodal Conversations

Xinyi Xu, Bingguang Hao, Yongyi Xiong, Zimo Chen, Xinchen Liu, Hongxin Guo, Xuelong Wang, Silin Zhou, Shihan Dou

Abstract

Self-deprecation is a prevalent communicative strategy in human society, often using image-text interplay to express emotions and intentions. Despite self-deprecation is widespread in real-world conversations, the ability of multimodal large language models (MLLMs) to understand it remains underexplored. To fill this gap, we introduce **JanusMM**, the first benchmark designed to evaluate MLLMs’ understanding of self-deprecation in real-world conversations. JanusMM contains 2,016 bilingual memes from three types of social interactions and provides a dual-task evaluation framework with six new metrics. The first task assesses MLLMs’ abilities in self-deprecation recognition and reasoning, while the second task evaluates the consistency of their understanding by simulating the perspectives of the initiator and responder. We evaluate ten frontier MLLMs and find that they exhibit weak recognition and reasoning abilities, with their understanding of self-deprecation remaining inconsistent across both perspectives.

Anthology ID:: 2026.acl-long.1116
Volume:: Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:: ACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 24324–24343
Language:
URL:: https://aclanthology.org/2026.acl-long.1116/
DOI:
Bibkey:
Cite (ACL):: Xinyi Xu, Bingguang Hao, Yongyi Xiong, Zimo Chen, Xinchen Liu, Hongxin Guo, Xuelong Wang, Silin Zhou, and Shihan Dou. 2026. JanusMM: A Benchmark for Self-Deprecation Understanding in Real-World Multimodal Conversations. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 24324–24343, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: JanusMM: A Benchmark for Self-Deprecation Understanding in Real-World Multimodal Conversations (Xu et al., ACL 2026)
Copy Citation:
PDF:: https://aclanthology.org/2026.acl-long.1116.pdf
Checklist:: 2026.acl-long.1116.checklist.pdf

PDF Cite Search Checklist Fix data