G-Cap: A Game Character Caption Generator

Yang Yang; Feng Hu; Haiming Zhang; XU Cheng; Gui Zheng; Liang Yao; Wenqi Ren

G-Cap: A Game Character Caption Generator

Yang Yang, Feng Hu, Haiming Zhang, XU Cheng, Gui Zheng, Liang Yao, Wenqi Ren

Abstract

While Large Vision-Language Models (LVLMs) have demonstrated remarkable proficiency in image captioning, existing research primarily focuses on real-world scenarios, leaving surreal, highly stylized, and semantically hybrid virtual-world scenarios significantly underexplored. In this work, we introduce Game Character Captioning, a novel task designed to evaluate LVLMs’ capability to perceive and describe game character from the virtual-world. To facilitate evaluation, we establish GC-Bench, a manually annotated benchmark, and propose Graph-F1 to effectively assess performance on this task. Our evaluation reveals that: (1) current state-of-the-art LVLMs, including closed-source giants such as Gemini 3 Pro and GPT-5.1, struggle to maintain the high performance seen in real-world scenarios; and (2) a notable gap exists between open-source and closed-source models. To bridge this gap, we construct GC-148K, a large-scale dataset generated via a specialized data pipeline, and develop the G-Cap series. Experiments demonstrate that G-Cap series rivals the performance of advanced closed-source models at a lower cost, offering an efficient solution for industrial-grade production environment.

Anthology ID:: 2026.acl-long.248
Volume:: Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:: ACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 5455–5473
Language:
URL:: https://aclanthology.org/2026.acl-long.248/
DOI:
Bibkey:
Cite (ACL):: Yang Yang, Feng Hu, Haiming Zhang, XU Cheng, Gui Zheng, Liang Yao, and Wenqi Ren. 2026. G-Cap: A Game Character Caption Generator. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 5455–5473, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: G-Cap: A Game Character Caption Generator (Yang et al., ACL 2026)
Copy Citation:
PDF:: https://aclanthology.org/2026.acl-long.248.pdf
Checklist:: 2026.acl-long.248.checklist.pdf

PDF Cite Search Checklist Fix data