CodeV: Issue Resolving with Visual Data

Linhao Zhang; Daoguang Zan; Quanshun Yang; Zhirong Huang; Dong Chen; Bo Shen; Tianyu Liu; Yongshun Gong; Huang Pengjie; Xudong Lu; Guangtai Liang; Lizhen Cui; Qianxiang Wang

doi:10.18653/v1/2025.findings-acl.384

Abstract

Large Language Models (LLMs) have advanced rapidly in recent years, with their applications in software engineering expanding to more complex repository-level tasks. GitHub issue resolving is a key challenge among these tasks. While recent approaches have made progress on this task, they focus on textual data within issues, neglecting visual data. However, this visual data is crucial for resolving issues as it conveys additional knowledge that text alone cannot. We propose CodeV, the first approach to leveraging visual data to enhance the issue-resolving capabilities of LLMs. CodeV resolves each issue by following a two-phase process: data processing and patch generation. To evaluate CodeV, we construct a benchmark for visual issue resolving, namely Visual SWE-bench. Through extensive experiments, we demonstrate the effectiveness of CodeV, as well as provide valuable insights into leveraging visual data to resolve GitHub issues.

Anthology ID: 2025.findings-acl.384
Volume: Findings of the Association for Computational Linguistics: ACL 2025
Month: July
Year: 2025
Address: Vienna, Austria
Editors: Wanxiang Che, Joyce Nabende, Ekaterina Shutova, Mohammad Taher Pilehvar
Venue: Findings
Publisher: Association for Computational Linguistics
Pages: 7350–7361
URL: https://aclanthology.org/2025.findings-acl.384/
DOI: 10.18653/v1/2025.findings-acl.384
Cite (ACL): Linhao Zhang, Daoguang Zan, Quanshun Yang, Zhirong Huang, Dong Chen, Bo Shen, Tianyu Liu, Yongshun Gong, Huang Pengjie, Xudong Lu, Guangtai Liang, Lizhen Cui, and Qianxiang Wang. 2025. CodeV: Issue Resolving with Visual Data. In Findings of the Association for Computational Linguistics: ACL 2025, pages 7350–7361, Vienna, Austria. Association for Computational Linguistics.
Cite (Informal): CodeV: Issue Resolving with Visual Data (Zhang et al., Findings 2025)
PDF: https://aclanthology.org/2025.findings-acl.384.pdf
