Wang Jiangkuo

2024

pdf bib abs
Chinese Vision-Language Understanding Evaluation
Wang Jiangkuo | Zheng Linwei | Chen Kehai | Bai Xuefeng | Zhang Min
Proceedings of the 23rd Chinese National Conference on Computational Linguistics (Volume 3: Evaluations)

“This paper introduces our systems submitted for the Chinese Vision-Language Understanding Evaluation task at the 23rd Chinese Computational Linguistics Conference.In this competition, we utilized X2-VLM and CCLM models to participate in various subtasks such as image-text retrieval, visual grounding, visual dialogue, and visual question answering. Additionally, we employed other models to assess performance on certain subtasks. We optimized our models and successfully applied them to these different tasks”

Co-authors

Chen Kehai (陈科海) 1
Zheng Linwei 1
Zhang Min (张民) 1
Bai Xuefeng (白雪峰) 1

Venues

ccl1

Fix data