Wang Jiangkuo
2024
Chinese Vision-Language Understanding Evaluation
Wang Jiangkuo
|
Zheng Linwei
|
Chen Kehai
|
Bai Xuefeng
|
Zhang Min
Proceedings of the 23rd Chinese National Conference on Computational Linguistics (Volume 3: Evaluations)
“This paper introduces our systems submitted for the Chinese Vision-Language Understanding Evaluation task at the 23rd Chinese Computational Linguistics Conference.In this competition, we utilized X2-VLM and CCLM models to participate in various subtasks such as image-text retrieval, visual grounding, visual dialogue, and visual question answering. Additionally, we employed other models to assess performance on certain subtasks. We optimized our models and successfully applied them to these different tasks”