中文图文多模态理解评测

Wang Yuxuan (王宇轩), Liu Yijun (刘议骏), Wan Zhiguo (万志国), Che Wanxiang (车万翔)


Abstract
“中文图文多模态理解评测任务旨在从多角度评价中文图文多模态预训练模型的图文多模态建模和理解能力。本任务共包括五个子任务:图片检索、文本检索、视觉问答、视觉定位和视觉对话,最终成绩根据这五个任务的得分综合计算。本文首先介绍了任务的背景和动机,然后从任务介绍、评价指标、比赛结果、参赛方法等方面介绍并展示了本次评测任务的相关信息。本次任务共有11支队伍报名参赛,其中3支队伍提交了结果。”
Anthology ID:
2024.ccl-3.42
Volume:
Proceedings of the 23rd Chinese National Conference on Computational Linguistics (Volume 3: Evaluations)
Month:
July
Year:
2024
Address:
Taiyuan, China
Editors:
Hongfei Lin, Hongye Tan, Bin Li
Venue:
CCL
SIG:
Publisher:
Chinese Information Processing Society of China
Note:
Pages:
374–381
Language:
Chinese
URL:
https://aclanthology.org/2024.ccl-3.42/
DOI:
Bibkey:
Cite (ACL):
Wang Yuxuan, Liu Yijun, Wan Zhiguo, and Che Wanxiang. 2024. 中文图文多模态理解评测. In Proceedings of the 23rd Chinese National Conference on Computational Linguistics (Volume 3: Evaluations), pages 374–381, Taiyuan, China. Chinese Information Processing Society of China.
Cite (Informal):
中文图文多模态理解评测 (Yuxuan et al., CCL 2024)
Copy Citation:
PDF:
https://aclanthology.org/2024.ccl-3.42.pdf