Qixu Cai
2024
The Fourth Evaluation on Chinese Spatial Cognition
Liming Xiao
|
Nan Hu
|
Weidong Zhan
|
Yuhang Qin
|
Sirui Deng
|
Chunhui Sun
|
Qixu Cai
|
Nan Li
Proceedings of the 23rd Chinese National Conference on Computational Linguistics (Volume 3: Evaluations)
“The Fourth Chinese Spatial Cognition Evaluation Task (SpaCE 2024) presents the first comprehensive Chinese benchmark to assess spatial semantic understanding and reasoning capabilities of Large Language Models (LLMs). It comprises five subtasks in the form of multiple-choice questions: (1) identifying spatial semantic roles; (2) retrieving spatial referents; (3) detecting spatial semantic anomalies; (4) recognizing synonymous spatial expression with different forms; (5) conducting spatial position reasoning. In addition to proposing new tasks, SpaCE 2024 applied a rule-based method to generate high-quality synthetic data with difficulty levels for the reasoning task. 12 teams submitted their models and results, and the top-performing team attained an accuracy of 60.24%, suggesting that there is still significant room for current LLMs to improve, especially in tasks requiring high spatial cognitive processing.”
Search
Fix author
Co-authors
- Sirui Deng 1
- Nan Hu 1
- Nan Li (李楠) 1
- Yuhang Qin (秦宇航) 1
- Chunhui Sun (孙春晖) 1
- show all...
Venues
- ccl1