C3LRSO: A Chinese Corpus for Complex Logical Reasoning in Sentence Ordering

Xiaotao Guo, Jiang Li, Xiangdong Su, Fujun Zhang


Abstract
Sentence ordering is the task of rearranging a set of unordered sentences into a coherent and logically consistent sequence. Recent work has primarily used pre-trained language models, achieving significant success in the task. However, existing sentence ordering corpora are predominantly in English, and comprehensive benchmark datasets for non-English languages are unavailable. Meanwhile, current datasets often insert specific markers into paragraphs, inadvertently making the logical sequence between sentences more apparent and reducing the models’ ability to handle genuinely unordered sentences in real applications. To address these limitations, we develop C3LRSO, a high-quality Chinese sentence ordering dataset that overcomes the aforementioned shortcomings by providing genuinely unordered sentences without artificial segmentation cues. Furthermore, given the outstanding performance of large language models on NLP tasks, we evaluate these models on our dataset for this task. Additionally, we propose a simple yet effective parameter-free approach that outperforms existing methods on this task. Experiments demonstrate the challenging nature of the dataset and the strong performance of our proposed method. These findings highlight the potential for further research in sentence ordering and the development of more robust language models. Our dataset is freely available at https://github.com/JasonGuo1/C3LRSO.
Anthology ID:
2025.coling-main.275
Volume:
Proceedings of the 31st International Conference on Computational Linguistics
Month:
January
Year:
2025
Address:
Abu Dhabi, UAE
Editors:
Owen Rambow, Leo Wanner, Marianna Apidianaki, Hend Al-Khalifa, Barbara Di Eugenio, Steven Schockaert
Venue:
COLING
Publisher:
Association for Computational Linguistics
Pages:
4085–4095
URL:
https://aclanthology.org/2025.coling-main.275/
Cite (ACL):
Xiaotao Guo, Jiang Li, Xiangdong Su, and Fujun Zhang. 2025. C3LRSO: A Chinese Corpus for Complex Logical Reasoning in Sentence Ordering. In Proceedings of the 31st International Conference on Computational Linguistics, pages 4085–4095, Abu Dhabi, UAE. Association for Computational Linguistics.
Cite (Informal):
C3LRSO: A Chinese Corpus for Complex Logical Reasoning in Sentence Ordering (Guo et al., COLING 2025)
PDF:
https://aclanthology.org/2025.coling-main.275.pdf