Construction of Segmentation and Part of Speech Annotation Model in Ancient Chinese

Longjie Jiang, Qinyu C. Chang, Huyin H. Xie, Zhuying Z. Xia


Abstract
Among the four civilizations in the world with the longest history, only Chinese civilization has been inherited and never interrupted for 5000 years. An important factor is that the Chinese nation has the fine tradition of sorting out classics. Recording history with words, inheriting culture through continuous collation of indigenous accounts, and maintaining the spread of Chinese civilization. In this competition, the siku-roberta model was introduced into the part-of-speech tagging task of ancient Chinese by using the Zuozhuan data set, and good prediction results were obtained.
Anthology ID:
2022.lt4hala-1.23
Volume:
Proceedings of the Second Workshop on Language Technologies for Historical and Ancient Languages
Month:
June
Year:
2022
Address:
Marseille, France
Editors:
Rachele Sprugnoli, Marco Passarotti
Venue:
LT4HALA
SIG:
Publisher:
European Language Resources Association
Note:
Pages:
155–158
Language:
URL:
https://aclanthology.org/2022.lt4hala-1.23
DOI:
Bibkey:
Cite (ACL):
Longjie Jiang, Qinyu C. Chang, Huyin H. Xie, and Zhuying Z. Xia. 2022. Construction of Segmentation and Part of Speech Annotation Model in Ancient Chinese. In Proceedings of the Second Workshop on Language Technologies for Historical and Ancient Languages, pages 155–158, Marseille, France. European Language Resources Association.
Cite (Informal):
Construction of Segmentation and Part of Speech Annotation Model in Ancient Chinese (Jiang et al., LT4HALA 2022)
Copy Citation:
PDF:
https://aclanthology.org/2022.lt4hala-1.23.pdf