基于大规模语料库的《古籍汉字分级字表》研究(The Formulation of The graded Chinese character list of ancient books Based on Large-scale Corpus)

Changwei Xu (许长伟), Minxuan Feng (冯敏萱), Bin Li (李斌), Yiguo Yuan (袁义国)


Abstract
《古籍汉字分级字表》是基于大规模古籍文本语料库、为辅助学习者古籍文献阅读而研制的分级字表。该字表填补了古籍字表研究成果的空缺,依据各汉字学习优先级别的不同,实现了古籍汉字的等级划分,目前收录一级字105个,二级字340个,三级字555个。本文介绍了该字表研制的主要依据和基本步骤,并将其与传统识字教材“三百千”及《现代汉语常用字表》进行比较,验证了其收字的合理性。该字表有助于学习者优先掌握古籍文本常用字,提升古籍阅读能力,从而促进中华优秀传统文化的继承与发展。
Anthology ID:
2021.ccl-1.70
Volume:
Proceedings of the 20th Chinese National Conference on Computational Linguistics
Month:
August
Year:
2021
Address:
Huhhot, China
Editors:
Sheng Li (李生), Maosong Sun (孙茂松), Yang Liu (刘洋), Hua Wu (吴华), Kang Liu (刘康), Wanxiang Che (车万翔), Shizhu He (何世柱), Gaoqi Rao (饶高琦)
Venue:
CCL
SIG:
Publisher:
Chinese Information Processing Society of China
Note:
Pages:
781–791
Language:
Chinese
URL:
https://aclanthology.org/2021.ccl-1.70
DOI:
Bibkey:
Cite (ACL):
Changwei Xu, Minxuan Feng, Bin Li, and Yiguo Yuan. 2021. 基于大规模语料库的《古籍汉字分级字表》研究(The Formulation of The graded Chinese character list of ancient books Based on Large-scale Corpus). In Proceedings of the 20th Chinese National Conference on Computational Linguistics, pages 781–791, Huhhot, China. Chinese Information Processing Society of China.
Cite (Informal):
基于大规模语料库的《古籍汉字分级字表》研究(The Formulation of The graded Chinese character list of ancient books Based on Large-scale Corpus) (Xu et al., CCL 2021)
Copy Citation:
PDF:
https://aclanthology.org/2021.ccl-1.70.pdf