融合词典信息的古籍命名实体识别研究(A Study on the Recognition of Named Entities of Ancient Books Using Lexical Information)

Wenjun Kang (康文军), Jiali Zuo (左家莉), Anquan Jie (揭安全), Wenbin Luo (罗文兵), Mingwen Wang (王明文)


Abstract
“古籍命名实体识别对于古籍实体知识库与语料库的建设具有显著的现实意义。目前古籍命名实体识别的研究较少,主要原因是缺乏足够的训练语料。本文从《资治通鉴》入手,人工构建了一份古籍命名实体识别数据集,以此展开对古籍命名实体识别任务的研究。针对古籍文本多以单字表意且存在大量省略的语言特点,本文采用预训练词向量作为词典信息,充分利用其中蕴涵的词汇信息。实验表明,这种方法可以有效处理古籍文本中人名实体识别的问题。”
Anthology ID:
2023.ccl-1.21
Volume:
Proceedings of the 22nd Chinese National Conference on Computational Linguistics
Month:
August
Year:
2023
Address:
Harbin, China
Editors:
Maosong Sun, Bing Qin, Xipeng Qiu, Jing Jiang, Xianpei Han
Venue:
CCL
SIG:
Publisher:
Chinese Information Processing Society of China
Note:
Pages:
229–240
Language:
Chinese
URL:
https://aclanthology.org/2023.ccl-1.21
DOI:
Bibkey:
Cite (ACL):
Wenjun Kang, Jiali Zuo, Anquan Jie, Wenbin Luo, and Mingwen Wang. 2023. 融合词典信息的古籍命名实体识别研究(A Study on the Recognition of Named Entities of Ancient Books Using Lexical Information). In Proceedings of the 22nd Chinese National Conference on Computational Linguistics, pages 229–240, Harbin, China. Chinese Information Processing Society of China.
Cite (Informal):
融合词典信息的古籍命名实体识别研究(A Study on the Recognition of Named Entities of Ancient Books Using Lexical Information) (Kang et al., CCL 2023)
Copy Citation:
PDF:
https://aclanthology.org/2023.ccl-1.21.pdf