Label Refinement via Contrastive Learning for Distantly-Supervised Named Entity Recognition

Huaiyuan Ying, Shengxuan Luo, Tiantian Dang, Sheng Yu


Abstract
Distantly-supervised named entity recognition (NER) locates and classifies entities using only knowledge bases and unlabeled corpus to mitigate the reliance on human-annotated labels. The distantly annotated data suffer from the noise in labels, and previous works on DSNER have proved the importance of pre-refining distant labels with hand-crafted rules and extra existing semantic information. In this work, we explore the way to directly learn the distant label refinement knowledge by imitating annotations of different qualities and comparing these annotations in contrastive learning frameworks. the proposed distant label refinement model can give modified suggestions on distant data without additional supervised labels, and thus reduces the requirement on the quality of the knowledge bases. We perform extensive experiments and observe that recent and state-of-the-art DSNER methods gain evident benefits with our method.
Anthology ID:
2022.findings-naacl.203
Volume:
Findings of the Association for Computational Linguistics: NAACL 2022
Month:
July
Year:
2022
Address:
Seattle, United States
Editors:
Marine Carpuat, Marie-Catherine de Marneffe, Ivan Vladimir Meza Ruiz
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
2656–2666
Language:
URL:
https://aclanthology.org/2022.findings-naacl.203
DOI:
10.18653/v1/2022.findings-naacl.203
Bibkey:
Cite (ACL):
Huaiyuan Ying, Shengxuan Luo, Tiantian Dang, and Sheng Yu. 2022. Label Refinement via Contrastive Learning for Distantly-Supervised Named Entity Recognition. In Findings of the Association for Computational Linguistics: NAACL 2022, pages 2656–2666, Seattle, United States. Association for Computational Linguistics.
Cite (Informal):
Label Refinement via Contrastive Learning for Distantly-Supervised Named Entity Recognition (Ying et al., Findings 2022)
Copy Citation:
PDF:
https://aclanthology.org/2022.findings-naacl.203.pdf
Software:
 2022.findings-naacl.203.software.zip
Video:
 https://aclanthology.org/2022.findings-naacl.203.mp4
Code
 yinghy18/credel
Data
NCBI Disease