Extract-Select: A Span Selection Framework for Nested Named Entity Recognition with Generative Adversarial Training

Peixin Huang; Xiang Zhao; Minghao Hu; Yang Fang; Xinyi Li; Weidong Xiao

doi:10.18653/v1/2022.findings-acl.9

Extract-Select: A Span Selection Framework for Nested Named Entity Recognition with Generative Adversarial Training

Peixin Huang, Xiang Zhao, Minghao Hu, Yang Fang, Xinyi Li, Weidong Xiao

Abstract

Nested named entity recognition (NER) is a task in which named entities may overlap with each other. Span-based approaches regard nested NER as a two-stage span enumeration and classification task, thus having the innate ability to handle this task. However, they face the problems of error propagation, ignorance of span boundary, difficulty in long entity recognition and requirement on large-scale annotated data. In this paper, we propose Extract-Select, a span selection framework for nested NER, to tackle these problems. Firstly, we introduce a span selection framework in which nested entities with different input categories would be separately extracted by the extractor, thus naturally avoiding error propagation in two-stage span-based approaches. In the inference phase, the trained extractor selects final results specific to the given entity category. Secondly, we propose a hybrid selection strategy in the extractor, which not only makes full use of span boundary but also improves the ability of long entity recognition. Thirdly, we design a discriminator to evaluate the extraction result, and train both extractor and discriminator with generative adversarial training (GAT). The use of GAT greatly alleviates the stress on the dataset size. Experimental results on four benchmark datasets demonstrate that Extract-Select outperforms competitive nested NER models, obtaining state-of-the-art results. The proposed model also performs well when less labeled data are given, proving the effectiveness of GAT.

Anthology ID:: 2022.findings-acl.9
Volume:: Findings of the Association for Computational Linguistics: ACL 2022
Month:: May
Year:: 2022
Address:: Dublin, Ireland
Editors:: Smaranda Muresan, Preslav Nakov, Aline Villavicencio
Venue:: Findings
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 85–96
Language:
URL:: https://aclanthology.org/2022.findings-acl.9/
DOI:: 10.18653/v1/2022.findings-acl.9
Bibkey:
Cite (ACL):: Peixin Huang, Xiang Zhao, Minghao Hu, Yang Fang, Xinyi Li, and Weidong Xiao. 2022. Extract-Select: A Span Selection Framework for Nested Named Entity Recognition with Generative Adversarial Training. In Findings of the Association for Computational Linguistics: ACL 2022, pages 85–96, Dublin, Ireland. Association for Computational Linguistics.
Cite (Informal):: Extract-Select: A Span Selection Framework for Nested Named Entity Recognition with Generative Adversarial Training (Huang et al., Findings 2022)
Copy Citation:
PDF:: https://aclanthology.org/2022.findings-acl.9.pdf

PDF Cite Search Fix data