T 2 -NER: A Two-Stage Span-Based Framework for Unified Named Entity Recognition with Templates

Peixin Huang; Xiang Zhao; Minghao Hu; Zhen Tan; Weidong Xiao

doi:10.1162/tacl_a_00602

T 2 -NER: A Two-Stage Span-Based Framework for Unified Named Entity Recognition with Templates

Peixin Huang, Xiang Zhao, Minghao Hu, Zhen Tan, Weidong Xiao

Abstract

Named Entity Recognition (NER) has so far evolved from the traditional flat NER to overlapped and discontinuous NER. They have mostly been solved separately, with only several exceptions that concurrently tackle three tasks with a single model. The current best-performing method formalizes the unified NER as word-word relation classification, which barely focuses on mention content learning and fails to detect entity mentions comprising a single word. In this paper, we propose a two-stage span-based framework with templates, namely, T2-NER, to resolve the unified NER task. The first stage is to extract entity spans, where flat and overlapped entities can be recognized. The second stage is to classify over all entity span pairs, where discontinuous entities can be recognized. Finally, multi-task learning is used to jointly train two stages. To improve the efficiency of span-based model, we design grouped templates and typed templates for two stages to realize batch computations. We also apply an adjacent packing strategy and a latter packing strategy to model discriminative boundary information and learn better span (pair) representation. Moreover, we introduce the syntax information to enhance our span representation. We perform extensive experiments on eight benchmark datasets for flat, overlapped, and discontinuous NER, where our model beats all the current competitive baselines, obtaining the best performance of unified NER.

Anthology ID:: 2023.tacl-1.72
Volume:: Transactions of the Association for Computational Linguistics, Volume 11
Month:
Year:: 2023
Address:: Cambridge, MA
Venue:: TACL
SIG:
Publisher:: MIT Press
Note:
Pages:: 1265–1282
Language:
URL:: https://aclanthology.org/2023.tacl-1.72/
DOI:: 10.1162/tacl_a_00602
Bibkey:
Cite (ACL):: Peixin Huang, Xiang Zhao, Minghao Hu, Zhen Tan, and Weidong Xiao. 2023. T 2 -NER: A Two-Stage Span-Based Framework for Unified Named Entity Recognition with Templates. Transactions of the Association for Computational Linguistics, 11:1265–1282.
Cite (Informal):: T 2 -NER: A Two-Stage Span-Based Framework for Unified Named Entity Recognition with Templates (Huang et al., TACL 2023)
Copy Citation:
PDF:: https://aclanthology.org/2023.tacl-1.72.pdf
Video:: https://aclanthology.org/2023.tacl-1.72.mp4

PDF Cite Search Video Fix data