Learning to Progressively Recognize New Named Entities with Sequence to Sequence Models

Lingzhen Chen; Alessandro Moschitti

Learning to Progressively Recognize New Named Entities with Sequence to Sequence Models

Abstract

In this paper, we propose to use a sequence to sequence model for Named Entity Recognition (NER) and we explore the effectiveness of such model in a progressive NER setting – a Transfer Learning (TL) setting. We train an initial model on source data and transfer it to a model that can recognize new NE categories in the target data during a subsequent step, when the source data is no longer available. Our solution consists in: (i) to reshape and re-parametrize the output layer of the first learned model to enable the recognition of new NEs; (ii) to leave the rest of the architecture unchanged, such that it is initialized with parameters transferred from the initial model; and (iii) to fine tune the network on the target data. Most importantly, we design a new NER approach based on sequence to sequence (Seq2Seq) models, which can intuitively work better in our progressive setting. We compare our approach with a Bidirectional LSTM, which is a strong neural NER model. Our experiments show that the Seq2Seq model performs very well on the standard NER setting and it is more robust in the progressive setting. Our approach can recognize previously unseen NE categories while preserving the knowledge of the seen data.

Anthology ID:: C18-1185
Volume:: Proceedings of the 27th International Conference on Computational Linguistics
Month:: August
Year:: 2018
Address:: Santa Fe, New Mexico, USA
Editors:: Emily M. Bender, Leon Derczynski, Pierre Isabelle
Venue:: COLING
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 2181–2191
Language:
URL:: https://aclanthology.org/C18-1185/
DOI:
Bibkey:
Cite (ACL):: Lingzhen Chen and Alessandro Moschitti. 2018. Learning to Progressively Recognize New Named Entities with Sequence to Sequence Models. In Proceedings of the 27th International Conference on Computational Linguistics, pages 2181–2191, Santa Fe, New Mexico, USA. Association for Computational Linguistics.
Cite (Informal):: Learning to Progressively Recognize New Named Entities with Sequence to Sequence Models (Chen & Moschitti, COLING 2018)
Copy Citation:
PDF:: https://aclanthology.org/C18-1185.pdf

PDF Cite Search Fix data