Embedded Named Entity Recognition using Probing Classifiers

Nicholas Popovic, Michael Färber


Abstract
Streaming text generation, has become a common way of increasing the responsiveness of language model powered applications such as chat assistants. At the same time, extracting semantic information from generated text is a useful tool for applications such as automated fact checking or retrieval augmented generation. Currently, this requires either separate models during inference, which increases computational cost, or destructive fine-tuning of the language model. Instead, we propose an approach called EMBER which enables streaming named entity recognition in decoder-only language models without fine-tuning them and while incurring minimal additional computational cost at inference time. Specifically, our experiments show that EMBER maintains high token generation rates, with only a negligible decrease in speed of around 1% compared to a 43.64% slowdown measured for a baseline. We make our code and data available online, including a toolkit for training, testing, and deploying efficient token classification models optimized for streaming text generation.
Anthology ID:
2024.emnlp-main.988
Volume:
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing
Month:
November
Year:
2024
Address:
Miami, Florida, USA
Editors:
Yaser Al-Onaizan, Mohit Bansal, Yun-Nung Chen
Venue:
EMNLP
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
17830–17850
Language:
URL:
https://aclanthology.org/2024.emnlp-main.988
DOI:
Bibkey:
Cite (ACL):
Nicholas Popovic and Michael Färber. 2024. Embedded Named Entity Recognition using Probing Classifiers. In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, pages 17830–17850, Miami, Florida, USA. Association for Computational Linguistics.
Cite (Informal):
Embedded Named Entity Recognition using Probing Classifiers (Popovic & Färber, EMNLP 2024)
Copy Citation:
PDF:
https://aclanthology.org/2024.emnlp-main.988.pdf