GLiNER2: Schema-Driven Multi-Task Learning for Structured Information Extraction

Urchade Zaratiana; Gil Pasternak; Oliver Boyd; George Hurn-Maloney; Ash Lewis

doi:10.18653/v1/2025.emnlp-demos.10

GLiNER2: Schema-Driven Multi-Task Learning for Structured Information Extraction

Urchade Zaratiana, Gil Pasternak, Oliver Boyd, George Hurn-Maloney, Ash Lewis

Abstract

Information extraction (IE) is fundamental to numerous NLP applications, yet existing solutions often require specialized models for different tasks or rely on computationally expensive large language models. We present GLiNER2, a unified framework that enhances the original GLiNER architecture to support named entity recognition, text classification, and hierarchical structured data extraction within a single efficient model. Built on a fine-tuned encoder architecture, GLiNER2 maintains CPU efficiency and compact size while introducing multi-task composition through an intuitive schema-based interface. Our experiments demonstrate competitive performance across diverse IE tasks with substantial improvements in deployment accessibility compared to LLM-based alternatives. We release GLiNER2 as an open-source library available through pip, complete with pre-trained models and comprehensive documentation.

Anthology ID:: 2025.emnlp-demos.10
Volume:: Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing: System Demonstrations
Month:: November
Year:: 2025
Address:: Suzhou, China
Editors:: Ivan Habernal, Peter Schulam, Jörg Tiedemann
Venue:: EMNLP
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 130–140
Language:
URL:: https://aclanthology.org/2025.emnlp-demos.10/
DOI:: 10.18653/v1/2025.emnlp-demos.10
Bibkey:
Cite (ACL):: Urchade Zaratiana, Gil Pasternak, Oliver Boyd, George Hurn-Maloney, and Ash Lewis. 2025. GLiNER2: Schema-Driven Multi-Task Learning for Structured Information Extraction. In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, pages 130–140, Suzhou, China. Association for Computational Linguistics.
Cite (Informal):: GLiNER2: Schema-Driven Multi-Task Learning for Structured Information Extraction (Zaratiana et al., EMNLP 2025)
Copy Citation:
PDF:: https://aclanthology.org/2025.emnlp-demos.10.pdf

PDF Cite Search Fix data