SEKE: Specialised Experts for Keyword Extraction

Matej Martinc; Thi Hong Hanh Tran; Senja Pollak; Boshko Koloski

doi:10.18653/v1/2025.findings-emnlp.764

SEKE: Specialised Experts for Keyword Extraction

Matej Martinc, Thi Hong Hanh Tran, Senja Pollak, Boshko Koloski

Abstract

Keyword extraction involves identifying the most descriptive words in a document, allowing automatic categorisation and summarisation of large quantities of diverse textual data. Relying on the insight that real-world keyword detection often requires handling of diverse content, we propose a novel supervised keyword extraction approach based on the mixture of experts (MoE) technique. MoE uses a learnable routing sub-network to direct information to specialised experts, allowing them to specialise in distinct regions of the input space. SEKE, a mixture of Specialised Experts for supervised Keyword Extraction, uses DeBERTa as the backbone model and builds on the MoE framework, where experts attend to each token, by integrating it with a bidirectional Long short-term memory (BiLSTM) network, to allow successful extraction even on smaller corpora, where specialisation is harder due to lack of training data. The MoE framework also provides an insight into inner workings of individual experts, enhancing the explainability of the approach. We benchmark SEKE on multiple English datasets, achieving state-of-the-art performance compared to strong supervised and unsupervised baselines. Our analysis reveals that depending on data size and type, experts specialise in distinct syntactic and semantic components, such as punctuation, stopwords, parts-of-speech, or named entities. Code is available at https://github.com/matejMartinc/SEKE_keyword_extraction.

Anthology ID:: 2025.findings-emnlp.764
Original:: 2025.findings-emnlp.764v1
Version 2:: 2025.findings-emnlp.764v2
Volume:: Findings of the Association for Computational Linguistics: EMNLP 2025
Month:: November
Year:: 2025
Address:: Suzhou, China
Editors:: Christos Christodoulopoulos, Tanmoy Chakraborty, Carolyn Rose, Violet Peng
Venue:: Findings
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 14191–14205
Language:
URL:: https://aclanthology.org/2025.findings-emnlp.764/
DOI:: 10.18653/v1/2025.findings-emnlp.764
Bibkey:
Cite (ACL):: Matej Martinc, Thi Hong Hanh Tran, Senja Pollak, and Boshko Koloski. 2025. SEKE: Specialised Experts for Keyword Extraction. In Findings of the Association for Computational Linguistics: EMNLP 2025, pages 14191–14205, Suzhou, China. Association for Computational Linguistics.
Cite (Informal):: SEKE: Specialised Experts for Keyword Extraction (Martinc et al., Findings 2025)
Copy Citation:
PDF:: https://aclanthology.org/2025.findings-emnlp.764.pdf
Checklist:: 2025.findings-emnlp.764.checklist.pdf

PDF (v2) PDF (v1) Cite Search Checklist Fix data