ERU-KG: Efficient Reference-aligned Unsupervised Keyphrase Generation

Lam Thanh Do; Aaditya Bodke; Pritom Saha Akash; Kevin Chen-Chuan Chang

doi:10.18653/v1/2025.acl-long.1398

ERU-KG: Efficient Reference-aligned Unsupervised Keyphrase Generation

Lam Thanh Do, Aaditya Bodke, Pritom Saha Akash, Kevin Chen-Chuan Chang

Abstract

Unsupervised keyphrase prediction has gained growing interest in recent years. However, existing methods typically rely on heuristically defined importance scores, which may lead to inaccurate informativeness estimation. In addition, they lack consideration for time efficiency. To solve these problems, we propose ERU-KG, an unsupervised keyphrase generation (UKG) model that consists of an informativeness and a phraseness module. The former estimates the relevance of keyphrase candidates, while the latter generate those candidates. The informativeness module innovates by learning to model informativeness through references (e.g., queries, citation contexts, and titles) and at the term-level, thereby 1) capturing how the key concepts of documents are perceived in different contexts and 2) estimating informativeness of phrases more efficiently by aggregating term informativeness, removing the need for explicit modeling of the candidates. ERU-KG demonstrates its effectiveness on keyphrase generation benchmarks by outperforming unsupervised baselines and achieving on average 89% of the performance of a supervised model for top 10 predictions. Additionally, to highlight its practical utility, we evaluate the model on text retrieval tasks and show that keyphrases generated by ERU-KG are effective when employed as query and document expansions. Furthermore, inference speed tests reveal that ERU-KG is the fastest among baselines of similar model sizes. Finally, our proposed model can switch between keyphrase generation and extraction by adjusting hyperparameters, catering to diverse application requirements.

Anthology ID:: 2025.acl-long.1398
Volume:: Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:: July
Year:: 2025
Address:: Vienna, Austria
Editors:: Wanxiang Che, Joyce Nabende, Ekaterina Shutova, Mohammad Taher Pilehvar
Venue:: ACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 28811–28829
Language:
URL:: https://aclanthology.org/2025.acl-long.1398/
DOI:: 10.18653/v1/2025.acl-long.1398
Bibkey:
Cite (ACL):: Lam Thanh Do, Aaditya Bodke, Pritom Saha Akash, and Kevin Chen-Chuan Chang. 2025. ERU-KG: Efficient Reference-aligned Unsupervised Keyphrase Generation. In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 28811–28829, Vienna, Austria. Association for Computational Linguistics.
Cite (Informal):: ERU-KG: Efficient Reference-aligned Unsupervised Keyphrase Generation (Do et al., ACL 2025)
Copy Citation:
PDF:: https://aclanthology.org/2025.acl-long.1398.pdf

PDF Cite Search Fix data