Exploiting Position and Contextual Word Embeddings for Keyphrase Extraction from Scientific Papers

Krutarth Patel, Cornelia Caragea


Abstract
Keyphrases associated with research papers provide an effective way to find useful information in the large and growing scholarly digital collections. In this paper, we present KPRank, an unsupervised graph-based algorithm for keyphrase extraction that exploits both positional information and contextual word embeddings into a biased PageRank. Our experimental results on five benchmark datasets show that KPRank that uses contextual word embeddings with additional position signal outperforms previous approaches and strong baselines for this task.
Anthology ID:
2021.eacl-main.136
Volume:
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume
Month:
April
Year:
2021
Address:
Online
Editors:
Paola Merlo, Jorg Tiedemann, Reut Tsarfaty
Venue:
EACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
1585–1591
Language:
URL:
https://aclanthology.org/2021.eacl-main.136
DOI:
10.18653/v1/2021.eacl-main.136
Bibkey:
Cite (ACL):
Krutarth Patel and Cornelia Caragea. 2021. Exploiting Position and Contextual Word Embeddings for Keyphrase Extraction from Scientific Papers. In Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, pages 1585–1591, Online. Association for Computational Linguistics.
Cite (Informal):
Exploiting Position and Contextual Word Embeddings for Keyphrase Extraction from Scientific Papers (Patel & Caragea, EACL 2021)
Copy Citation:
PDF:
https://aclanthology.org/2021.eacl-main.136.pdf