Encoding Conversation Context for Neural Keyphrase Extraction from Microblog Posts

Yingyi Zhang, Jing Li, Yan Song, Chengzhi Zhang


Abstract
Existing keyphrase extraction methods suffer from data sparsity problem when they are conducted on short and informal texts, especially microblog messages. Enriching context is one way to alleviate this problem. Considering that conversations are formed by reposting and replying messages, they provide useful clues for recognizing essential content in target posts and are therefore helpful for keyphrase identification. In this paper, we present a neural keyphrase extraction framework for microblog posts that takes their conversation context into account, where four types of neural encoders, namely, averaged embedding, RNN, attention, and memory networks, are proposed to represent the conversation context. Experimental results on Twitter and Weibo datasets show that our framework with such encoders outperforms state-of-the-art approaches.
Anthology ID:
N18-1151
Volume:
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers)
Month:
June
Year:
2018
Address:
New Orleans, Louisiana
Editors:
Marilyn Walker, Heng Ji, Amanda Stent
Venue:
NAACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
1676–1686
Language:
URL:
https://aclanthology.org/N18-1151
DOI:
10.18653/v1/N18-1151
Bibkey:
Cite (ACL):
Yingyi Zhang, Jing Li, Yan Song, and Chengzhi Zhang. 2018. Encoding Conversation Context for Neural Keyphrase Extraction from Microblog Posts. In Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers), pages 1676–1686, New Orleans, Louisiana. Association for Computational Linguistics.
Cite (Informal):
Encoding Conversation Context for Neural Keyphrase Extraction from Microblog Posts (Zhang et al., NAACL 2018)
Copy Citation:
PDF:
https://aclanthology.org/N18-1151.pdf