Aligning Entity Names with Online Aliases on Twitter

Kevin McKelvey, Peter Goutzounis, Stephen da Cruz, Nathanael Chambers


Abstract
This paper presents new models that automatically align online aliases with their real entity names. Many research applications rely on identifying entity names in text, but people often refer to entities with unexpected nicknames and aliases. For example, The King and King James are aliases for Lebron James, a professional basketball player. Recent work on entity linking attempts to resolve mentions to knowledge base entries, like a wikipedia page, but linking is unfortunately limited to well-known entities with pre-built pages. This paper asks a more basic question: can aliases be aligned without background knowledge of the entity? Further, can the semantics surrounding alias mentions be used to inform alignments? We describe statistical models that make decisions based on the lexicographic properties of the aliases with their semantic context in a large corpus of tweets. We experiment on a database of Twitter users and their usernames, and present the first human evaluation for this task. Alignment accuracy approaches human performance at 81%, and we show that while lexicographic features are most important, the semantic context of an alias further improves classification accuracy.
Anthology ID:
W17-1104
Volume:
Proceedings of the Fifth International Workshop on Natural Language Processing for Social Media
Month:
April
Year:
2017
Address:
Valencia, Spain
Editors:
Lun-Wei Ku, Cheng-Te Li
Venue:
SocialNLP
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
25–35
Language:
URL:
https://aclanthology.org/W17-1104
DOI:
10.18653/v1/W17-1104
Bibkey:
Cite (ACL):
Kevin McKelvey, Peter Goutzounis, Stephen da Cruz, and Nathanael Chambers. 2017. Aligning Entity Names with Online Aliases on Twitter. In Proceedings of the Fifth International Workshop on Natural Language Processing for Social Media, pages 25–35, Valencia, Spain. Association for Computational Linguistics.
Cite (Informal):
Aligning Entity Names with Online Aliases on Twitter (McKelvey et al., SocialNLP 2017)
Copy Citation:
PDF:
https://aclanthology.org/W17-1104.pdf