Whose Nickname is This? Recognizing Politicians from Their Aliases

Wei-Chung Wang, Hung-Chen Chen, Zhi-Kai Ji, Hui-I Hsiao, Yu-Shian Chiu, Lun-Wei Ku


Abstract
Using aliases to refer to public figures is one way to make fun of people, to express sarcasm, or even to sidestep legal issues when expressing opinions on social media. However, linking an alias back to the real name is difficult, as it entails phonemic, graphemic, and semantic challenges. In this paper, we propose a phonemic-based approach and inject semantic information to align aliases with politicians’ Chinese formal names. The proposed approach creates an HMM model for each name to model its phonemes and takes into account document-level pairwise mutual information to capture the semantic relations to the alias. In this work we also introduce two new datasets consisting of 167 phonemic pairs and 279 mixed pairs of aliases and formal names. Experimental results show that the proposed approach models both phonemic and semantic information and outperforms previous work on both the phonemic and mixed datasets with the best top-1 accuracies of 0.78 and 0.59 respectively.
Anthology ID:
W16-3910
Volume:
Proceedings of the 2nd Workshop on Noisy User-generated Text (WNUT)
Month:
December
Year:
2016
Address:
Osaka, Japan
Venues:
WNUT | WS
SIG:
Publisher:
The COLING 2016 Organizing Committee
Note:
Pages:
61–69
Language:
URL:
https://aclanthology.org/W16-3910
DOI:
Bibkey:
Cite (ACL):
Wei-Chung Wang, Hung-Chen Chen, Zhi-Kai Ji, Hui-I Hsiao, Yu-Shian Chiu, and Lun-Wei Ku. 2016. Whose Nickname is This? Recognizing Politicians from Their Aliases. In Proceedings of the 2nd Workshop on Noisy User-generated Text (WNUT), pages 61–69, Osaka, Japan. The COLING 2016 Organizing Committee.
Cite (Informal):
Whose Nickname is This? Recognizing Politicians from Their Aliases (Wang et al., 2016)
Copy Citation:
PDF:
https://aclanthology.org/W16-3910.pdf