Hui-I Hsiao
2016
Whose Nickname is This? Recognizing Politicians from Their Aliases
Wei-Chung Wang
|
Hung-Chen Chen
|
Zhi-Kai Ji
|
Hui-I Hsiao
|
Yu-Shian Chiu
|
Lun-Wei Ku
Proceedings of the 2nd Workshop on Noisy User-generated Text (WNUT)
Using aliases to refer to public figures is one way to make fun of people, to express sarcasm, or even to sidestep legal issues when expressing opinions on social media. However, linking an alias back to the real name is difficult, as it entails phonemic, graphemic, and semantic challenges. In this paper, we propose a phonemic-based approach and inject semantic information to align aliases with politicians’ Chinese formal names. The proposed approach creates an HMM model for each name to model its phonemes and takes into account document-level pairwise mutual information to capture the semantic relations to the alias. In this work we also introduce two new datasets consisting of 167 phonemic pairs and 279 mixed pairs of aliases and formal names. Experimental results show that the proposed approach models both phonemic and semantic information and outperforms previous work on both the phonemic and mixed datasets with the best top-1 accuracies of 0.78 and 0.59 respectively.