Community Member Retrieval on Social Media Using Textual Information

Aaron Jaech, Shobhit Hathi, Mari Ostendorf


Abstract
This paper addresses the problem of community membership detection using only text features in a scenario where a small number of positive labeled examples defines the community. The solution introduces an unsupervised proxy task for learning user embeddings: user re-identification. Experiments with 16 different communities show that the resulting embeddings are more effective for community membership identification than common unsupervised representations.
Anthology ID:
N18-2094
Volume:
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 2 (Short Papers)
Month:
June
Year:
2018
Address:
New Orleans, Louisiana
Editors:
Marilyn Walker, Heng Ji, Amanda Stent
Venue:
NAACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
595–601
Language:
URL:
https://aclanthology.org/N18-2094
DOI:
10.18653/v1/N18-2094
Bibkey:
Cite (ACL):
Aaron Jaech, Shobhit Hathi, and Mari Ostendorf. 2018. Community Member Retrieval on Social Media Using Textual Information. In Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 2 (Short Papers), pages 595–601, New Orleans, Louisiana. Association for Computational Linguistics.
Cite (Informal):
Community Member Retrieval on Social Media Using Textual Information (Jaech et al., NAACL 2018)
Copy Citation:
PDF:
https://aclanthology.org/N18-2094.pdf
Code
 ajaech/twittercommunities