Homing in on Twitter Users: Evaluating an Enhanced Geoparser for User Profile Locations

Beatrice Alex, Clare Llewellyn, Claire Grover, Jon Oberlander, Richard Tobin


Abstract
Twitter-related studies often need to geo-locate Tweets or Twitter users, identifying their real-world geographic locations. As tweet-level geotagging remains rare, most prior work exploited tweet content, timezone and network information to inform geolocation, or else relied on off-the-shelf tools to geolocate users from location information in their user profiles. However, such user location metadata is not consistently structured, causing such tools to fail regularly, especially if a string contains multiple locations, or if locations are very fine-grained. We argue that user profile location (UPL) and tweet location need to be treated as distinct types of information from which differing inferences can be drawn. Here, we apply geoparsing to UPLs, and demonstrate how task performance can be improved by adapting our Edinburgh Geoparser, which was originally developed for processing English text. We present a detailed evaluation method and results, including inter-coder agreement. We demonstrate that the optimised geoparser can effectively extract and geo-reference multiple locations at different levels of granularity with an F1-score of around 0.90. We also illustrate how geoparsed UPLs can be exploited for international information trade studies and country-level sentiment analysis.
Anthology ID:
L16-1622
Volume:
Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16)
Month:
May
Year:
2016
Address:
Portorož, Slovenia
Editors:
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Sara Goggi, Marko Grobelnik, Bente Maegaard, Joseph Mariani, Helene Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
3936–3944
Language:
URL:
https://aclanthology.org/L16-1622
DOI:
Bibkey:
Cite (ACL):
Beatrice Alex, Clare Llewellyn, Claire Grover, Jon Oberlander, and Richard Tobin. 2016. Homing in on Twitter Users: Evaluating an Enhanced Geoparser for User Profile Locations. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16), pages 3936–3944, Portorož, Slovenia. European Language Resources Association (ELRA).
Cite (Informal):
Homing in on Twitter Users: Evaluating an Enhanced Geoparser for User Profile Locations (Alex et al., LREC 2016)
Copy Citation:
PDF:
https://aclanthology.org/L16-1622.pdf