Early Discovery of Disappearing Entities in Microblogs

Satoshi Akasaki, Naoki Yoshinaga, Masashi Toyoda


Abstract
We make decisions by reacting to changes in the real world, particularly the emergence and disappearance of impermanent entities such as restaurants, services, and events. Because we want to avoid missing out on opportunities or making fruitless actions after those entities have disappeared, it is important to know when entities disappear as early as possible. We thus tackle the task of detecting disappearing entities from microblogs where various information is shared timely. The major challenge is detecting uncertain contexts of disappearing entities from noisy microblog posts. To collect such disappearing contexts, we design time-sensitive distant supervision, which utilizes entities from the knowledge base and time-series posts. Using this method, we actually build large-scale Twitter datasets of disappearing entities. To ensure robust detection in noisy environments, we refine pretrained word embeddings for the detection model on microblog streams in a timely manner. Experimental results on the Twitter datasets confirmed the effectiveness of the collected labeled data and refined word embeddings; the proposed method outperformed a baseline in terms of accuracy, and more than 70% of the detected disappearing entities in Wikipedia are discovered earlier than the update on Wikipedia, with the average lead-time is over one month.
Anthology ID:
2023.acl-long.247
Volume:
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:
July
Year:
2023
Address:
Toronto, Canada
Editors:
Anna Rogers, Jordan Boyd-Graber, Naoaki Okazaki
Venue:
ACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
4507–4520
Language:
URL:
https://aclanthology.org/2023.acl-long.247
DOI:
10.18653/v1/2023.acl-long.247
Bibkey:
Cite (ACL):
Satoshi Akasaki, Naoki Yoshinaga, and Masashi Toyoda. 2023. Early Discovery of Disappearing Entities in Microblogs. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 4507–4520, Toronto, Canada. Association for Computational Linguistics.
Cite (Informal):
Early Discovery of Disappearing Entities in Microblogs (Akasaki et al., ACL 2023)
Copy Citation:
PDF:
https://aclanthology.org/2023.acl-long.247.pdf
Video:
 https://aclanthology.org/2023.acl-long.247.mp4