Automatic Identification of 5C Vaccine Behaviour on Social Media

Ajay Hemanth Sampath Kumar, Aminath Shausan, Gianluca Demartini, Afshin Rahimi


Abstract
Monitoring vaccine behaviour through social media can guide health policy. We present a new dataset of 9471 tweets posted in Australia from 2020 to 2022, annotated with both sentiment toward vaccines and 5C, a scheme of five types of behaviour toward vaccines that is commonly used in the health psychology literature. We benchmark the dataset using BERT and a Gradient Boosting Machine and show that training the sentiment and 5C tasks jointly (F1=48) outperforms training them individually (F1=39) on this highly imbalanced data. Our sentiment analysis indicates a close correlation between sentiment and prominent events during the pandemic. We hope that our dataset and benchmark models will inform further work in online monitoring of vaccine behaviour. The dataset and benchmark methods are accessible online.
Anthology ID:
2022.wnut-1.15
Volume:
Proceedings of the Eighth Workshop on Noisy User-generated Text (W-NUT 2022)
Month:
October
Year:
2022
Address:
Gyeongju, Republic of Korea
Venue:
WNUT
Publisher:
Association for Computational Linguistics
Pages:
136–146
URL:
https://aclanthology.org/2022.wnut-1.15
Cite (ACL):
Ajay Hemanth Sampath Kumar, Aminath Shausan, Gianluca Demartini, and Afshin Rahimi. 2022. Automatic Identification of 5C Vaccine Behaviour on Social Media. In Proceedings of the Eighth Workshop on Noisy User-generated Text (W-NUT 2022), pages 136–146, Gyeongju, Republic of Korea. Association for Computational Linguistics.
Cite (Informal):
Automatic Identification of 5C Vaccine Behaviour on Social Media (Sampath Kumar et al., WNUT 2022)
PDF:
https://aclanthology.org/2022.wnut-1.15.pdf