Fine-tuning BERT to classify COVID19 tweets containing symptoms

Rajarshi Roychoudhury, Sudip Naskar


Abstract
Twitter is a valuable source of patient-generated data that has been used in various population health studies. The first step in many of these studies is to identify and capture Twitter messages (tweets) containing medication mentions. Identifying personal mentions of COVID19 symptoms requires distinguishing personal mentions from other mentions such as symptoms reported by others and references to news articles or other sources. In this article, we describe our submission to Task 6 of the Social Media Mining for Health Applications (SMM4H) Shared Task 2021. This task challenged participants to classify tweets where the target classes are:(1) self-reports,(2) non-personal reports, and (3) literature/news mentions. Our system used a handcrafted preprocessing and word embeddings from BERT encoder model. We achieved an F1 score of 93%
Anthology ID:
2021.smm4h-1.30
Volume:
Proceedings of the Sixth Social Media Mining for Health (#SMM4H) Workshop and Shared Task
Month:
June
Year:
2021
Address:
Mexico City, Mexico
Editors:
Arjun Magge, Ari Klein, Antonio Miranda-Escalada, Mohammed Ali Al-garadi, Ilseyar Alimova, Zulfat Miftahutdinov, Eulalia Farre-Maduell, Salvador Lima Lopez, Ivan Flores, Karen O'Connor, Davy Weissenbacher, Elena Tutubalina, Abeed Sarker, Juan M Banda, Martin Krallinger, Graciela Gonzalez-Hernandez
Venue:
SMM4H
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
138–140
Language:
URL:
https://aclanthology.org/2021.smm4h-1.30
DOI:
10.18653/v1/2021.smm4h-1.30
Bibkey:
Cite (ACL):
Rajarshi Roychoudhury and Sudip Naskar. 2021. Fine-tuning BERT to classify COVID19 tweets containing symptoms. In Proceedings of the Sixth Social Media Mining for Health (#SMM4H) Workshop and Shared Task, pages 138–140, Mexico City, Mexico. Association for Computational Linguistics.
Cite (Informal):
Fine-tuning BERT to classify COVID19 tweets containing symptoms (Roychoudhury & Naskar, SMM4H 2021)
Copy Citation:
PDF:
https://aclanthology.org/2021.smm4h-1.30.pdf