PLN CMM at SocialDisNER: Improving Detection of Disease Mentions in Tweets by Using Document-Level Features

Matias Rojas, Jose Barros, Kinan Martin, Mauricio Araneda-Hernandez, Jocelyn Dunstan


Abstract
This paper describes our approaches used to solve the SocialDisNER task, which belongs to the Social Media Mining for Health Applications (SMM4H) shared task. This task aims to identify disease mentions in tweets written in Spanish. The proposed model is an architecture based on the FLERT approach. It consists of fine-tuning a language model that creates an input representation of a sentence based on its neighboring sentences, thus obtaining the document-level context. The best result was obtained using an ensemble of six language models using the FLERT approach. The system achieved an F1 score of 0.862, significantly surpassing the average performance among competitor models of 0.680 on the test partition.
Anthology ID:
2022.smm4h-1.15
Volume:
Proceedings of The Seventh Workshop on Social Media Mining for Health Applications, Workshop & Shared Task
Month:
October
Year:
2022
Address:
Gyeongju, Republic of Korea
Editors:
Graciela Gonzalez-Hernandez, Davy Weissenbacher
Venue:
SMM4H
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
52–54
Language:
URL:
https://aclanthology.org/2022.smm4h-1.15
DOI:
Bibkey:
Cite (ACL):
Matias Rojas, Jose Barros, Kinan Martin, Mauricio Araneda-Hernandez, and Jocelyn Dunstan. 2022. PLN CMM at SocialDisNER: Improving Detection of Disease Mentions in Tweets by Using Document-Level Features. In Proceedings of The Seventh Workshop on Social Media Mining for Health Applications, Workshop & Shared Task, pages 52–54, Gyeongju, Republic of Korea. Association for Computational Linguistics.
Cite (Informal):
PLN CMM at SocialDisNER: Improving Detection of Disease Mentions in Tweets by Using Document-Level Features (Rojas et al., SMM4H 2022)
Copy Citation:
PDF:
https://aclanthology.org/2022.smm4h-1.15.pdf