READ-BioMed@SocialDisNER: Adaptation of an Annotation System to Spanish Tweets

Antonio Jimeno Yepes, Karin Verspoor


Abstract
We describe the work of the READ-BioMed team for the preparation of a submission to the SocialDisNER Disease Named Entity Recognition (NER) Task (Task 10) in 2022. We had developed a system for named entity recognition for identifying biomedical concepts in English MEDLINE citations and Spanish clinical text for the LivingNER 2022 challenge. Minimal adaptation of our system was required to perform named entity recognition in the Spanish tweets in the SocialDisNER task, given the availability of Spanish pre-trained language models and the SocialDisNER training data. Minor additions included treatment of emojis and entities in hashtags and Twitter account names.
Anthology ID:
2022.smm4h-1.14
Volume:
Proceedings of The Seventh Workshop on Social Media Mining for Health Applications, Workshop & Shared Task
Month:
October
Year:
2022
Address:
Gyeongju, Republic of Korea
Editors:
Graciela Gonzalez-Hernandez, Davy Weissenbacher
Venue:
SMM4H
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
48–51
Language:
URL:
https://aclanthology.org/2022.smm4h-1.14
DOI:
Bibkey:
Cite (ACL):
Antonio Jimeno Yepes and Karin Verspoor. 2022. READ-BioMed@SocialDisNER: Adaptation of an Annotation System to Spanish Tweets. In Proceedings of The Seventh Workshop on Social Media Mining for Health Applications, Workshop & Shared Task, pages 48–51, Gyeongju, Republic of Korea. Association for Computational Linguistics.
Cite (Informal):
READ-BioMed@SocialDisNER: Adaptation of an Annotation System to Spanish Tweets (Jimeno Yepes & Verspoor, SMM4H 2022)
Copy Citation:
PDF:
https://aclanthology.org/2022.smm4h-1.14.pdf