SINAI at #SMM4H–HeaRD 2026: Multilingual Clinical NER with MrBERT-biomed and Optuna Hyperparameter Optimization

Lucas Molino Piñar; Manuel Carlos Diaz-Galiano; María-Teresa Martín-Valdivia

SINAI at #SMM4H–HeaRD 2026: Multilingual Clinical NER with MrBERT-biomed and Optuna Hyperparameter Optimization

Lucas Molino Piñar, Manuel Carlos Diaz-Galiano, María-Teresa Martín-Valdivia

Abstract

This paper describes the system submitted by our team to the MultiClinAI shared task at the 11th SMM4H-HeaRD Workshop (ACL 2026). The task addresses multilingual clinical Named Entity Recognition (NER) for three entity types (Disease, Procedure, and Symptom) in Spanish clinical texts. Our approach fine-tunes MrBERT-biomed, a domain-adapted ModernBERT model pre-trained on biomedical corpora, using multilingual clinical data from seven European languages. We train independent entity-specific models, each optimized via Bayesian hyperparameter search with Optuna, and apply a deterministic post-processing step that aligns predicted spans to word boundaries. On the official test set, our system achieves overall strict micro-F1 scores of 0.7453, 0.7107, and 0.6603 for Disease, Procedure, and Symptom, respectively.

Anthology ID:: 2026.smm4h-1.4
Volume:: Proceedings of the 11th Social Media Mining for Health Research and Applications (SMM4H-HeaRD 2026) Workshop and Shared Tasks
Month:: July
Year:: 2026
Address:: San Diego, United States
Editors:: Guillermo Lopez-Garcia, Graciela Gonzalez-Hernandez
Venues:: SMM4H | WS
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 18–22
Language:
URL:: https://aclanthology.org/2026.smm4h-1.4/
DOI:
Bibkey:
Cite (ACL):: Lucas Molino Piñar, Manuel Carlos Diaz-Galiano, and María-Teresa Martín-Valdivia. 2026. SINAI at #SMM4H–HeaRD 2026: Multilingual Clinical NER with MrBERT-biomed and Optuna Hyperparameter Optimization. In Proceedings of the 11th Social Media Mining for Health Research and Applications (SMM4H-HeaRD 2026) Workshop and Shared Tasks, pages 18–22, San Diego, United States. Association for Computational Linguistics.
Cite (Informal):: SINAI at #SMM4H–HeaRD 2026: Multilingual Clinical NER with MrBERT-biomed and Optuna Hyperparameter Optimization (Piñar et al., SMM4H 2026)
Copy Citation:
PDF:: https://aclanthology.org/2026.smm4h-1.4.pdf

PDF Cite Search Fix data