Overview of #SMM4H-HeaRD 2026 - Task 2: Detection of Insomnia in Clinical Notes

Joey Chan; Lauren D. Gryboski; Guillermo Lopez-Garcia; Graciela Gonzalez-Hernandez

Abstract

This paper provides an overview of Task 2 from the Social Media Mining for Health and Health Real-World Data (#SMM4H-HeaRD) 2026 Workshop and Shared Tasks, which focused on the detection of insomnia in clinical notes derived from the MIMIC-III dataset. The task consisted of two subtasks: binary text classification to determine whether a patient is likely experiencing insomnia (Subtask 1), and multi-label classification combined with character-level evidence extraction to identify supporting evidence for specific insomnia criteria (Subtask 2). Eight teams participated, using approaches ranging from large language model (LLM) prompting and fine-tuned encoder models to hybrid rule-based pipelines. Results demonstrated that structured LLM pipelines with deterministic post-processing achieved the strongest overall performance, while character-level span extraction remained substantially harder than classification across all systems. These findings highlight both the promise of NLP for identifying underdiagnosed conditions in electronic health records and the ongoing difficulty of producing interpretable, evidence-grounded clinical predictions.

Anthology ID: 2026.smm4h-1.52
Volume: Proceedings of the 11th Social Media Mining for Health Research and Applications (SMM4H-HeaRD 2026) Workshop and Shared Tasks
Month: July
Year: 2026
Address: San Diego, United States
Editors: Guillermo Lopez-Garcia, Graciela Gonzalez-Hernandez
Venues: SMM4H | WS
Publisher: Association for Computational Linguistics
Pages: 345–352
URL: https://aclanthology.org/2026.smm4h-1.52/
Cite (ACL): Joey Chan, Lauren D. Gryboski, Guillermo Lopez-Garcia, and Graciela Gonzalez-Hernandez. 2026. Overview of #SMM4H-HeaRD 2026 - Task 2: Detection of Insomnia in Clinical Notes. In Proceedings of the 11th Social Media Mining for Health Research and Applications (SMM4H-HeaRD 2026) Workshop and Shared Tasks, pages 345–352, San Diego, United States. Association for Computational Linguistics.
Cite (Informal): Overview of #SMM4H-HeaRD 2026 - Task 2: Detection of Insomnia in Clinical Notes (Chan et al., SMM4H 2026)
PDF: https://aclanthology.org/2026.smm4h-1.52.pdf
