Annotation Guidelines and Challenges for Automatic Simplification of Portuguese Drug Leaflets

Arthur Scalercio, Eduarda Bertotto, Silvana Jesus, Maria José Finatto, Aline Paes


Abstract
While most essential medicines have become widely accessible across all social strata in Brazil due to government initiatives and market shifts, a significant barrier remains: the technical complexity of medication leaflets. This pragmatic and linguistic gap hinders patient comprehension of critical risks and benefits. Thus, adapting these texts into plain language patterns is crucial for patient safety and treatment adherence. Large language models have been increasingly effective as practical solutions for text simplification, an important Natural Language Processing (NLP) task that serves as a basis for several other linguistic and computational tasks. However, the scarcity of annotated datasets remains a bottleneck for rigorous evaluation. To bridge this gap, we propose a streamlined pipeline for generating simplified medical leaflets and introduce an initial benchmark dataset of 30 expertly annotated samples. Our results, supported by semantic and morphosyntactic evaluations, demonstrate that the proposed method produces high-quality, simplified content suitable for health applications.
Anthology ID:
2026.propor-2.19
Volume:
Proceedings of the 17th International Conference on Computational Processing of Portuguese (PROPOR 2026) - Vol. 2
Month:
April
Year:
2026
Address:
Salvador, Brazil
Editors:
Marlo Souza, Iria de-Dios-Flores, Diana Santos, Larissa Freitas, Jackson Wilke da Cruz Souza, Eugénio Ribeiro
Venue:
PROPOR
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
121–127
Language:
URL:
https://aclanthology.org/2026.propor-2.19/
DOI:
Bibkey:
Cite (ACL):
Arthur Scalercio, Eduarda Bertotto, Silvana Jesus, Maria José Finatto, and Aline Paes. 2026. Annotation Guidelines and Challenges for Automatic Simplification of Portuguese Drug Leaflets. In Proceedings of the 17th International Conference on Computational Processing of Portuguese (PROPOR 2026) - Vol. 2, pages 121–127, Salvador, Brazil. Association for Computational Linguistics.
Cite (Informal):
Annotation Guidelines and Challenges for Automatic Simplification of Portuguese Drug Leaflets (Scalercio et al., PROPOR 2026)
Copy Citation:
PDF:
https://aclanthology.org/2026.propor-2.19.pdf