Clinical Case Reports for NLP

Cyril Grouin, Natalia Grabar, Vincent Claveau, Thierry Hamon


Abstract
Textual data are useful for accessing expert information. Yet, since the texts are representative of distinct language uses, it is necessary to build specific corpora in order to be able to design suitable NLP tools. In some domains, such as medical domain, it may be complicated to access the representative textual data and their semantic annotations, while there exists a real need for providing efficient tools and methods. Our paper presents a corpus of clinical cases written in French, and their semantic annotations. Thus, we manually annotated a set of 717 files into four general categories (age, gender, outcome, and origin) for a total number of 2,835 annotations. The values of age, gender, and outcome are normalized. A subset with 70 files has been additionally manually annotated into 27 categories for a total number of 5,198 annotations.
Anthology ID:
W19-5029
Volume:
Proceedings of the 18th BioNLP Workshop and Shared Task
Month:
August
Year:
2019
Address:
Florence, Italy
Editors:
Dina Demner-Fushman, Kevin Bretonnel Cohen, Sophia Ananiadou, Junichi Tsujii
Venue:
BioNLP
SIG:
SIGBIOMED
Publisher:
Association for Computational Linguistics
Note:
Pages:
273–282
Language:
URL:
https://aclanthology.org/W19-5029
DOI:
10.18653/v1/W19-5029
Bibkey:
Cite (ACL):
Cyril Grouin, Natalia Grabar, Vincent Claveau, and Thierry Hamon. 2019. Clinical Case Reports for NLP. In Proceedings of the 18th BioNLP Workshop and Shared Task, pages 273–282, Florence, Italy. Association for Computational Linguistics.
Cite (Informal):
Clinical Case Reports for NLP (Grouin et al., BioNLP 2019)
Copy Citation:
PDF:
https://aclanthology.org/W19-5029.pdf