CAS: French Corpus with Clinical Cases

Natalia Grabar, Vincent Claveau, Clément Dalloux


Abstract
Textual corpora are extremely important for various NLP applications as they provide information necessary for creating, setting and testing these applications and the corresponding tools. They are also crucial for designing reliable methods and reproducible results. Yet, in some areas, such as the medical area, due to confidentiality or to ethical reasons, it is complicated and even impossible to access textual data representative of those produced in these areas. We propose the CAS corpus built with clinical cases, such as they are reported in the published scientific literature in French. We describe this corpus, currently containing over 397,000 word occurrences, and the existing linguistic and semantic annotations.
Anthology ID:
W18-5614
Volume:
Proceedings of the Ninth International Workshop on Health Text Mining and Information Analysis
Month:
October
Year:
2018
Address:
Brussels, Belgium
Editors:
Alberto Lavelli, Anne-Lyse Minard, Fabio Rinaldi
Venue:
Louhi
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
122–128
Language:
URL:
https://aclanthology.org/W18-5614
DOI:
10.18653/v1/W18-5614
Bibkey:
Cite (ACL):
Natalia Grabar, Vincent Claveau, and Clément Dalloux. 2018. CAS: French Corpus with Clinical Cases. In Proceedings of the Ninth International Workshop on Health Text Mining and Information Analysis, pages 122–128, Brussels, Belgium. Association for Computational Linguistics.
Cite (Informal):
CAS: French Corpus with Clinical Cases (Grabar et al., Louhi 2018)
Copy Citation:
PDF:
https://aclanthology.org/W18-5614.pdf