Annotation of Entities and Relations in Spanish Radiology Reports

Viviana Cotik, Darío Filippo, Roland Roller, Hans Uszkoreit, Feiyu Xu


Abstract
Radiology reports express the results of a radiology study and contain information about anatomical entities, findings, measures and impressions of the medical doctor. The use of information extraction techniques can help physicians to access this information in order to understand data and to infer further knowledge. Supervised machine learning methods are very popular to address information extraction, but are usually domain and language dependent. To train new classification models, annotated data is required. Moreover, annotated data is also required as an evaluation resource of information extraction algorithms. However, one major drawback of processing clinical data is the low availability of annotated datasets. For this reason we performed a manual annotation of radiology reports written in Spanish. This paper presents the corpus, the annotation schema, the annotation guidelines and further insight of the data.
Anthology ID:
R17-1025
Volume:
Proceedings of the International Conference Recent Advances in Natural Language Processing, RANLP 2017
Month:
September
Year:
2017
Address:
Varna, Bulgaria
Editors:
Ruslan Mitkov, Galia Angelova
Venue:
RANLP
SIG:
Publisher:
INCOMA Ltd.
Note:
Pages:
177–184
Language:
URL:
https://doi.org/10.26615/978-954-452-049-6_025
DOI:
10.26615/978-954-452-049-6_025
Bibkey:
Cite (ACL):
Viviana Cotik, Darío Filippo, Roland Roller, Hans Uszkoreit, and Feiyu Xu. 2017. Annotation of Entities and Relations in Spanish Radiology Reports. In Proceedings of the International Conference Recent Advances in Natural Language Processing, RANLP 2017, pages 177–184, Varna, Bulgaria. INCOMA Ltd..
Cite (Informal):
Annotation of Entities and Relations in Spanish Radiology Reports (Cotik et al., RANLP 2017)
Copy Citation:
PDF:
https://doi.org/10.26615/978-954-452-049-6_025