Generating and using probabilistic morphological resources for the biomedical domain

Vincent Claveau, Ewa Kijak


Abstract
In most Indo-European languages, many biomedical terms are rich morphological structures composed of several constituents mainly originating from Greek or Latin. The interpretation of these compounds are keystones to access information. In this paper, we present morphological resources aiming at coping with these biomedical morphological compounds. Following previous work (Claveau et al. 2011,Claveau et al. 12), these resources are automatically built using Japanese terms in Kanjis as a pivot language and alignment techniques. We show how these alignment information can be used for segmenting compounds, attaching semantic interpretation to each part, proposing definitions (gloses) of the compounds... When possible, these tasks are compared with state-of-the-art tools, and the results show the interest of our automatically built probabilistic resources.
Anthology ID:
L14-1004
Volume:
Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)
Month:
May
Year:
2014
Address:
Reykjavik, Iceland
Editors:
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Hrafn Loftsson, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
3348–3354
Language:
URL:
http://www.lrec-conf.org/proceedings/lrec2014/pdf/1003_Paper.pdf
DOI:
Bibkey:
Cite (ACL):
Vincent Claveau and Ewa Kijak. 2014. Generating and using probabilistic morphological resources for the biomedical domain. In Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14), pages 3348–3354, Reykjavik, Iceland. European Language Resources Association (ELRA).
Cite (Informal):
Generating and using probabilistic morphological resources for the biomedical domain (Claveau & Kijak, LREC 2014)
Copy Citation:
PDF:
http://www.lrec-conf.org/proceedings/lrec2014/pdf/1003_Paper.pdf