A new semantically annotated corpus with syntactic-semantic and cross-lingual senses

Myriam Rakho, Éric Laporte, Matthieu Constant


Abstract
In this article, we describe a new sense-tagged corpus for Word Sense Disambiguation. The corpus is constituted of instances of 20 French polysemous verbs. Each verb instance is annotated with three sense labels: (1) the actual translation of the verb in the english version of this instance in a parallel corpus, (2) an entry of the verb in a computational dictionary of French (the Lexicon-Grammar tables) and (3) a fine-grained sense label resulting from the concatenation of the translation and the Lexicon-Grammar entry.
Anthology ID:
L12-1234
Volume:
Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12)
Month:
May
Year:
2012
Address:
Istanbul, Turkey
Editors:
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Mehmet Uğur Doğan, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
597–600
Language:
URL:
http://www.lrec-conf.org/proceedings/lrec2012/pdf/447_Paper.pdf
DOI:
Bibkey:
Cite (ACL):
Myriam Rakho, Éric Laporte, and Matthieu Constant. 2012. A new semantically annotated corpus with syntactic-semantic and cross-lingual senses. In Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12), pages 597–600, Istanbul, Turkey. European Language Resources Association (ELRA).
Cite (Informal):
A new semantically annotated corpus with syntactic-semantic and cross-lingual senses (Rakho et al., LREC 2012)
Copy Citation:
PDF:
http://www.lrec-conf.org/proceedings/lrec2012/pdf/447_Paper.pdf