Completing the Princeton Annotated Gloss Corpus Project

Alexandre Rademaker, Bruno Cuconato, Alessandra Cid, Alexandre Tessarollo, Henrique Andrade


Abstract
In the Princeton WordNet Gloss Corpus, the word forms from the definitions (“glosses”) in WordNet’s synsets are manually linked to the context-appropriate sense in WordNet. The glosses thus become a sense-disambiguated corpus annotated against WordNet version 3.0. The result is also called a semantic concordance, which can be seen as both a lexicon (a WordNet extension) and an annotated corpus. In this work, we motivate and present the initial steps to complete the annotation of all open-class words in this corpus. Finally, we introduce a freely available annotation interface built as an Emacs extension and evaluate a preliminary annotation effort.
Anthology ID:
2019.gwc-1.48
Volume:
Proceedings of the 10th Global Wordnet Conference
Month:
July
Year:
2019
Address:
Wroclaw, Poland
Editors:
Piek Vossen, Christiane Fellbaum
Venue:
GWC
SIG:
SIGLEX
Publisher:
Global Wordnet Association
Pages:
378–386
URL:
https://aclanthology.org/2019.gwc-1.48
Cite (ACL):
Alexandre Rademaker, Bruno Cuconato, Alessandra Cid, Alexandre Tessarollo, and Henrique Andrade. 2019. Completing the Princeton Annotated Gloss Corpus Project. In Proceedings of the 10th Global Wordnet Conference, pages 378–386, Wroclaw, Poland. Global Wordnet Association.
Cite (Informal):
Completing the Princeton Annotated Gloss Corpus Project (Rademaker et al., GWC 2019)
PDF:
https://aclanthology.org/2019.gwc-1.48.pdf