Inducing Universal Semantic Tag Vectors

Da Huo, Gerard de Melo


Abstract
Given the well-established usefulness of part-of-speech tag annotations in many syntactically oriented downstream NLP tasks, the recently proposed notion of semantic tagging (Bjerva et al. 2016) aims at tagging words with tags informed by semantic distinctions, which are likely to be useful across a range of semantic tasks. To this end, their annotation scheme distinguishes, for instance, privative attributes from subsective ones. While annotated corpora exist, their size is limited and thus many words are out-of-vocabulary words. In this paper, we study to what extent we can automatically predict the tags associated with unseen words. We draw on large-scale word representation data to derive a large new Semantic Tag lexicon. Our experiments show that we can infer semantic tags for words with high accuracy both monolingually and cross-lingually.
Anthology ID:
2020.lrec-1.382
Volume:
Proceedings of the Twelfth Language Resources and Evaluation Conference
Month:
May
Year:
2020
Address:
Marseille, France
Editors:
Nicoletta Calzolari, Frédéric Béchet, Philippe Blache, Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, Hélène Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
SIG:
Publisher:
European Language Resources Association
Note:
Pages:
3121–3127
Language:
English
URL:
https://aclanthology.org/2020.lrec-1.382
DOI:
Bibkey:
Cite (ACL):
Da Huo and Gerard de Melo. 2020. Inducing Universal Semantic Tag Vectors. In Proceedings of the Twelfth Language Resources and Evaluation Conference, pages 3121–3127, Marseille, France. European Language Resources Association.
Cite (Informal):
Inducing Universal Semantic Tag Vectors (Huo & de Melo, LREC 2020)
Copy Citation:
PDF:
https://aclanthology.org/2020.lrec-1.382.pdf