Identifying 1950s American Jazz Musicians: Fine-Grained IsA Extraction via Modifier Composition

Ellie Pavlick, Marius Paşca


Abstract
We present a method for populating fine-grained classes (e.g., “1950s American jazz musicians”) with instances (e.g., Charles Mingus ). While state-of-the-art methods tend to treat class labels as single lexical units, the proposed method considers each of the individual modifiers in the class label relative to the head. An evaluation on the task of reconstructing Wikipedia category pages demonstrates a >10 point increase in AUC, over a strong baseline relying on widely-used Hearst patterns.
Anthology ID:
P17-1192
Volume:
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:
July
Year:
2017
Address:
Vancouver, Canada
Editors:
Regina Barzilay, Min-Yen Kan
Venue:
ACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
2099–2109
Language:
URL:
https://aclanthology.org/P17-1192
DOI:
10.18653/v1/P17-1192
Bibkey:
Cite (ACL):
Ellie Pavlick and Marius Paşca. 2017. Identifying 1950s American Jazz Musicians: Fine-Grained IsA Extraction via Modifier Composition. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 2099–2109, Vancouver, Canada. Association for Computational Linguistics.
Cite (Informal):
Identifying 1950s American Jazz Musicians: Fine-Grained IsA Extraction via Modifier Composition (Pavlick & Paşca, ACL 2017)
Copy Citation:
PDF:
https://aclanthology.org/P17-1192.pdf
Note:
 P17-1192.Notes.pdf
Dataset:
 P17-1192.Datasets.tgz