A Laypeople Study on Terminology Identification across Domains and Task Definitions

Anna Hätty, Sabine Schulte im Walde


Abstract
This paper introduces a new dataset of term annotation. Given that even experts vary significantly in their understanding of termhood, and that term identification is mostly performed as a binary task, we offer a novel perspective to explore the common, natural understanding of what constitutes a term: Laypeople annotate single-word and multi-word terms, across four domains and across four task definitions. Analyses based on inter-annotator agreement offer insights into differences in term specificity, term granularity and subtermhood.
Anthology ID:
N18-2052
Volume:
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 2 (Short Papers)
Month:
June
Year:
2018
Address:
New Orleans, Louisiana
Editors:
Marilyn Walker, Heng Ji, Amanda Stent
Venue:
NAACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
321–326
Language:
URL:
https://aclanthology.org/N18-2052
DOI:
10.18653/v1/N18-2052
Bibkey:
Cite (ACL):
Anna Hätty and Sabine Schulte im Walde. 2018. A Laypeople Study on Terminology Identification across Domains and Task Definitions. In Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 2 (Short Papers), pages 321–326, New Orleans, Louisiana. Association for Computational Linguistics.
Cite (Informal):
A Laypeople Study on Terminology Identification across Domains and Task Definitions (Hätty & Schulte im Walde, NAACL 2018)
Copy Citation:
PDF:
https://aclanthology.org/N18-2052.pdf