Figure Me Out: A Gold Standard Dataset for Metaphor Interpretation

Omnia Zayed, John Philip McCrae, Paul Buitelaar


Abstract
Metaphor comprehension and understanding is a complex cognitive task that requires interpreting metaphors by grasping the interaction between the meaning of their target and source concepts. This is very challenging for humans, let alone computers. Thus, automatic metaphor interpretation is understudied in part due to the lack of publicly available datasets. The creation and manual annotation of such datasets is a demanding task which requires huge cognitive effort and time. Moreover, there will always be a question of accuracy and consistency of the annotated data due to the subjective nature of the problem. This work addresses these issues by presenting an annotation scheme to interpret verb-noun metaphoric expressions in text. The proposed approach is designed with the goal of reducing the workload on annotators and maintain consistency. Our methodology employs an automatic retrieval approach which utilises external lexical resources, word embeddings and semantic similarity to generate possible interpretations of identified metaphors in order to enable quick and accurate annotation. We validate our proposed approach by annotating around 1,500 metaphors in tweets which were annotated by six native English speakers. As a result of this work, we publish as linked data the first gold standard dataset for metaphor interpretation which will facilitate research in this area.
Anthology ID:
2020.lrec-1.712
Volume:
Proceedings of the Twelfth Language Resources and Evaluation Conference
Month:
May
Year:
2020
Address:
Marseille, France
Editors:
Nicoletta Calzolari, Frédéric Béchet, Philippe Blache, Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, Hélène Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
SIG:
Publisher:
European Language Resources Association
Note:
Pages:
5810–5819
Language:
English
URL:
https://aclanthology.org/2020.lrec-1.712
DOI:
Bibkey:
Cite (ACL):
Omnia Zayed, John Philip McCrae, and Paul Buitelaar. 2020. Figure Me Out: A Gold Standard Dataset for Metaphor Interpretation. In Proceedings of the Twelfth Language Resources and Evaluation Conference, pages 5810–5819, Marseille, France. European Language Resources Association.
Cite (Informal):
Figure Me Out: A Gold Standard Dataset for Metaphor Interpretation (Zayed et al., LREC 2020)
Copy Citation:
PDF:
https://aclanthology.org/2020.lrec-1.712.pdf