Probing for Hyperbole in Pre-Trained Language Models

Nina Schneidermann, Daniel Hershcovich, Bolette Pedersen


Abstract
Hyperbole is a common figure of speech that remains under-explored in NLP research. In this study, we conduct edge probing and minimal description length (MDL) probing experiments on three pre-trained language models (PLMs) to explore the extent to which hyperbolic information is encoded in these models. We use both word-in-context and sentence-level representations as model inputs to enable comparison. We also annotate 63 hyperbolic sentences from the HYPO dataset according to an operational taxonomy and conduct an error analysis to explore how different hyperbole categories are encoded. Our results show that hyperbole is encoded in PLMs only to a limited extent, and mostly in the final layers. They also indicate that hyperbolic information may be better encoded by sentence-level representations, which, given the pragmatic nature of hyperbole, may provide a more accurate and informative representation in PLMs. Finally, the inter-annotator agreement for our annotations, a Cohen’s Kappa of 0.339, suggests that the taxonomy categories may not be intuitive and need revision or simplification.
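To illustrate the general setup the abstract describes, the following is a minimal sketch of sentence-level probing over a frozen PLM: extract a pooled sentence representation and train a lightweight classifier to predict hyperbole labels. The encoder name, mean pooling, logistic-regression probe, and example sentences are illustrative assumptions, not the authors' exact configuration.

```python
# Minimal sentence-level probing sketch (assumed setup, not the paper's code):
# freeze a pre-trained encoder, pool its final-layer states into a sentence
# vector, and train a simple probe on top.
import torch
from transformers import AutoTokenizer, AutoModel
from sklearn.linear_model import LogisticRegression

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
encoder = AutoModel.from_pretrained("bert-base-uncased")
encoder.eval()  # the PLM stays frozen; only the probe is trained

def sentence_embedding(sentence: str) -> torch.Tensor:
    """Mean-pool the final-layer token representations into one vector."""
    inputs = tokenizer(sentence, return_tensors="pt", truncation=True)
    with torch.no_grad():
        hidden = encoder(**inputs).last_hidden_state  # (1, seq_len, dim)
    return hidden.mean(dim=1).squeeze(0)

# Hypothetical labelled examples (1 = hyperbole, 0 = literal).
sentences = ["I have told you a million times.", "I have told you twice."]
labels = [1, 0]

features = torch.stack([sentence_embedding(s) for s in sentences]).numpy()
probe = LogisticRegression(max_iter=1000).fit(features, labels)
print(probe.predict(features))
```

An edge-probing or MDL variant would replace the logistic-regression probe with, respectively, a span-restricted classifier or an online-coding evaluation of the same representations.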
Anthology ID:
2023.acl-srw.30
Volume:
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 4: Student Research Workshop)
Month:
July
Year:
2023
Address:
Toronto, Canada
Editors:
Vishakh Padmakumar, Gisela Vallejo, Yao Fu
Venue:
ACL
Publisher:
Association for Computational Linguistics
Pages:
200–211
URL:
https://aclanthology.org/2023.acl-srw.30
DOI:
10.18653/v1/2023.acl-srw.30
Cite (ACL):
Nina Schneidermann, Daniel Hershcovich, and Bolette Pedersen. 2023. Probing for Hyperbole in Pre-Trained Language Models. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 4: Student Research Workshop), pages 200–211, Toronto, Canada. Association for Computational Linguistics.
Cite (Informal):
Probing for Hyperbole in Pre-Trained Language Models (Schneidermann et al., ACL 2023)
PDF:
https://aclanthology.org/2023.acl-srw.30.pdf
Video:
https://aclanthology.org/2023.acl-srw.30.mp4