It is not a piece of cake for GPT: Explaining Textual Entailment Recognition in the presence of Figurative Language

Giuseppe Gallipoli, Luca Cagliero


Abstract
Textual Entailment Recognition (TER) aims to predict whether a pair of premise-hypothesis sentences represents an entailment, a contradiction, or none of the above. Addressing TER in the presence of figurative language is particularly challenging because words are used in a way that deviates from the conventional order and meaning. In this work, we investigate the capabilities of Large Language Models (LLMs) to address TER and generate textual explanations of TER predictions. First, we evaluate LLM performance in Zero- and Few-Shot Learning settings, with and without using Chain-of-Thought prompting. After identifying the best prompts, we highlight the settings in which in-context learning is beneficial. The closed-source models GPT-3.5 Turbo and GPT-4o show unexpected limitations compared to significantly smaller open-source LLMs. Next, we thoroughly analyze the effect of LLM Fine-Tuning, showing substantial improvements in the quality of TER explanations compared to Zero- and Few-Shot Learning. Notably, 9 billion parameter open-source LLMs demonstrate again competitive performance against larger closed-source models. Finally, we compare our LLM-based approach with the state-of-the-art DREAM-FLUTE and Cross-Task architectures. The results show significant performance improvements, particularly in the quality of the generated explanations.
Anthology ID:
2025.coling-main.646
Volume:
Proceedings of the 31st International Conference on Computational Linguistics
Month:
January
Year:
2025
Address:
Abu Dhabi, UAE
Editors:
Owen Rambow, Leo Wanner, Marianna Apidianaki, Hend Al-Khalifa, Barbara Di Eugenio, Steven Schockaert
Venue:
COLING
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
9656–9674
Language:
URL:
https://aclanthology.org/2025.coling-main.646/
DOI:
Bibkey:
Cite (ACL):
Giuseppe Gallipoli and Luca Cagliero. 2025. It is not a piece of cake for GPT: Explaining Textual Entailment Recognition in the presence of Figurative Language. In Proceedings of the 31st International Conference on Computational Linguistics, pages 9656–9674, Abu Dhabi, UAE. Association for Computational Linguistics.
Cite (Informal):
It is not a piece of cake for GPT: Explaining Textual Entailment Recognition in the presence of Figurative Language (Gallipoli & Cagliero, COLING 2025)
Copy Citation:
PDF:
https://aclanthology.org/2025.coling-main.646.pdf