Can Large Language Models Understand Context?

Yilun Zhu, Joel Moniz, Shruti Bhargava, Jiarui Lu, Dhivya Piraviperumal, Site Li, Yuan Zhang, Hong Yu, Bo-Hsiang Tseng


Abstract
Understanding context is key to understanding human language, an ability that Large Language Models (LLMs) have increasingly been shown to demonstrate to an impressive extent. However, although LLM evaluation spans many domains within Natural Language Processing, limited attention has been paid to probing the models’ linguistic capability to understand contextual features. This paper introduces a context understanding benchmark by adapting existing datasets to suit the evaluation of generative models. The benchmark comprises four distinct tasks and nine datasets, all featuring prompts designed to assess the models’ ability to understand context. First, we evaluate the performance of pre-trained LLMs under the in-context learning setting. Experimental results indicate that pre-trained dense models struggle with understanding more nuanced contextual features when compared to state-of-the-art fine-tuned models. Second, as LLM compression holds growing significance in both research and real-world applications, we assess the context understanding of quantized models under in-context learning settings. We find that 3-bit post-training quantization leads to varying degrees of performance reduction on our benchmark. We conduct an extensive analysis of these scenarios to substantiate our experimental results.
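To make the evaluation setup concrete, the sketch below shows how a single in-context learning probe of the kind the abstract describes might be run. It is a minimal, hypothetical illustration, not the authors' evaluation code: the checkpoint name is a placeholder, the coreference-style prompt is invented for illustration, and a Hugging Face causal LM is assumed; the quantized comparison in the paper would substitute a (e.g., 3-bit post-training) quantized checkpoint behind the same interface.

```python
# Minimal sketch of a few-shot context-understanding probe (illustrative only).
# MODEL_NAME is a placeholder, not the checkpoint evaluated in the paper.
from transformers import AutoModelForCausalLM, AutoTokenizer

MODEL_NAME = "your-llm-checkpoint"  # placeholder; any causal LM works here

tokenizer = AutoTokenizer.from_pretrained(MODEL_NAME)
model = AutoModelForCausalLM.from_pretrained(MODEL_NAME, device_map="auto")

# One in-context example followed by a test query (coreference-style probe).
prompt = (
    "Resolve the pronoun to its antecedent.\n"
    "Context: Alice handed the report to Bob because he asked for it.\n"
    "Question: Who does 'he' refer to?\nAnswer: Bob\n\n"
    "Context: The trophy didn't fit in the suitcase because it was too big.\n"
    "Question: What does 'it' refer to?\nAnswer:"
)

inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
output = model.generate(**inputs, max_new_tokens=10, do_sample=False)

# Decode only the newly generated tokens as the model's answer.
answer = tokenizer.decode(
    output[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True
)
print(answer.strip())
```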
Anthology ID:
2024.findings-eacl.135
Volume:
Findings of the Association for Computational Linguistics: EACL 2024
Month:
March
Year:
2024
Address:
St. Julian’s, Malta
Editors:
Yvette Graham, Matthew Purver
Venue:
Findings
Publisher:
Association for Computational Linguistics
Pages:
2004–2018
URL:
https://aclanthology.org/2024.findings-eacl.135
Cite (ACL):
Yilun Zhu, Joel Moniz, Shruti Bhargava, Jiarui Lu, Dhivya Piraviperumal, Site Li, Yuan Zhang, Hong Yu, and Bo-Hsiang Tseng. 2024. Can Large Language Models Understand Context?. In Findings of the Association for Computational Linguistics: EACL 2024, pages 2004–2018, St. Julian’s, Malta. Association for Computational Linguistics.
Cite (Informal):
Can Large Language Models Understand Context? (Zhu et al., Findings 2024)
PDF:
https://aclanthology.org/2024.findings-eacl.135.pdf