ICLEval: Evaluating In-Context Learning Ability of Large Language Models

Wentong Chen; Yankai Lin; ZhenHao Zhou; HongYun Huang; Yantao Jia; Zhao Cao; Ji-Rong Wen

ICLEval: Evaluating In-Context Learning Ability of Large Language Models

Wentong Chen, Yankai Lin, ZhenHao Zhou, HongYun Huang, YanTao Jia, Zhao Cao, Ji-Rong Wen

Abstract

In-Context Learning (ICL) is a critical capability of Large Language Models (LLMs) as it empowers them to comprehend and reason across interconnected inputs. Evaluating the ICL ability of LLMs can enhance their utilization and deepen our understanding of how this ability is acquired at the training stage. However, existing evaluation frameworks primarily focus on language abilities and knowledge, often overlooking the assessment of ICL ability. In this work, we introduce the ICLEval benchmark to evaluate the ICL abilities of LLMs, which encompasses two key sub-abilities: exact copying and rule learning. Through the ICLEval benchmark, we demonstrate that ICL ability is universally present in different LLMs, and model size is not the sole determinant of ICL efficacy. Surprisingly, we observe that ICL abilities, particularly copying, develop early in the pretraining process and stabilize afterward.

Anthology ID:: 2025.coling-main.693
Volume:: Proceedings of the 31st International Conference on Computational Linguistics
Month:: January
Year:: 2025
Address:: Abu Dhabi, UAE
Editors:: Owen Rambow, Leo Wanner, Marianna Apidianaki, Hend Al-Khalifa, Barbara Di Eugenio, Steven Schockaert
Venue:: COLING
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 10398–10422
Language:
URL:: https://aclanthology.org/2025.coling-main.693/
DOI:
Bibkey:
Cite (ACL):: Wentong Chen, Yankai Lin, ZhenHao Zhou, HongYun Huang, YanTao Jia, Zhao Cao, and Ji-Rong Wen. 2025. ICLEval: Evaluating In-Context Learning Ability of Large Language Models. In Proceedings of the 31st International Conference on Computational Linguistics, pages 10398–10422, Abu Dhabi, UAE. Association for Computational Linguistics.
Cite (Informal):: ICLEval: Evaluating In-Context Learning Ability of Large Language Models (Chen et al., COLING 2025)
Copy Citation:
PDF:: https://aclanthology.org/2025.coling-main.693.pdf

PDF Cite Search Fix data