Scaling Performance and Low-Resource Annotation with Many-Shot In-Context Learning for Named Entity Recognition

Qi Zhang; Fangping Lan; Cornelia Caragea; Longin Jan Latecki; Eduard Dragut

Scaling Performance and Low-Resource Annotation with Many-Shot In-Context Learning for Named Entity Recognition

Qi Zhang, Fangping Lan, Cornelia Caragea, Longin Jan Latecki, Eduard Dragut

Abstract

In-context learning (ICL) with large language models (LLMs) has emerged as a powerful alternative to fine-tuning for Named Entity Recognition (NER), achieving strong performance with minimal annotation and no additional training. However, prior work has shown that despite their adaptability, LLMs still lag behind fully supervised models such as fine-tuned BERT in structured tasks like NER. While existing studies on ICL for NER have mainly explored few-shot settings, the potential of scaling to hundreds of demonstrations has not been thoroughly investigated. To address this gap, we conduct a comprehensive investigation of many-shot ICL for NER and further explore its effectiveness in annotating and refining data for low-resource NER tasks. Specifically, we evaluate various LLMs across multiple domains using hundreds of ICL examples and then assess the feasibility of using many-shot ICL as a data annotation framework. Our experiments demonstrate that: (1) scaling to hundreds of in-context examples enables LLMs to match or even surpass the performance of fully supervised BERT models; and (2) using about one hundred human-labeled examples as demonstrations, many-shot in-context annotation can generate high-quality labeled data, leading to approximately 10% absolute F1 improvement over existing state-of-the-art approaches when used to fine-tune BERT on low-resource NER.

Anthology ID:: 2026.findings-acl.1431
Volume:: Findings of the Association for Computational Linguistics: ACL 2026
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:: Findings
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 28653–28673
Language:
URL:: https://aclanthology.org/2026.findings-acl.1431/
DOI:
Bibkey:
Cite (ACL):: Qi Zhang, Fangping Lan, Cornelia Caragea, Longin Jan Latecki, and Eduard Dragut. 2026. Scaling Performance and Low-Resource Annotation with Many-Shot In-Context Learning for Named Entity Recognition. In Findings of the Association for Computational Linguistics: ACL 2026, pages 28653–28673, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: Scaling Performance and Low-Resource Annotation with Many-Shot In-Context Learning for Named Entity Recognition (Zhang et al., Findings 2026)
Copy Citation:
PDF:: https://aclanthology.org/2026.findings-acl.1431.pdf
Checklist:: 2026.findings-acl.1431.checklist.pdf

PDF Cite Search Checklist Fix data