UCS: Estimating Unseen Coverage for Improved In-Context Learning

Jiayi Xin; Xiang Li; Evan Qiang; Weiqing He; Tianqi Shang; Weijie J Su; Qi Long

UCS: Estimating Unseen Coverage for Improved In-Context Learning

Jiayi Xin, Xiang Li, Evan Qiang, Weiqing He, Tianqi Shang, Weijie J Su, Qi Long

Abstract

In-context learning (ICL) performance depends critically on which demonstrations are placed in the prompt, yet most existing selectors prioritize heuristic notions of relevance or diversity and provide limited insight into the coverage of a demonstration set. We propose Unseen Coverage Selection (UCS), a training-free, subset-level coverage prior motivated by the principle that a good demonstration set should expose the model to latent cluster unrevealed by the currently selected subset. UCS operationalizes this idea by (1) inducing discrete latent clusters from model-consistent embeddings and (2) estimating the number of unrevealed clusters within a candidate subset via a Smoothed Good-Turing estimator from its empirical frequency spectrum. Unlike previous selection methods, UCS is coverage-based and training-free, and can be seamlessly combined with both query-dependent and query-independent selection baselines via a simple regularized objective. Experiments on multiple intent-classification and reasoning benchmarks with frontier Large Language Models show that augmenting strong baselines with UCS consistently improves ICL accuracy by up to 2-6% under the same selection budget, while also yielding insights into task- and model-level latent cluster distributions. Code is available at https://github.com/Raina-Xin/UCS.

Anthology ID:: 2026.findings-acl.533
Volume:: Findings of the Association for Computational Linguistics: ACL 2026
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:: Findings
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 10965–10981
Language:
URL:: https://aclanthology.org/2026.findings-acl.533/
DOI:
Bibkey:
Cite (ACL):: Jiayi Xin, Xiang Li, Evan Qiang, Weiqing He, Tianqi Shang, Weijie J Su, and Qi Long. 2026. UCS: Estimating Unseen Coverage for Improved In-Context Learning. In Findings of the Association for Computational Linguistics: ACL 2026, pages 10965–10981, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: UCS: Estimating Unseen Coverage for Improved In-Context Learning (Xin et al., Findings 2026)
Copy Citation:
PDF:: https://aclanthology.org/2026.findings-acl.533.pdf
Checklist:: 2026.findings-acl.533.checklist.pdf

PDF Cite Search Checklist Fix data