Beyond Query Bias: Candidate-Aware Iterative Refinement for Zero-Shot Composed Image Retrieval

Nan Sun; Jing Tang; Lei Sun; Rui Chen (陈蕊); Yuxing Lu; Xiangxiang Chu; Hefei Ling; Yujun Cai

Abstract

Zero-Shot Composed Image Retrieval (ZS-CIR) retrieves target images from a reference image and a modification text without task-specific training. Existing methods typically rely on MLLMs to generate query vectors with pre-trained models like CLIP. However, these constructed queries suffer from an inherent cognitive bias because the candidate distribution is unknown when the query is built. We propose CoRR, a training-free framework that reframes ZS-CIR as a self-correcting process through bias-aware query refinement. CoRR uses retrieved results as feedback to perceive the candidate distribution. With carefully designed CoT prompting, the MLLM inspects the retrieved candidates to identify intent misalignments in the query and then corrects them via Historical Query Fusion. We also introduce Retrieval-Driven Caption Optimization, which provides context-aligned examples and reduces phrasing and style mismatches. Experiments on public benchmarks show that CoRR significantly outperforms other SOTA methods.
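
The abstract describes a retrieve, inspect, and correct loop without giving its details. The following is a minimal sketch of that general pattern only, not the authors' implementation. Everything in it is an assumption made for illustration: the function names, the `revise` callback standing in for the MLLM's CoT-based correction, the exponentially decayed average used as a stand-in for Historical Query Fusion, and the toy 2-D vectors used in place of CLIP embeddings.

```python
import math

def normalize(v):
    """Scale a vector to unit length (zero vectors are returned unchanged)."""
    n = math.sqrt(sum(x * x for x in v))
    return [x / n for x in v] if n > 0 else list(v)

def cosine_top_k(query, candidates, k):
    """Return the indices of the k candidates most similar to the query."""
    q = normalize(query)
    scores = [sum(a * b for a, b in zip(q, normalize(c))) for c in candidates]
    return sorted(range(len(candidates)), key=lambda i: scores[i], reverse=True)[:k]

def fuse_history(history, decay=0.5):
    """Assumed fusion rule: decayed average of all past queries, newest weighted most."""
    weights = [decay ** (len(history) - 1 - i) for i in range(len(history))]
    dim = len(history[0])
    fused = [sum(w * q[d] for w, q in zip(weights, history)) for d in range(dim)]
    return normalize(fused)

def refine_loop(initial_query, candidates, revise, rounds=3, k=1):
    """Retrieve, let `revise` inspect the results, then fuse the revised query with history."""
    history = [normalize(initial_query)]
    query = history[0]
    for _ in range(rounds):
        top = cosine_top_k(query, candidates, k)
        # `revise` is a hypothetical stand-in for the MLLM correction step.
        revised = revise(query, [candidates[i] for i in top])
        history.append(normalize(revised))
        query = fuse_history(history)
    return cosine_top_k(query, candidates, k), query

# Toy usage: the stub ignores what was retrieved and simply shifts weight
# from the first dimension to the second on every round.
def toy_revise(query, retrieved):
    return [query[0] * 0.5, query[1] + 0.5]

toy_candidates = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
final_top, final_query = refine_loop([1.0, 0.1], toy_candidates, toy_revise)
```

In this toy setup the initial query ranks the first candidate highest, and the fused query drifts away from it over the rounds. A real system would replace `toy_revise` with a model call that actually reads the retrieved candidates.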

Anthology ID: 2026.findings-acl.1120
Volume: Findings of the Association for Computational Linguistics: ACL 2026
Month: July
Year: 2026
Address: San Diego, California, United States
Editors: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue: Findings
Publisher: Association for Computational Linguistics
Pages: 22318–22329
URL: https://aclanthology.org/2026.findings-acl.1120/
Cite (ACL): Nan Sun, Jing Tang, Lei Sun, Rui Chen, Yuxing Lu, Xiangxiang Chu, Hefei Ling, and Yujun Cai. 2026. Beyond Query Bias: Candidate-Aware Iterative Refinement for Zero-Shot Composed Image Retrieval. In Findings of the Association for Computational Linguistics: ACL 2026, pages 22318–22329, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal): Beyond Query Bias: Candidate-Aware Iterative Refinement for Zero-Shot Composed Image Retrieval (Sun et al., Findings 2026)
PDF: https://aclanthology.org/2026.findings-acl.1120.pdf
Checklist: 2026.findings-acl.1120.checklist.pdf
