ICLAD: In-Context Learning with Comparison-Guidance for Audio Deepfake Detection

Benjamin Shiue-Hal Chou; Yi Zhu; Surya Koppisetti

ICLAD: In-Context Learning with Comparison-Guidance for Audio Deepfake Detection

Benjamin Shiue-Hal Chou, Yi Zhu, Surya Koppisetti

Abstract

Audio deepfakes pose a significant security threat, yet current state-of-the-art (SOTA) detection systems do not generalize well to realistic in-the-wild deepfakes. We introduce a novel In-Context Learning paradigm with comparison-guidance for Audio Deepfake detection (ICLAD). The framework enables the use of audio language models (ALMs) for training-free generalization to unseen deepfakes and provides rich textual explanations on the detection outcome. At the core of ICLAD is a pairwise comparative reasoning strategy that guides the ALM to discover and filter hallucinations and deepfake-irrelevant acoustic attributes. The ALM works alongside a specialized deepfake detector, whereby a routing mechanism feeds out-of-distribution samples to the ALM. On in-the-wild datasets, ICLAD improves macro F1 over the specialized detector, with up to 2× relative improvement. Further analysis demonstrates the flexibility of ICLAD and its potential for deployment on recent open-source ALMs.

Anthology ID:: 2026.findings-acl.450
Volume:: Findings of the Association for Computational Linguistics: ACL 2026
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:: Findings
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 9242–9256
Language:
URL:: https://aclanthology.org/2026.findings-acl.450/
DOI:
Bibkey:
Cite (ACL):: Benjamin Shiue-Hal Chou, Yi Zhu, and Surya Koppisetti. 2026. ICLAD: In-Context Learning with Comparison-Guidance for Audio Deepfake Detection. In Findings of the Association for Computational Linguistics: ACL 2026, pages 9242–9256, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: ICLAD: In-Context Learning with Comparison-Guidance for Audio Deepfake Detection (Chou et al., Findings 2026)
Copy Citation:
PDF:: https://aclanthology.org/2026.findings-acl.450.pdf
Checklist:: 2026.findings-acl.450.checklist.pdf

PDF Cite Search Checklist Fix data