Towards Zero-Shot SLU: An Empirical Study of Competing Architectural Paradigms

Beomseok Lee; Marco Gaido; Ioan Calapodescu; Laurent Besacier; Matteo Negri

Towards Zero-Shot SLU: An Empirical Study of Competing Architectural Paradigms

Beomseok Lee, Marco Gaido, Ioan Calapodescu, Laurent Besacier, Matteo Negri

Abstract

Spoken Language Understanding (SLU) is crucial for enabling natural voice interactions with modern devices. However, traditional supervised models fail to generalize to new domains due to two key challenges: the prohibitive cost of data annotation and the inherent difficulty of transferring domain-specific intents. While the rise of Large Language Models (LLMs) offers a promising solution through zero-shot inference, the zero-shot SLU capabilities of emerging speech-enabled LLMs have remained largely unexplored. To address this gap, this paper provides the first comprehensive assessment, focusing on intent classification (IC), the first key sub-task of SLU, across 13 languages. We systematically evaluate a range of architectures, including cascaded, end-to-end, and hybrid systems for zero-shot SLU. Our analysis identifies the hybrid approach as the most effective architectural design for end-to-end SLU, and assesses multilingual transfer capabilities. The findings offer a detailed map of the challenges and opportunities, highlighting which models and settings are most promising for zero-shot SLU.

Anthology ID:: 2026.iwslt-1.1
Volume:: Proceedings of the 23rd International Conference on Spoken Language Translation (IWSLT 2026)
Month:: July
Year:: 2026
Address:: San Diego, USA (in-person and online)
Editors:: Elizabeth Salesky, Antonios Anastasopoulos, Matteo Negri, Marcello Federico
Venues:: IWSLT | WS
SIG:: SIGSLT
Publisher:: Association for Computational Linguistics
Note:
Pages:: 1–7
Language:
URL:: https://aclanthology.org/2026.iwslt-1.1/
DOI:
Bibkey:
Cite (ACL):: Beomseok Lee, Marco Gaido, Ioan Calapodescu, Laurent Besacier, and Matteo Negri. 2026. Towards Zero-Shot SLU: An Empirical Study of Competing Architectural Paradigms. In Proceedings of the 23rd International Conference on Spoken Language Translation (IWSLT 2026), pages 1–7, San Diego, USA (in-person and online). Association for Computational Linguistics.
Cite (Informal):: Towards Zero-Shot SLU: An Empirical Study of Competing Architectural Paradigms (Lee et al., IWSLT 2026)
Copy Citation:
PDF:: https://aclanthology.org/2026.iwslt-1.1.pdf

PDF Cite Search Fix data