Evaluating Spoken Language Features in Conversational Models: The Case of Disfluencies and Feedbacks

Oussama Silem; Maïwenn Fleig; Philippe Blache; Houda Oufaida; Leonor Becerra-Bonache

Evaluating Spoken Language Features in Conversational Models: The Case of Disfluencies and Feedbacks

Oussama Silem, Maïwenn Fleig, Philippe Blache, Houda Oufaida, Leonor Becerra-Bonache

Abstract

Understanding how language is processed and represented cognitively increasingly involves the use of specialized language models. Yet, because most models are predominantly trained on written text, they struggle to reflect the characteristics of language as it naturally unfolds in spoken interaction. This gap limits their capabilities in capturing features typical of spontaneous speech, such as repetitions, feedback cues, and hesitations. In this work, we introduce linguistically motivated evaluation metrics designed to target these specific spoken-language phenomena. We apply them to analyse outputs from language models fine-tuned on spoken English and French, comparing their behaviour statistically with human dialogue corpora. Our findings highlight the value of these metrics for assessing the degree to which model-generated utterances resemble authentic human conversation.

Anthology ID:: 2025.sigdial-1.23
Volume:: Proceedings of the 26th Annual Meeting of the Special Interest Group on Discourse and Dialogue
Month:: August
Year:: 2025
Address:: Avignon, France
Editors:: Frédéric Béchet, Fabrice Lefèvre, Nicholas Asher, Seokhwan Kim, Teva Merlin
Venue:: SIGDIAL
SIG:: SIGDIAL
Publisher:: Association for Computational Linguistics
Note:
Pages:: 285–293
Language:
URL:: https://aclanthology.org/2025.sigdial-1.23/
DOI:
Bibkey:
Cite (ACL):: Oussama Silem, Maïwenn Fleig, Philippe Blache, Houda Oufaida, and Leonor Becerra-Bonache. 2025. Evaluating Spoken Language Features in Conversational Models: The Case of Disfluencies and Feedbacks. In Proceedings of the 26th Annual Meeting of the Special Interest Group on Discourse and Dialogue, pages 285–293, Avignon, France. Association for Computational Linguistics.
Cite (Informal):: Evaluating Spoken Language Features in Conversational Models: The Case of Disfluencies and Feedbacks (Silem et al., SIGDIAL 2025)
Copy Citation:
PDF:: https://aclanthology.org/2025.sigdial-1.23.pdf

PDF Cite Search Fix data