The Limits of Post-hoc Preference Adaptation: A Case Study on DSTC12 Clustering

Jihyun Lee; Gary Lee

The Limits of Post-hoc Preference Adaptation: A Case Study on DSTC12 Clustering

Abstract

Understanding user intent in dialogue is essential for controllable and coherent conversational AI. In this work, we present a case study on controllable theme induction in dialogue systems using the DSTC12 Track 2 dataset. Our pipeline integrates LLM-based summarization, utterance clustering, and synthetic preference modeling based on should-link and cannot-link predictions. While preference signals offer moderate improvements in cluster refinement, we observe that their effectiveness is significantly constrained by coarse initial clustering. Experiments on the Finance and Insurance domains show that even authentic human labeled preference struggle when initial clusters do not align with human intent. These findings highlight the need to incorporate preference supervision earlier in the pipeline to ensure semantically coherent clustering.

Anthology ID:: 2025.dstc-1.4
Volume:: Proceedings of the Twelfth Dialog System Technology Challenge
Month:: August
Year:: 2025
Address:: Avignon, France
Editors:: Behnam Hedayatnia, Vivian Chen, Zhang Chen, Raghav Gupta, Michel Galley
Venues:: DSTC | WS
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 36–43
Language:
URL:: https://aclanthology.org/2025.dstc-1.4/
DOI:
Bibkey:
Cite (ACL):: Jihyun Lee and Gary Lee. 2025. The Limits of Post-hoc Preference Adaptation: A Case Study on DSTC12 Clustering. In Proceedings of the Twelfth Dialog System Technology Challenge, pages 36–43, Avignon, France. Association for Computational Linguistics.
Cite (Informal):: The Limits of Post-hoc Preference Adaptation: A Case Study on DSTC12 Clustering (Lee & Lee, DSTC 2025)
Copy Citation:
PDF:: https://aclanthology.org/2025.dstc-1.4.pdf

PDF Cite Search Fix data