KMI: A Dataset of Korean Motivational Interviewing Dialogues for Psychotherapy

Hyunjong Kim; Suyeon Lee; Yeongjae Cho; Eunseo Ryu; Yohan Jo; Suran Seong; Sungzoon Cho

doi:10.18653/v1/2025.naacl-long.541

KMI: A Dataset of Korean Motivational Interviewing Dialogues for Psychotherapy

Hyunjong Kim, Suyeon Lee, Yeongjae Cho, Eunseo Ryu, Yohan Jo, Suran Seong, Sungzoon Cho

Abstract

The increasing demand for mental health services has led to the rise of AI-driven mental health chatbots, though challenges related to privacy, data collection, and expertise persist. Motivational Interviewing (MI) is gaining attention as a theoretical basis for boosting expertise in the development of these chatbots. However, existing datasets are showing limitations for training chatbots, leading to a substantial demand for publicly available resources in the field of MI and psychotherapy. These challenges are even more pronounced in non-English languages, where they receive less attention. In this paper, we propose a novel framework that simulates MI sessions enriched with the expertise of professional therapists. We train an MI forecaster model that mimics the behavioral choices of professional therapists and employ Large Language Models (LLMs) to generate utterances through prompt engineering. Then, we present KMI, the first synthetic dataset theoretically grounded in MI, containing 1,000 high-quality Korean Motivational Interviewing dialogues. Through an extensive expert evaluation of the generated dataset and the dialogue model trained on it, we demonstrate the quality, expertise, and practicality of KMI. We also introduce novel metrics derived from MI theory in order to evaluate dialogues from the perspective of MI.

Anthology ID:: 2025.naacl-long.541
Volume:: Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers)
Month:: April
Year:: 2025
Address:: Albuquerque, New Mexico
Editors:: Luis Chiruzzo, Alan Ritter, Lu Wang
Venue:: NAACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 10803–10828
Language:
URL:: https://aclanthology.org/2025.naacl-long.541/
DOI:: 10.18653/v1/2025.naacl-long.541
Bibkey:
Cite (ACL):: Hyunjong Kim, Suyeon Lee, Yeongjae Cho, Eunseo Ryu, Yohan Jo, Suran Seong, and Sungzoon Cho. 2025. KMI: A Dataset of Korean Motivational Interviewing Dialogues for Psychotherapy. In Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), pages 10803–10828, Albuquerque, New Mexico. Association for Computational Linguistics.
Cite (Informal):: KMI: A Dataset of Korean Motivational Interviewing Dialogues for Psychotherapy (Kim et al., NAACL 2025)
Copy Citation:
PDF:: https://aclanthology.org/2025.naacl-long.541.pdf

PDF Cite Search Fix data