Chamain: Harmonizing Character Persona Integrity with Domain-Adaptive Knowledge in Dialogue Generation

Seung-Moo Yang, Jeehyun Lee, Won Ik Cho


Abstract
Recent advances in large language models (LLMs) have shown their capacity for generating natural dialogues, leveraging extensive pre-trained knowledge. However, the seamless integration of domain-specific knowledge into dialogue agents, without undermining their personas or unique textual style, remains a challenging task. Traditional approaches, such as constructing knowledge-aware character dialogue datasets or training LLMs from the ground up, require considerable resources. Sequentially fine-tuning character chatbots across multiple datasets or applying existing merging techniques often leads to catastrophic forgetting, resulting in the loss of both knowledge and the character’s distinct persona. This compromises the model’s ability to consistently generate character-driven dialogues within a user-centric framework. In this context, we introduce a novel model merging method, Chamain, which effortlessly enhances the performance of character models, much like finding a “free lunch”. Chamain merges domain-specific knowledge into a character model by parameter-wise weight combination of instruction-tuned models and learns to reflect persona’s unique characteristics and style through Layer-wise merging. Our experiments demonstrate that Chamain effectively maintains style while also solving domain-specific problems to a certain extent compared to the baselines, even showing a higher style probability compared to the character model in legal QA.
Anthology ID:
2024.nlp4convai-1.7
Volume:
Proceedings of the 6th Workshop on NLP for Conversational AI (NLP4ConvAI 2024)
Month:
August
Year:
2024
Address:
Bangkok, Thailand
Editors:
Elnaz Nouri, Abhinav Rastogi, Georgios Spithourakis, Bing Liu, Yun-Nung Chen, Yu Li, Alon Albalak, Hiromi Wakaki, Alexandros Papangelis
Venues:
NLP4ConvAI | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
101–113
Language:
URL:
https://aclanthology.org/2024.nlp4convai-1.7
DOI:
Bibkey:
Cite (ACL):
Seung-Moo Yang, Jeehyun Lee, and Won Ik Cho. 2024. Chamain: Harmonizing Character Persona Integrity with Domain-Adaptive Knowledge in Dialogue Generation. In Proceedings of the 6th Workshop on NLP for Conversational AI (NLP4ConvAI 2024), pages 101–113, Bangkok, Thailand. Association for Computational Linguistics.
Cite (Informal):
Chamain: Harmonizing Character Persona Integrity with Domain-Adaptive Knowledge in Dialogue Generation (Yang et al., NLP4ConvAI-WS 2024)
Copy Citation:
PDF:
https://aclanthology.org/2024.nlp4convai-1.7.pdf