IDRE: AI Generated Dataset for Enhancing Empathetic Chatbot Interactions in Italian Language.

Simone Manai, Laura Gemme, Roberto Zanoli, Alberto Lavelli


Abstract
This paper introduces IDRE (Italian Dataset for Rephrasing with Empathy), a novel automatically generated Italian linguistic dataset. IDRE comprises typical chatbot user utterances in the healthcare domain, corresponding chatbot responses, and empathetically enhanced chatbot responses. The dataset was generated using the Llama2 language model and evaluated by human raters based on predefined metrics. The IDRE dataset offers a comprehensive and realistic collection of Italian chatbot-user interactions suitable for training and refining chatbot models in the healthcare domain. This facilitates the development of chatbots capable of natural and productive conversations with healthcare users. Notably, the dataset incorporates empathetically enhanced chatbot responses, enabling researchers to investigate the effects of empathetic language on fostering more positive and engaging human-machine interactions within healthcare settings. The methodology employed for the construction of the IDRE dataset can be extended to generate phrases in additional languages and domains, thereby expanding its applicability and utility. The IDRE dataset is publicly available for research purposes.
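As a rough illustration of the three-part structure described in the abstract (user utterance, chatbot response, empathetically enhanced response), one entry could be modelled as below. This is only a sketch: the class name `IdreRecord`, its field names, the helper `to_rephrasing_pair`, and the Italian sentences are all hypothetical placeholders. They are not taken from the released dataset, whose actual file format and column names are not stated here.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class IdreRecord:
    """One hypothetical IDRE-style entry; field names are assumptions."""
    user_utterance: str        # what the healthcare user writes
    chatbot_response: str      # the plain chatbot reply
    empathetic_response: str   # the same reply rephrased with empathy


def to_rephrasing_pair(record: IdreRecord) -> dict:
    """Turn a record into a source/target pair for a rephrasing model."""
    return {
        "source": record.chatbot_response,
        "target": record.empathetic_response,
    }


# Invented placeholder sentences, NOT drawn from the dataset.
example = IdreRecord(
    user_utterance="Vorrei prenotare una visita cardiologica.",
    chatbot_response="Indichi la data preferita.",
    empathetic_response="Certo, la aiuto volentieri. Mi indichi pure la data che preferisce.",
)

pair = to_rephrasing_pair(example)
```

Pairing the plain response with its empathetic rewrite reflects the rephrasing use case suggested by the dataset's name; the user utterance could be kept alongside as context if a model needs it.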
Anthology ID: 2024.clicit-1.113
Volume: Proceedings of the 10th Italian Conference on Computational Linguistics (CLiC-it 2024)
Month: December
Year: 2024
Address: Pisa, Italy
Editors: Felice Dell'Orletta, Alessandro Lenci, Simonetta Montemagni, Rachele Sprugnoli
Venue: CLiC-it
Publisher: CEUR Workshop Proceedings
Pages: 1036–1042
URL: https://aclanthology.org/2024.clicit-1.113/
Cite (ACL): Simone Manai, Laura Gemme, Roberto Zanoli, and Alberto Lavelli. 2024. IDRE: AI Generated Dataset for Enhancing Empathetic Chatbot Interactions in Italian Language. In Proceedings of the 10th Italian Conference on Computational Linguistics (CLiC-it 2024), pages 1036–1042, Pisa, Italy. CEUR Workshop Proceedings.
Cite (Informal): IDRE: AI Generated Dataset for Enhancing Empathetic Chatbot Interactions in Italian Language. (Manai et al., CLiC-it 2024)
PDF: https://aclanthology.org/2024.clicit-1.113.pdf