DEPAC: a Corpus for Depression and Anxiety Detection from Speech

Mashrura Tasnim, Malikeh Ehghaghi, Brian Diep, Jekaterina Novikova


Abstract
Mental distress like depression and anxiety contribute to the largest proportion of the global burden of diseases. Automated diagnosis system of such disorders, empowered by recent innovations in Artificial Intelligence, can pave the way to reduce the sufferings of the affected individuals. Development of such systems requires information-rich and balanced corpora. In this work, we introduce a novel mental distress analysis audio dataset DEPAC, labelled based on established thresholds on depression and anxiety standard screening tools. This large dataset comprises multiple speech tasks per individual, as well as relevant demographic information. Alongside, we present a feature set consisting of hand-curated acoustic and linguistic features, which were found effective in identifying signs of mental illnesses in human speech. Finally, we justify the quality and effectiveness of our proposed audio corpus and feature set in predicting depression severity by comparing the performance of baseline machine learning models built on this dataset with baseline models trained on other well-known depression corpora.
Anthology ID:
2022.clpsych-1.1
Volume:
Proceedings of the Eighth Workshop on Computational Linguistics and Clinical Psychology
Month:
July
Year:
2022
Address:
Seattle, USA
Editors:
Ayah Zirikly, Dana Atzil-Slonim, Maria Liakata, Steven Bedrick, Bart Desmet, Molly Ireland, Andrew Lee, Sean MacAvaney, Matthew Purver, Rebecca Resnik, Andrew Yates
Venue:
CLPsych
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
1–16
Language:
URL:
https://aclanthology.org/2022.clpsych-1.1
DOI:
10.18653/v1/2022.clpsych-1.1
Bibkey:
Cite (ACL):
Mashrura Tasnim, Malikeh Ehghaghi, Brian Diep, and Jekaterina Novikova. 2022. DEPAC: a Corpus for Depression and Anxiety Detection from Speech. In Proceedings of the Eighth Workshop on Computational Linguistics and Clinical Psychology, pages 1–16, Seattle, USA. Association for Computational Linguistics.
Cite (Informal):
DEPAC: a Corpus for Depression and Anxiety Detection from Speech (Tasnim et al., CLPsych 2022)
Copy Citation:
PDF:
https://aclanthology.org/2022.clpsych-1.1.pdf
Appendix:
 2022.clpsych-1.1.appendix.pdf
Video:
 https://aclanthology.org/2022.clpsych-1.1.mp4