Depression and Self-Harm Risk Assessment in Online Forums

Andrew Yates, Arman Cohan, Nazli Goharian


Abstract
Users suffering from mental health conditions often turn to online resources for support, including specialized online support communities or general communities such as Twitter and Reddit. In this work, we present a framework for supporting and studying users in both types of communities. We propose methods for identifying posts in support communities that may indicate a risk of self-harm, and demonstrate that our approach outperforms strong previously proposed methods for identifying such posts. Self-harm is closely related to depression, which makes identifying depressed users on general forums a crucial related task. We introduce a large-scale general forum dataset consisting of users with self-reported depression diagnoses matched with control users. We show how our method can be applied to effectively identify depressed users from their use of language alone. We demonstrate that our method outperforms strong baselines on this general forum dataset.
Anthology ID:
D17-1322
Volume:
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing
Month:
September
Year:
2017
Address:
Copenhagen, Denmark
Editors:
Martha Palmer, Rebecca Hwa, Sebastian Riedel
Venue:
EMNLP
SIG:
SIGDAT
Publisher:
Association for Computational Linguistics
Note:
Pages:
2968–2978
Language:
URL:
https://aclanthology.org/D17-1322/
DOI:
10.18653/v1/D17-1322
Bibkey:
Cite (ACL):
Andrew Yates, Arman Cohan, and Nazli Goharian. 2017. Depression and Self-Harm Risk Assessment in Online Forums. In Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, pages 2968–2978, Copenhagen, Denmark. Association for Computational Linguistics.
Cite (Informal):
Depression and Self-Harm Risk Assessment in Online Forums (Yates et al., EMNLP 2017)
Copy Citation:
PDF:
https://aclanthology.org/D17-1322.pdf
Data
SDCNL (Suicide vs Depression Classification)