Mental Health Surveillance over Social Media with Digital Cohorts

Silvio Amir, Mark Dredze, John W. Ayers


Abstract
The ability to track mental health conditions via social media opened the doors for large-scale, automated, mental health surveillance. However, inferring accurate population-level trends requires representative samples of the underlying population, which can be challenging given the biases inherent in social media data. While previous work has adjusted samples based on demographic estimates, the populations were selected based on specific outcomes, e.g. specific mental health conditions. We depart from these methods, by conducting analyses over demographically representative digital cohorts of social media users. To validated this approach, we constructed a cohort of US based Twitter users to measure the prevalence of depression and PTSD, and investigate how these illnesses manifest across demographic subpopulations. The analysis demonstrates that cohort-based studies can help control for sampling biases, contextualize outcomes, and provide deeper insights into the data.
Anthology ID:
W19-3013
Volume:
Proceedings of the Sixth Workshop on Computational Linguistics and Clinical Psychology
Month:
June
Year:
2019
Address:
Minneapolis, Minnesota
Editors:
Kate Niederhoffer, Kristy Hollingshead, Philip Resnik, Rebecca Resnik, Kate Loveys
Venue:
CLPsych
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
114–120
Language:
URL:
https://aclanthology.org/W19-3013
DOI:
10.18653/v1/W19-3013
Bibkey:
Cite (ACL):
Silvio Amir, Mark Dredze, and John W. Ayers. 2019. Mental Health Surveillance over Social Media with Digital Cohorts. In Proceedings of the Sixth Workshop on Computational Linguistics and Clinical Psychology, pages 114–120, Minneapolis, Minnesota. Association for Computational Linguistics.
Cite (Informal):
Mental Health Surveillance over Social Media with Digital Cohorts (Amir et al., CLPsych 2019)
Copy Citation:
PDF:
https://aclanthology.org/W19-3013.pdf