Detection of Mental Health from Reddit via Deep Contextualized Representations

Zheng Ping Jiang; Sarah Ita Levitan; Jonathan Zomick; Julia Hirschberg

doi:10.18653/v1/2020.louhi-1.16

Detection of Mental Health from Reddit via Deep Contextualized Representations

Zhengping Jiang, Sarah Ita Levitan, Jonathan Zomick, Julia Hirschberg

Correct Metadata for

Use this form to create a GitHub issue with structured data describing the correction. You will need a GitHub account. Once you create that issue, the correction will be reviewed by a staff member.

⚠️ Mobile Users: Submitting this form to create a new issue will only work with github.com, not the GitHub Mobile app.

Important: The Anthology treat PDFs as authoritative. Please use this form only to correct data that is out of line with the PDF. See our corrections guidelines if you need to change the PDF.

Title Adjust the title. Retain tags such as <fixed-case>.

Authors Adjust author names and order to match the PDF.

Abstract Correct abstract if needed. Retain XML formatting tags such as <tex-math>. You may use <b>...</b> for bold, <i>...</i> for italic, and <url>...</url> for URLs.

Verification against PDF Ensure that the new title/authors match the snapshot below. (If there is no snapshot or it is too small, consult the PDF.)

Authors concatenated from the text boxes above:

ALL author names match the snapshot above—including middle initials, hyphens, and accents.

Abstract

We address the problem of automatic detection of psychiatric disorders from the linguistic content of social media posts. We build a large scale dataset of Reddit posts from users with eight disorders and a control user group. We extract and analyze linguistic characteristics of posts and identify differences between diagnostic groups. We build strong classification models based on deep contextualized word representations and show that they outperform previously applied statistical models with simple linguistic features by large margins. We compare user-level and post-level classification performance, as well as an ensembled multiclass model.

Anthology ID:: 2020.louhi-1.16
Volume:: Proceedings of the 11th International Workshop on Health Text Mining and Information Analysis
Month:: November
Year:: 2020
Address:: Online
Editors:: Eben Holderness, Antonio Jimeno Yepes, Alberto Lavelli, Anne-Lyse Minard, James Pustejovsky, Fabio Rinaldi
Venue:: Louhi
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 147–156
Language:
URL:: https://aclanthology.org/2020.louhi-1.16/
DOI:: 10.18653/v1/2020.louhi-1.16
Bibkey:
Cite (ACL):: Zhengping Jiang, Sarah Ita Levitan, Jonathan Zomick, and Julia Hirschberg. 2020. Detection of Mental Health from Reddit via Deep Contextualized Representations. In Proceedings of the 11th International Workshop on Health Text Mining and Information Analysis, pages 147–156, Online. Association for Computational Linguistics.
Cite (Informal):: Detection of Mental Health from Reddit via Deep Contextualized Representations (Jiang et al., Louhi 2020)
Copy Citation:
PDF:: https://aclanthology.org/2020.louhi-1.16.pdf
Video:: https://slideslive.com/38940049

PDF Cite Search Video Fix data

Export citation

BibTeX
MODS XML
Endnote
Preformatted

@inproceedings{jiang-etal-2020-detection,
    title = "Detection of Mental Health from {R}eddit via Deep Contextualized Representations",
    author = "Jiang, Zhengping  and
      Levitan, Sarah Ita  and
      Zomick, Jonathan  and
      Hirschberg, Julia",
    editor = "Holderness, Eben  and
      Jimeno Yepes, Antonio  and
      Lavelli, Alberto  and
      Minard, Anne-Lyse  and
      Pustejovsky, James  and
      Rinaldi, Fabio",
    booktitle = "Proceedings of the 11th International Workshop on Health Text Mining and Information Analysis",
    month = nov,
    year = "2020",
    address = "Online",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2020.louhi-1.16/",
    doi = "10.18653/v1/2020.louhi-1.16",
    pages = "147--156",
    abstract = "We address the problem of automatic detection of psychiatric disorders from the linguistic content of social media posts. We build a large scale dataset of Reddit posts from users with eight disorders and a control user group. We extract and analyze linguistic characteristics of posts and identify differences between diagnostic groups. We build strong classification models based on deep contextualized word representations and show that they outperform previously applied statistical models with simple linguistic features by large margins. We compare user-level and post-level classification performance, as well as an ensembled multiclass model."
}

Download as File

<?xml version="1.0" encoding="UTF-8"?>
<modsCollection xmlns="http://www.loc.gov/mods/v3">
<mods ID="jiang-etal-2020-detection">
    <titleInfo>
        <title>Detection of Mental Health from Reddit via Deep Contextualized Representations</title>
    </titleInfo>
    <name type="personal">
        <namePart type="given">Zhengping</namePart>
        <namePart type="family">Jiang</namePart>
        <role>
            <roleTerm authority="marcrelator" type="text">author</roleTerm>
        </role>
    </name>
    <name type="personal">
        <namePart type="given">Sarah</namePart>
        <namePart type="given">Ita</namePart>
        <namePart type="family">Levitan</namePart>
        <role>
            <roleTerm authority="marcrelator" type="text">author</roleTerm>
        </role>
    </name>
    <name type="personal">
        <namePart type="given">Jonathan</namePart>
        <namePart type="family">Zomick</namePart>
        <role>
            <roleTerm authority="marcrelator" type="text">author</roleTerm>
        </role>
    </name>
    <name type="personal">
        <namePart type="given">Julia</namePart>
        <namePart type="family">Hirschberg</namePart>
        <role>
            <roleTerm authority="marcrelator" type="text">author</roleTerm>
        </role>
    </name>
    <originInfo>
        <dateIssued>2020-11</dateIssued>
    </originInfo>
    <typeOfResource>text</typeOfResource>
    <relatedItem type="host">
        <titleInfo>
            <title>Proceedings of the 11th International Workshop on Health Text Mining and Information Analysis</title>
        </titleInfo>
        <name type="personal">
            <namePart type="given">Eben</namePart>
            <namePart type="family">Holderness</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Antonio</namePart>
            <namePart type="family">Jimeno Yepes</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Alberto</namePart>
            <namePart type="family">Lavelli</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Anne-Lyse</namePart>
            <namePart type="family">Minard</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">James</namePart>
            <namePart type="family">Pustejovsky</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Fabio</namePart>
            <namePart type="family">Rinaldi</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <originInfo>
            <publisher>Association for Computational Linguistics</publisher>
            <place>
                <placeTerm type="text">Online</placeTerm>
            </place>
        </originInfo>
        <genre authority="marcgt">conference publication</genre>
    </relatedItem>
    <abstract>We address the problem of automatic detection of psychiatric disorders from the linguistic content of social media posts. We build a large scale dataset of Reddit posts from users with eight disorders and a control user group. We extract and analyze linguistic characteristics of posts and identify differences between diagnostic groups. We build strong classification models based on deep contextualized word representations and show that they outperform previously applied statistical models with simple linguistic features by large margins. We compare user-level and post-level classification performance, as well as an ensembled multiclass model.</abstract>
    <identifier type="citekey">jiang-etal-2020-detection</identifier>
    <identifier type="doi">10.18653/v1/2020.louhi-1.16</identifier>
    <location>
        <url>https://aclanthology.org/2020.louhi-1.16/</url>
    </location>
    <part>
        <date>2020-11</date>
        <extent unit="page">
            <start>147</start>
            <end>156</end>
        </extent>
    </part>
</mods>
</modsCollection>

Download as File

%0 Conference Proceedings
%T Detection of Mental Health from Reddit via Deep Contextualized Representations
%A Jiang, Zhengping
%A Levitan, Sarah Ita
%A Zomick, Jonathan
%A Hirschberg, Julia
%Y Holderness, Eben
%Y Jimeno Yepes, Antonio
%Y Lavelli, Alberto
%Y Minard, Anne-Lyse
%Y Pustejovsky, James
%Y Rinaldi, Fabio
%S Proceedings of the 11th International Workshop on Health Text Mining and Information Analysis
%D 2020
%8 November
%I Association for Computational Linguistics
%C Online
%F jiang-etal-2020-detection
%X We address the problem of automatic detection of psychiatric disorders from the linguistic content of social media posts. We build a large scale dataset of Reddit posts from users with eight disorders and a control user group. We extract and analyze linguistic characteristics of posts and identify differences between diagnostic groups. We build strong classification models based on deep contextualized word representations and show that they outperform previously applied statistical models with simple linguistic features by large margins. We compare user-level and post-level classification performance, as well as an ensembled multiclass model.
%R 10.18653/v1/2020.louhi-1.16
%U https://aclanthology.org/2020.louhi-1.16/
%U https://doi.org/10.18653/v1/2020.louhi-1.16
%P 147-156

Download as File

Markdown (Informal)

[Detection of Mental Health from Reddit via Deep Contextualized Representations](https://aclanthology.org/2020.louhi-1.16/) (Jiang et al., Louhi 2020)

Detection of Mental Health from Reddit via Deep Contextualized Representations (Jiang et al., Louhi 2020)

ACL

Zhengping Jiang, Sarah Ita Levitan, Jonathan Zomick, and Julia Hirschberg. 2020. Detection of Mental Health from Reddit via Deep Contextualized Representations. In Proceedings of the 11th International Workshop on Health Text Mining and Information Analysis, pages 147–156, Online. Association for Computational Linguistics.