A Semantics-based Approach to Disclosure Classification in User-Generated Online Content

Chandan Akiti, Anna Squicciarini, Sarah Rajtmajer


Abstract
As users engage in public discourse, the rate of voluntarily disclosed personal information has seen a steep increase. So-called self-disclosure can result in a number of privacy concerns. Users are often unaware of the sheer amount of personal information they share across online forums, commentaries, and social networks, as well as the power of modern AI to synthesize and gain insights from this data. This paper presents an approach to detect emotional and informational self-disclosure in natural language. We hypothesize that identifying frame semantics can meaningfully support this task. Specifically, we use Semantic Role Labeling to identify the lexical units and their semantic roles that signal self-disclosure. Experimental results on Reddit data show the performance gain of our method when compared to standard text classification methods based on BiLSTM, and BERT. In addition to improved performance, our approach provides insights into the drivers of disclosure behaviors.
Anthology ID:
2020.findings-emnlp.312
Volume:
Findings of the Association for Computational Linguistics: EMNLP 2020
Month:
November
Year:
2020
Address:
Online
Editors:
Trevor Cohn, Yulan He, Yang Liu
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
3490–3499
Language:
URL:
https://aclanthology.org/2020.findings-emnlp.312
DOI:
10.18653/v1/2020.findings-emnlp.312
Bibkey:
Cite (ACL):
Chandan Akiti, Anna Squicciarini, and Sarah Rajtmajer. 2020. A Semantics-based Approach to Disclosure Classification in User-Generated Online Content. In Findings of the Association for Computational Linguistics: EMNLP 2020, pages 3490–3499, Online. Association for Computational Linguistics.
Cite (Informal):
A Semantics-based Approach to Disclosure Classification in User-Generated Online Content (Akiti et al., Findings 2020)
Copy Citation:
PDF:
https://aclanthology.org/2020.findings-emnlp.312.pdf
Optional supplementary material:
 2020.findings-emnlp.312.OptionalSupplementaryMaterial.zip