SharedCon: Implicit Hate Speech Detection using Shared Semantics

Hyeseon Ahn, Youngwook Kim, Jungin Kim, Yo-Sub Han


Abstract
The ever-growing presence of hate speech on social network services and other online platforms not only fuels online harassment but also makes hate speech detection increasingly challenging. Because this task is akin to binary classification, contrastive learning is one of the promising approaches to it. Recent studies suggest, however, that classifying hateful posts in a purely binary manner may not adequately address the nuanced task of detecting implicit hate speech, largely because such pejorative remarks are subtle and context-dependent. Previous studies proposed modified contrastive learning approaches equipped with additional aids, such as human-written implications or machine-generated augmented data, for better implicit hate speech detection. While the additional data can enhance overall performance, it carries a risk of overfitting and is costly and time-consuming to obtain. These drawbacks motivate us to design a methodology that does not depend on human-written or machine-generated augmented data for training. We propose a straightforward yet effective clustering-based contrastive learning approach that leverages the shared semantics among the data.
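The abstract does not spell out how the clusters are formed or how the loss is defined, so the following is only a minimal NumPy sketch of what a clustering-based contrastive objective could look like. It assumes that sentence embeddings are grouped with plain k-means and that samples sharing a cluster are treated as positives in a supervised-contrastive-style loss. The function names `kmeans` and `cluster_contrastive_loss`, the cluster count, and the temperature are illustrative assumptions, not the authors' implementation.

```python
import numpy as np


def kmeans(x, k, iters=20, seed=0):
    """Plain k-means; returns one cluster id per row of x (hypothetical helper)."""
    rng = np.random.default_rng(seed)
    # Start from k distinct data points (fancy indexing returns a copy).
    centroids = x[rng.choice(len(x), size=k, replace=False)]
    for _ in range(iters):
        # Squared Euclidean distance from every point to every centroid.
        dists = ((x[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=-1)
        assign = dists.argmin(axis=1)
        for c in range(k):
            if np.any(assign == c):
                centroids[c] = x[assign == c].mean(axis=0)
    return assign


def cluster_contrastive_loss(z, cluster_ids, temperature=0.1):
    """Contrastive loss in which samples from the same cluster are positives.

    Assumption: this mirrors a supervised contrastive loss with cluster ids
    standing in for labels; the paper's actual objective may differ.
    """
    z = z / np.linalg.norm(z, axis=1, keepdims=True)  # cosine similarity
    sim = z @ z.T / temperature
    self_mask = np.eye(len(z), dtype=bool)
    sim = np.where(self_mask, -np.inf, sim)  # never contrast a sample with itself
    # Numerically stable log-softmax over all other samples.
    row_max = sim.max(axis=1, keepdims=True)
    log_prob = sim - (row_max + np.log(np.exp(sim - row_max).sum(axis=1, keepdims=True)))
    pos = (cluster_ids[:, None] == cluster_ids[None, :]) & ~self_mask
    pos_count = pos.sum(axis=1)
    has_pos = pos_count > 0  # skip anchors that are alone in their cluster
    if not has_pos.any():
        return 0.0
    summed = np.where(pos, log_prob, 0.0).sum(axis=1)
    return float((-summed[has_pos] / pos_count[has_pos]).mean())


if __name__ == "__main__":
    # Toy "embeddings": two groups pointing in different directions.
    emb = np.array([[1.0, 0.0], [1.0, 0.1], [1.0, -0.1],
                    [0.0, 1.0], [0.1, 1.0], [-0.1, 1.0]])
    ids = kmeans(emb, k=2)
    print(ids, cluster_contrastive_loss(emb, ids))
```

In an actual training setup, the embeddings would come from the encoder being fine-tuned and this term would typically be combined with a classification loss; both choices are assumptions here rather than details taken from the abstract.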
Anthology ID:
2024.findings-acl.622
Volume:
Findings of the Association for Computational Linguistics ACL 2024
Month:
August
Year:
2024
Address:
Bangkok, Thailand and virtual meeting
Editors:
Lun-Wei Ku, Andre Martins, Vivek Srikumar
Venue:
Findings
Publisher:
Association for Computational Linguistics
Pages:
10444–10455
URL:
https://aclanthology.org/2024.findings-acl.622
Cite (ACL):
Hyeseon Ahn, Youngwook Kim, Jungin Kim, and Yo-Sub Han. 2024. SharedCon: Implicit Hate Speech Detection using Shared Semantics. In Findings of the Association for Computational Linguistics ACL 2024, pages 10444–10455, Bangkok, Thailand and virtual meeting. Association for Computational Linguistics.
Cite (Informal):
SharedCon: Implicit Hate Speech Detection using Shared Semantics (Ahn et al., Findings 2024)
PDF:
https://aclanthology.org/2024.findings-acl.622.pdf