AliEdalat at SemEval-2022 Task 4: Patronizing and Condescending Language Detection using Fine-tuned Language Models, BERT+BiGRU, and Ensemble Models

Ali Edalat, Yadollah Yaghoobzadeh, Behnam Bahrak


Abstract
This paper presents the AliEdalat team’s methodology and results in SemEval-2022 Task 4: Patronizing and Condescending Language (PCL) Detection. The task aims to detect the presence of PCL and of individual PCL categories in text, in order to prevent further discrimination against vulnerable communities. To detect the presence of PCL, we use an ensemble of three base models: fine-tuned BigBird, fine-tuned MPNet, and BERT+BiGRU. Due to overfitting, the ensemble model performs worse than the baseline, achieving an F1-score of 0.3031. We also offer an alternative solution that addresses the submitted model’s problem, in which we consider the different categories of PCL separately. To detect each category, we follow the same approach as for the PCL detector, but use fine-tuned RoBERTa in place of BERT+BiGRU. In PCL category detection, our model outperforms the baseline and achieves an F1-score of 0.2531. We also present new models for detecting two categories of PCL that outperform the submitted models.
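The abstract does not say how the three base models' predictions are combined, so the following is only a minimal sketch, assuming simple majority voting over binary labels and an F1-score computed on the positive (PCL) class. The function names `majority_vote` and `f1_positive` and the toy predictions are hypothetical and are not taken from the paper or its code.

```python
from typing import List, Sequence


def majority_vote(predictions: Sequence[Sequence[int]]) -> List[int]:
    """Combine binary (0/1) predictions from several models by majority vote.

    Assumption: the paper's actual combination rule is not stated in the
    abstract; majority voting is used here purely for illustration.
    """
    n_models = len(predictions)
    if n_models == 0:
        raise ValueError("at least one model's predictions are required")
    n_examples = len(predictions[0])
    if any(len(p) != n_examples for p in predictions):
        raise ValueError("all models must predict the same number of examples")
    # An example is labeled 1 when strictly more than half of the models vote 1.
    return [1 if 2 * sum(votes) > n_models else 0 for votes in zip(*predictions)]


def f1_positive(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """F1-score of the positive class (label 1)."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    if tp == 0:
        return 0.0  # precision and recall are both zero (or undefined)
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    # Hypothetical per-model predictions on five toy examples.
    bigbird_preds = [1, 0, 1, 0, 1]
    mpnet_preds = [1, 0, 0, 0, 1]
    bert_bigru_preds = [0, 1, 1, 0, 1]
    gold = [1, 0, 1, 1, 1]

    ensemble = majority_vote([bigbird_preds, mpnet_preds, bert_bigru_preds])
    print(ensemble, round(f1_positive(gold, ensemble), 4))
```

With an odd number of models, majority voting never produces ties, which is one reason a three-model ensemble is convenient for binary decisions.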
Anthology ID:
2022.semeval-1.51
Volume:
Proceedings of the 16th International Workshop on Semantic Evaluation (SemEval-2022)
Month:
July
Year:
2022
Address:
Seattle, United States
Editors:
Guy Emerson, Natalie Schluter, Gabriel Stanovsky, Ritesh Kumar, Alexis Palmer, Nathan Schneider, Siddharth Singh, Shyam Ratan
Venue:
SemEval
SIG:
SIGLEX
Publisher:
Association for Computational Linguistics
Pages:
387–393
URL:
https://aclanthology.org/2022.semeval-1.51
DOI:
10.18653/v1/2022.semeval-1.51
Cite (ACL):
Ali Edalat, Yadollah Yaghoobzadeh, and Behnam Bahrak. 2022. AliEdalat at SemEval-2022 Task 4: Patronizing and Condescending Language Detection using Fine-tuned Language Models, BERT+BiGRU, and Ensemble Models. In Proceedings of the 16th International Workshop on Semantic Evaluation (SemEval-2022), pages 387–393, Seattle, United States. Association for Computational Linguistics.
Cite (Informal):
AliEdalat at SemEval-2022 Task 4: Patronizing and Condescending Language Detection using Fine-tuned Language Models, BERT+BiGRU, and Ensemble Models (Edalat et al., SemEval 2022)
PDF:
https://aclanthology.org/2022.semeval-1.51.pdf
Code:
aliedalat/semeval-2022-task-4-pcl-detection
Data:
DPMTalkDown