HHS at SemEval-2023 Task 10: A Comparative Analysis of Sexism Detection Based on the RoBERTa Model

Yao Zhang; Liqing Wang

doi:10.18653/v1/2023.semeval-1.133

HHS at SemEval-2023 Task 10: A Comparative Analysis of Sexism Detection Based on the RoBERTa Model

Abstract

This paper describes the methods and models applied by our team HHS in SubTask-A of SemEval-2023 Task 10 about sexism detection. In this task, we trained with the officially released data and analyzed the performance of five models, TextCNN, BERT, RoBERTa, XLNet, and Sup-SimCSE-RoBERTa. The experiments show that most of the models can achieve good results. Then, we tried data augmentation, model ensemble, dropout, and other operations on several of these models, and compared the results for analysis. In the end, the most effective approach that yielded the best results on the test set involved the following steps: enhancing the sexist data using dropout, feeding it as input to the Sup-SimCSE-RoBERTa model, and providing the raw data as input to the XLNet model. Then, combining the outputs of the two methods led to even better results. This method yielded a Macro-F1 score of 0.823 in the final evaluation phase of the SubTask-A of the competition.

Anthology ID:: 2023.semeval-1.133
Volume:: Proceedings of the 17th International Workshop on Semantic Evaluation (SemEval-2023)
Month:: July
Year:: 2023
Address:: Toronto, Canada
Editors:: Atul Kr. Ojha, A. Seza Doğruöz, Giovanni Da San Martino, Harish Tayyar Madabushi, Ritesh Kumar, Elisa Sartori
Venue:: SemEval
SIG:: SIGLEX
Publisher:: Association for Computational Linguistics
Note:
Pages:: 963–968
Language:
URL:: https://aclanthology.org/2023.semeval-1.133/
DOI:: 10.18653/v1/2023.semeval-1.133
Bibkey:
Cite (ACL):: Yao Zhang and Liqing Wang. 2023. HHS at SemEval-2023 Task 10: A Comparative Analysis of Sexism Detection Based on the RoBERTa Model. In Proceedings of the 17th International Workshop on Semantic Evaluation (SemEval-2023), pages 963–968, Toronto, Canada. Association for Computational Linguistics.
Cite (Informal):: HHS at SemEval-2023 Task 10: A Comparative Analysis of Sexism Detection Based on the RoBERTa Model (Zhang & Wang, SemEval 2023)
Copy Citation:
PDF:: https://aclanthology.org/2023.semeval-1.133.pdf

PDF Cite Search Fix data