UMUTeam at SemEval-2023 Task 10: Fine-grained detection of sexism in English

Ronghao Pan; José Antonio García-Díaz; Salud María Jiménez-Zafra; Rafael Valencia-García

doi:10.18653/v1/2023.semeval-1.80

UMUTeam at SemEval-2023 Task 10: Fine-grained detection of sexism in English

Ronghao Pan, José Antonio García-Díaz, Salud María Jiménez Zafra, Rafael Valencia-García

Abstract

In this manuscript, we describe the participation of UMUTeam in the Explainable Detection of Online Sexism shared task proposed at SemEval 2023. This task concerns the precise and explainable detection of sexist content on Gab and Reddit, i.e., developing detailed classifiers that not only identify what is sexist, but also explain why it is sexism. Our participation in the three EDOS subtasks is based on extending new unlabeled sexism data in the Masked Language Model task of a pre-trained model, such as RoBERTa-large to improve its generalization capacity and its performance on classification tasks. Once the model has been pre-trained with the new data, fine-tuning of this model is performed for different specific sexism classification tasks. Our system has achieved excellent results in this competitive task, reaching top 24 (84) in Task A, top 23 (69) in Task B, and top 13 (63) in Task C.

Anthology ID:: 2023.semeval-1.80
Volume:: Proceedings of the 17th International Workshop on Semantic Evaluation (SemEval-2023)
Month:: July
Year:: 2023
Address:: Toronto, Canada
Editors:: Atul Kr. Ojha, A. Seza Doğruöz, Giovanni Da San Martino, Harish Tayyar Madabushi, Ritesh Kumar, Elisa Sartori
Venue:: SemEval
SIG:: SIGLEX
Publisher:: Association for Computational Linguistics
Note:
Pages:: 589–594
Language:
URL:: https://aclanthology.org/2023.semeval-1.80/
DOI:: 10.18653/v1/2023.semeval-1.80
Bibkey:
Cite (ACL):: Ronghao Pan, José Antonio García-Díaz, Salud María Jiménez Zafra, and Rafael Valencia-García. 2023. UMUTeam at SemEval-2023 Task 10: Fine-grained detection of sexism in English. In Proceedings of the 17th International Workshop on Semantic Evaluation (SemEval-2023), pages 589–594, Toronto, Canada. Association for Computational Linguistics.
Cite (Informal):: UMUTeam at SemEval-2023 Task 10: Fine-grained detection of sexism in English (Pan et al., SemEval 2023)
Copy Citation:
PDF:: https://aclanthology.org/2023.semeval-1.80.pdf

PDF Cite Search Fix data