PCJ at SemEval-2023 Task 10: A Ensemble Model Based on Pre-trained Model for Sexism Detection and Classification in English

Chujun Pu, Xiaobing Zhou


Abstract
This paper describes the system and the resulting model submitted by our team “PCJ” to the SemEval-2023 Task 10 sub-task A contest. In this task, we need to test the English text content in the posts to determine whether there is sexism, which involves emotional text classification. Our submission system utilizes methods based on RoBERTa, SimCSE-RoBERTa pre-training models, and model ensemble to classify and train on datasets provided by the organizers. In the final assessment, our submission achieved a macro average F1 score of 0.8449, ranking 28th out of 84 teams in Task A.
Anthology ID:
2023.semeval-1.59
Volume:
Proceedings of the 17th International Workshop on Semantic Evaluation (SemEval-2023)
Month:
July
Year:
2023
Address:
Toronto, Canada
Editors:
Atul Kr. Ojha, A. Seza Doğruöz, Giovanni Da San Martino, Harish Tayyar Madabushi, Ritesh Kumar, Elisa Sartori
Venue:
SemEval
SIG:
SIGLEX
Publisher:
Association for Computational Linguistics
Note:
Pages:
433–438
Language:
URL:
https://aclanthology.org/2023.semeval-1.59
DOI:
10.18653/v1/2023.semeval-1.59
Bibkey:
Cite (ACL):
Chujun Pu and Xiaobing Zhou. 2023. PCJ at SemEval-2023 Task 10: A Ensemble Model Based on Pre-trained Model for Sexism Detection and Classification in English. In Proceedings of the 17th International Workshop on Semantic Evaluation (SemEval-2023), pages 433–438, Toronto, Canada. Association for Computational Linguistics.
Cite (Informal):
PCJ at SemEval-2023 Task 10: A Ensemble Model Based on Pre-trained Model for Sexism Detection and Classification in English (Pu & Zhou, SemEval 2023)
Copy Citation:
PDF:
https://aclanthology.org/2023.semeval-1.59.pdf