Amrita_CEN at SemEval-2022 Task 4: Oversampling-based Machine Learning Approach for Detecting Patronizing and Condescending Language

Bichu George, Adarsh S, Nishitkumar Prajapati, Premjith B, Soman Kp


Abstract
This paper narrates the work of the team Amrita_CEN for the shared task on Patronizing and Condescending Language Detection at SemEval 2022. We implemented machine learning algorithms such as Support Vector Machine (SVV), Logistic regression, Naive Bayes, XG Boost and Random Forest for modelling the tasks. At the same time, we also applied a feature engineering method to solve the class imbalance problem with respect to training data. Among all the models, the logistic regression model outperformed all other models and we have submitted results based upon the same.
Anthology ID:
2022.semeval-1.71
Volume:
Proceedings of the 16th International Workshop on Semantic Evaluation (SemEval-2022)
Month:
July
Year:
2022
Address:
Seattle, United States
Editors:
Guy Emerson, Natalie Schluter, Gabriel Stanovsky, Ritesh Kumar, Alexis Palmer, Nathan Schneider, Siddharth Singh, Shyam Ratan
Venue:
SemEval
SIG:
SIGLEX
Publisher:
Association for Computational Linguistics
Note:
Pages:
515–518
Language:
URL:
https://aclanthology.org/2022.semeval-1.71
DOI:
10.18653/v1/2022.semeval-1.71
Bibkey:
Cite (ACL):
Bichu George, Adarsh S, Nishitkumar Prajapati, Premjith B, and Soman Kp. 2022. Amrita_CEN at SemEval-2022 Task 4: Oversampling-based Machine Learning Approach for Detecting Patronizing and Condescending Language. In Proceedings of the 16th International Workshop on Semantic Evaluation (SemEval-2022), pages 515–518, Seattle, United States. Association for Computational Linguistics.
Cite (Informal):
Amrita_CEN at SemEval-2022 Task 4: Oversampling-based Machine Learning Approach for Detecting Patronizing and Condescending Language (George et al., SemEval 2022)
Copy Citation:
PDF:
https://aclanthology.org/2022.semeval-1.71.pdf
Video:
 https://aclanthology.org/2022.semeval-1.71.mp4