Guillermo Villar-Rodríguez
2022
AIDA-UPM at SemEval-2022 Task 5: Exploring Multimodal Late Information Fusion for Multimedia Automatic Misogyny Identification
Álvaro Huertas-García
|
Helena Liz
|
Guillermo Villar-Rodríguez
|
Alejandro Martín
|
Javier Huertas-Tato
|
David Camacho
Proceedings of the 16th International Workshop on Semantic Evaluation (SemEval-2022)
This paper describes the multimodal late fusion model proposed in the SemEval-2022 Multimedia Automatic Misogyny Identification (MAMI) task. The main contribution of this paper is the exploration of different late fusion methods to boost the performance of the combination based on the Transformer-based model and Convolutional Neural Networks (CNN) for text and image, respectively. Additionally, our findings contribute to a better understanding of the effects of different image preprocessing methods for meme classification. We achieve 0.636 F1-macro average score for the binary subtask A, and 0.632 F1-macro average score for the multi-label subtask B. The present findings might help solve the inequality and discrimination women suffer on social media platforms.
Search