@inproceedings{shokri-levitan-2023-gc,
title = "{GC}-Hunter at {I}mage{A}rg Shared Task: Multi-Modal Stance and Persuasiveness Learning",
author = "Shokri, Mohammad and
Levitan, Sarah Ita",
editor = "Alshomary, Milad and
Chen, Chung-Chi and
Muresan, Smaranda and
Park, Joonsuk and
Romberg, Julia",
booktitle = "Proceedings of the 10th Workshop on Argument Mining",
month = dec,
year = "2023",
address = "Singapore",
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/2023.argmining-1.17",
doi = "10.18653/v1/2023.argmining-1.17",
pages = "162--166",
abstract = "With the rising prominence of social media, users frequently supplement their written content with images. This trend has brought about new challenges in automatic processing of social media messages. In order to fully understand the meaning of a post, it is necessary to capture the relationship between the image and the text. In this work we address the two main objectives of the ImageArg shared task. Firstly, we aim to determine the stance of a multi-modal tweet toward a particular issue. We propose a strong baseline, fine-tuning transformer based models on concatenation of tweet text and image text. The second goal is to predict the impact of an image on the persuasiveness of the text in a multi-modal tweet. To capture the persuasiveness of an image, we train vision and language models on the data and explore other sets of features merged with the model, to enhance prediction power. Ultimately, both of these goals contribute toward the broader aim of understanding multi-modal messages on social media and how images and texts relate to each other.",
}
<?xml version="1.0" encoding="UTF-8"?>
<modsCollection xmlns="http://www.loc.gov/mods/v3">
<mods ID="shokri-levitan-2023-gc">
    <titleInfo>
        <title>GC-Hunter at ImageArg Shared Task: Multi-Modal Stance and Persuasiveness Learning</title>
    </titleInfo>
    <name type="personal">
        <namePart type="given">Mohammad</namePart>
        <namePart type="family">Shokri</namePart>
        <role>
            <roleTerm authority="marcrelator" type="text">author</roleTerm>
        </role>
    </name>
    <name type="personal">
        <namePart type="given">Sarah</namePart>
        <namePart type="given">Ita</namePart>
        <namePart type="family">Levitan</namePart>
        <role>
            <roleTerm authority="marcrelator" type="text">author</roleTerm>
        </role>
    </name>
    <originInfo>
        <dateIssued>2023-12</dateIssued>
    </originInfo>
    <typeOfResource>text</typeOfResource>
    <relatedItem type="host">
        <titleInfo>
            <title>Proceedings of the 10th Workshop on Argument Mining</title>
        </titleInfo>
        <name type="personal">
            <namePart type="given">Milad</namePart>
            <namePart type="family">Alshomary</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Chung-Chi</namePart>
            <namePart type="family">Chen</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Smaranda</namePart>
            <namePart type="family">Muresan</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Joonsuk</namePart>
            <namePart type="family">Park</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Julia</namePart>
            <namePart type="family">Romberg</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <originInfo>
            <publisher>Association for Computational Linguistics</publisher>
            <place>
                <placeTerm type="text">Singapore</placeTerm>
            </place>
        </originInfo>
        <genre authority="marcgt">conference publication</genre>
    </relatedItem>
    <abstract>With the rising prominence of social media, users frequently supplement their written content with images. This trend has brought about new challenges in automatic processing of social media messages. In order to fully understand the meaning of a post, it is necessary to capture the relationship between the image and the text. In this work we address the two main objectives of the ImageArg shared task. Firstly, we aim to determine the stance of a multi-modal tweet toward a particular issue. We propose a strong baseline, fine-tuning transformer based models on concatenation of tweet text and image text. The second goal is to predict the impact of an image on the persuasiveness of the text in a multi-modal tweet. To capture the persuasiveness of an image, we train vision and language models on the data and explore other sets of features merged with the model, to enhance prediction power. Ultimately, both of these goals contribute toward the broader aim of understanding multi-modal messages on social media and how images and texts relate to each other.</abstract>
    <identifier type="citekey">shokri-levitan-2023-gc</identifier>
    <identifier type="doi">10.18653/v1/2023.argmining-1.17</identifier>
    <location>
        <url>https://aclanthology.org/2023.argmining-1.17</url>
    </location>
    <part>
        <date>2023-12</date>
        <extent unit="page">
            <start>162</start>
            <end>166</end>
        </extent>
    </part>
</mods>
</modsCollection>
%0 Conference Proceedings
%T GC-Hunter at ImageArg Shared Task: Multi-Modal Stance and Persuasiveness Learning
%A Shokri, Mohammad
%A Levitan, Sarah Ita
%Y Alshomary, Milad
%Y Chen, Chung-Chi
%Y Muresan, Smaranda
%Y Park, Joonsuk
%Y Romberg, Julia
%S Proceedings of the 10th Workshop on Argument Mining
%D 2023
%8 December
%I Association for Computational Linguistics
%C Singapore
%F shokri-levitan-2023-gc
%X With the rising prominence of social media, users frequently supplement their written content with images. This trend has brought about new challenges in automatic processing of social media messages. In order to fully understand the meaning of a post, it is necessary to capture the relationship between the image and the text. In this work we address the two main objectives of the ImageArg shared task. Firstly, we aim to determine the stance of a multi-modal tweet toward a particular issue. We propose a strong baseline, fine-tuning transformer based models on concatenation of tweet text and image text. The second goal is to predict the impact of an image on the persuasiveness of the text in a multi-modal tweet. To capture the persuasiveness of an image, we train vision and language models on the data and explore other sets of features merged with the model, to enhance prediction power. Ultimately, both of these goals contribute toward the broader aim of understanding multi-modal messages on social media and how images and texts relate to each other.
%R 10.18653/v1/2023.argmining-1.17
%U https://aclanthology.org/2023.argmining-1.17
%U https://doi.org/10.18653/v1/2023.argmining-1.17
%P 162-166
Markdown (Informal)
[GC-Hunter at ImageArg Shared Task: Multi-Modal Stance and Persuasiveness Learning](https://aclanthology.org/2023.argmining-1.17) (Shokri & Levitan, ArgMining-WS 2023)
ACL
Mohammad Shokri and Sarah Ita Levitan. 2023. GC-Hunter at ImageArg Shared Task: Multi-Modal Stance and Persuasiveness Learning. In Proceedings of the 10th Workshop on Argument Mining, pages 162–166, Singapore. Association for Computational Linguistics.