Identification of Multimodal Stance Towards Frames of Communication

Maxwell Weinzierl; Sanda Harabagiu

doi:10.18653/v1/2023.emnlp-main.776

Identification of Multimodal Stance Towards Frames of Communication

Abstract

Frames of communication are often evoked in multimedia documents. When an author decides to add an image to a text, one or both of the modalities may evoke a communication frame. Moreover, when evoking the frame, the author also conveys her/his stance towards the frame. Until now, determining if the author is in favor of, against or has no stance towards the frame was performed automatically only when processing texts. This is due to the absence of stance annotations on multimedia documents. In this paper we introduce MMVax-Stance, a dataset of 11,300 multimedia documents retrieved from social media, which have stance annotations towards 113 different frames of communication. This dataset allowed us to experiment with several models of multimedia stance detection, which revealed important interactions between texts and images in the inference of stance towards communication frames. When inferring the text/image relations, a set of 46,606 synthetic examples of multimodal documents with known stance was generated. This greatly impacted the quality of identifying multimedia stance, yielding an improvement of 20% in F1-score.

Anthology ID:: 2023.emnlp-main.776
Volume:: Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing
Month:: December
Year:: 2023
Address:: Singapore
Editors:: Houda Bouamor, Juan Pino, Kalika Bali
Venue:: EMNLP
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 12597–12609
Language:
URL:: https://aclanthology.org/2023.emnlp-main.776/
DOI:: 10.18653/v1/2023.emnlp-main.776
Bibkey:
Cite (ACL):: Maxwell Weinzierl and Sanda Harabagiu. 2023. Identification of Multimodal Stance Towards Frames of Communication. In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, pages 12597–12609, Singapore. Association for Computational Linguistics.
Cite (Informal):: Identification of Multimodal Stance Towards Frames of Communication (Weinzierl & Harabagiu, EMNLP 2023)
Copy Citation:
PDF:: https://aclanthology.org/2023.emnlp-main.776.pdf
Video:: https://aclanthology.org/2023.emnlp-main.776.mp4

PDF Cite Search Video Fix data