Not all Fake News is Written: A Dataset and Analysis of Misleading Video Headlines

Yoo Sung, Jordan Boyd-Graber, Naeemul Hassan


Abstract
Polarization and the marketplace for impressions have conspired to make navigating information online difficult for users, and while there has been a significant effort to detect false or misleading text, multimodal datasets have received considerably less attention. To complement existing resources, we present multimodal Video Misleading Headline (VMH), a dataset that consists of videos and whether annotators believe the headline is representative of the video’s contents. After collecting and annotating this dataset, we analyze multimodal baselines for detecting misleading headlines. Our annotation process also focuses on why annotators view a video as misleading, allowing us to better understand the interplay of annotators’ background and the content of the videos.
Anthology ID:
2023.emnlp-main.1010
Original:
2023.emnlp-main.1010v1
Version 2:
2023.emnlp-main.1010v2
Volume:
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing
Month:
December
Year:
2023
Address:
Singapore
Editors:
Houda Bouamor, Juan Pino, Kalika Bali
Venue:
EMNLP
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
16241–16258
Language:
URL:
https://aclanthology.org/2023.emnlp-main.1010
DOI:
10.18653/v1/2023.emnlp-main.1010
Bibkey:
Cite (ACL):
Yoo Sung, Jordan Boyd-Graber, and Naeemul Hassan. 2023. Not all Fake News is Written: A Dataset and Analysis of Misleading Video Headlines. In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, pages 16241–16258, Singapore. Association for Computational Linguistics.
Cite (Informal):
Not all Fake News is Written: A Dataset and Analysis of Misleading Video Headlines (Sung et al., EMNLP 2023)
Copy Citation:
PDF:
https://aclanthology.org/2023.emnlp-main.1010.pdf
Video:
 https://aclanthology.org/2023.emnlp-main.1010.mp4