Automatic Extraction of News Values from Headline Text

Alicja Piotrkowicz, Vania Dimitrova, Katja Markert


Abstract
Headlines play a crucial role in attracting audiences’ attention to online artefacts (e.g. news articles, videos, blogs). The ability to carry out an automatic, large-scale analysis of headlines is critical to facilitate the selection and prioritisation of a large volume of digital content. In journalism studies news content has been extensively studied using manually annotated news values - factors used implicitly and explicitly when making decisions on the selection and prioritisation of news items. This paper presents the first attempt at a fully automatic extraction of news values from headline text. The news values extraction methods are applied on a large headlines corpus collected from The Guardian, and evaluated by comparing it with a manually annotated gold standard. A crowdsourcing survey indicates that news values affect people’s decisions to click on a headline, supporting the need for an automatic news values detection.
Anthology ID:
E17-4007
Volume:
Proceedings of the Student Research Workshop at the 15th Conference of the European Chapter of the Association for Computational Linguistics
Month:
April
Year:
2017
Address:
Valencia, Spain
Editors:
Florian Kunneman, Uxoa Iñurrieta, John J. Camilleri, Mariona Coll Ardanuy
Venue:
EACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
64–74
Language:
URL:
https://aclanthology.org/E17-4007
DOI:
Bibkey:
Cite (ACL):
Alicja Piotrkowicz, Vania Dimitrova, and Katja Markert. 2017. Automatic Extraction of News Values from Headline Text. In Proceedings of the Student Research Workshop at the 15th Conference of the European Chapter of the Association for Computational Linguistics, pages 64–74, Valencia, Spain. Association for Computational Linguistics.
Cite (Informal):
Automatic Extraction of News Values from Headline Text (Piotrkowicz et al., EACL 2017)
Copy Citation:
PDF:
https://aclanthology.org/E17-4007.pdf