EMBEDDIA Tools, Datasets and Challenges: Resources and Hackathon Contributions

Senja Pollak, Marko Robnik-Šikonja, Matthew Purver, Michele Boggia, Ravi Shekhar, Marko Pranjić, Salla Salmela, Ivar Krustok, Tarmo Paju, Carl-Gustav Linden, Leo Leppänen, Elaine Zosa, Matej Ulčar, Linda Freienthal, Silver Traat, Luis Adrián Cabrera-Diego, Matej Martinc, Nada Lavrač, Blaž Škrlj, Martin Žnidaršič, Andraž Pelicon, Boshko Koloski, Vid Podpečan, Janez Kranjc, Shane Sheehan, Emanuela Boros, Jose G. Moreno, Antoine Doucet, Hannu Toivonen


Abstract
This paper presents tools and data sources collected and released by the EMBEDDIA project, supported by the European Union’s Horizon 2020 research and innovation program. The collected resources were offered to participants of a hackathon organized as part of the EACL Hackashop on News Media Content Analysis and Automated Report Generation in February 2021. The hackathon had six participating teams who addressed different challenges, either from the list of proposed challenges or their own news-industry-related tasks. This paper goes beyond the scope of the hackathon, as it brings together in a coherent and compact form most of the resources developed, collected and released by the EMBEDDIA project. Moreover, it constitutes a handy source for news media industry and researchers in the fields of Natural Language Processing and Social Science.
Anthology ID:
2021.hackashop-1.14
Volume:
Proceedings of the EACL Hackashop on News Media Content Analysis and Automated Report Generation
Month:
April
Year:
2021
Address:
Online
Venues:
EACL | Hackashop
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
99–109
Language:
URL:
https://aclanthology.org/2021.hackashop-1.14
DOI:
Bibkey:
Cite (ACL):
Senja Pollak, Marko Robnik-Šikonja, Matthew Purver, Michele Boggia, Ravi Shekhar, Marko Pranjić, Salla Salmela, Ivar Krustok, Tarmo Paju, Carl-Gustav Linden, Leo Leppänen, Elaine Zosa, Matej Ulčar, Linda Freienthal, Silver Traat, Luis Adrián Cabrera-Diego, Matej Martinc, Nada Lavrač, Blaž Škrlj, et al.. 2021. EMBEDDIA Tools, Datasets and Challenges: Resources and Hackathon Contributions. In Proceedings of the EACL Hackashop on News Media Content Analysis and Automated Report Generation, pages 99–109, Online. Association for Computational Linguistics.
Cite (Informal):
EMBEDDIA Tools, Datasets and Challenges: Resources and Hackathon Contributions (Pollak et al., Hackashop 2021)
Copy Citation:
PDF:
https://aclanthology.org/2021.hackashop-1.14.pdf