NewYeS: A Corpus of New Year’s Speeches with a Comparative Analysis

Anna Tramarin, Carlo Strapparava


Abstract
This paper introduces the NewYeS corpus, which contains the Christmas messages and New Year’s speeches held at the end of the year by the heads of state of different European countries (namely Denmark, France, Italy, Norway, Spain and the United Kingdom). The corpus was collected via web scraping of the speech transcripts available online. A comparative analysis was conducted to examine some of the cultural differences showing through the texts, namely a frequency distribution analysis of the term “God” and the identification of the three most frequent content words per year, with a focus on years in which significant historical events happened. An analysis of positive and negative emotion scores, examined along with the frequency of religious references, was carried out for those countries whose languages are supported by LIWC, a tool for sentiment analysis. The corpus is available for further analyses, both comparative (across countries) and diachronic (over the years).
Anthology ID:
2022.politicalnlp-1.1
Volume:
Proceedings of the LREC 2022 workshop on Natural Language Processing for Political Sciences
Month:
June
Year:
2022
Address:
Marseille, France
Editors:
Haithem Afli, Mehwish Alam, Houda Bouamor, Cristina Blasi Casagran, Colleen Boland, Sahar Ghannay
Venue:
PoliticalNLP
SIG:
Publisher:
European Language Resources Association
Note:
Pages:
1–7
Language:
URL:
https://aclanthology.org/2022.politicalnlp-1.1
DOI:
Bibkey:
Cite (ACL):
Anna Tramarin and Carlo Strapparava. 2022. NewYeS: A Corpus of New Year’s Speeches with a Comparative Analysis. In Proceedings of the LREC 2022 workshop on Natural Language Processing for Political Sciences, pages 1–7, Marseille, France. European Language Resources Association.
Cite (Informal):
NewYeS: A Corpus of New Year’s Speeches with a Comparative Analysis (Tramarin & Strapparava, PoliticalNLP 2022)
Copy Citation:
PDF:
https://aclanthology.org/2022.politicalnlp-1.1.pdf