Tanja Wissik
2022
The ALPIN Sentiment Dictionary: Austrian Language Polarity in Newspapers
Thomas Kolb | Sekanina Katharina | Bettina Manuela Johanna Kern | Julia Neidhardt | Tanja Wissik | Andreas Baumann
Proceedings of the Thirteenth Language Resources and Evaluation Conference
Thomas Kolb | Sekanina Katharina | Bettina Manuela Johanna Kern | Julia Neidhardt | Tanja Wissik | Andreas Baumann
Proceedings of the Thirteenth Language Resources and Evaluation Conference
This paper introduces the Austrian German sentiment dictionary ALPIN to account for the lack of resources for dictionary-based sentiment analysis in this specific variety of German, which is characterized by lexical idiosyncrasies that also affect word sentiment. The proposed language resource is based on Austrian news media in the field of politics, an austriacism list based on different resources and a posting data set based on a popular Austrian news media. Different resources are used to increase the diversity of the resulting language resource. Extensive crowd-sourcing is performed followed by evaluation and automatic conversion into sentiment scores. We show that crowd-sourcing enables the creation of a sentiment dictionary for the Austrian German domain. Additionally, the different parts of the sentiment dictionary are evaluated to show their impact on the resulting resource. Furthermore, the proposed dictionary is utilized in a web application and available for future research and free to use for anyone.
Visualizing Parliamentary Speeches as Networks: the DYLEN Tool
Seung-bin Yim | Katharina Wünsche | Asil Cetin | Julia Neidhardt | Andreas Baumann | Tanja Wissik
Proceedings of the Workshop ParlaCLARIN III within the 13th Language Resources and Evaluation Conference
Seung-bin Yim | Katharina Wünsche | Asil Cetin | Julia Neidhardt | Andreas Baumann | Tanja Wissik
Proceedings of the Workshop ParlaCLARIN III within the 13th Language Resources and Evaluation Conference
In this paper, we present a web based interactive visualization tool for lexical networks based on the utterances of Austrian Members of Parliament. The tool is designed to compare two networks in parallel and is composed of graph visualization, node-metrics comparison and time-series comparison components that are interconnected with each other.
2020
A Multilingual Evaluation Dataset for Monolingual Word Sense Alignment
Sina Ahmadi | John P. McCrae | Sanni Nimb | Fahad Khan | Monica Monachini | Bolette S. Pedersen | Thierry Declerck | Tanja Wissik | Andrea Bellandi | Irene Pisani | Thomas Troelsgård | Sussi Olsen | Simon Krek | Veronika Lipp | Tamás Váradi | László Simon | András Győrffy | Carole Tiberius | Tanneke Schoonheim | Yifat Ben Moshe | Maya Rudich | Raya Abu Ahmad | Dorielle Lonke | Kira Kovalenko | Margit Langemets | Jelena Kallas | Oksana Dereza | Theodorus Fransen | David Cillessen | David Lindemann | Mikel Alonso | Ana Salgado | José Luis Sancho | Rafael-J. Ureña-Ruiz | Jordi Porta Zamorano | Kiril Simov | Petya Osenova | Zara Kancheva | Ivaylo Radev | Ranka Stanković | Andrej Perdih | Dejan Gabrovšek
Proceedings of the Twelfth Language Resources and Evaluation Conference
Sina Ahmadi | John P. McCrae | Sanni Nimb | Fahad Khan | Monica Monachini | Bolette S. Pedersen | Thierry Declerck | Tanja Wissik | Andrea Bellandi | Irene Pisani | Thomas Troelsgård | Sussi Olsen | Simon Krek | Veronika Lipp | Tamás Váradi | László Simon | András Győrffy | Carole Tiberius | Tanneke Schoonheim | Yifat Ben Moshe | Maya Rudich | Raya Abu Ahmad | Dorielle Lonke | Kira Kovalenko | Margit Langemets | Jelena Kallas | Oksana Dereza | Theodorus Fransen | David Cillessen | David Lindemann | Mikel Alonso | Ana Salgado | José Luis Sancho | Rafael-J. Ureña-Ruiz | Jordi Porta Zamorano | Kiril Simov | Petya Osenova | Zara Kancheva | Ivaylo Radev | Ranka Stanković | Andrej Perdih | Dejan Gabrovšek
Proceedings of the Twelfth Language Resources and Evaluation Conference
Aligning senses across resources and languages is a challenging task with beneficial applications in the field of natural language processing and electronic lexicography. In this paper, we describe our efforts in manually aligning monolingual dictionaries. The alignment is carried out at sense-level for various resources in 15 languages. Moreover, senses are annotated with possible semantic relationships such as broadness, narrowness, relatedness, and equivalence. In comparison to previous datasets for this task, this dataset covers a wide range of languages and resources and focuses on the more challenging task of linking general-purpose language. We believe that our data will pave the way for further advances in alignment and evaluation of word senses by creating new solutions, particularly those notoriously requiring data such as neural networks. Our resources are publicly available at https://github.com/elexis-eu/MWSA.
Comparing Lexical Usage in Political Discourse across Diachronic Corpora
Klaus Hofmann | Anna Marakasova | Andreas Baumann | Julia Neidhardt | Tanja Wissik
Proceedings of the Second ParlaCLARIN Workshop
Klaus Hofmann | Anna Marakasova | Andreas Baumann | Julia Neidhardt | Tanja Wissik
Proceedings of the Second ParlaCLARIN Workshop
Most diachronic studies on both lexico-semantic change and political language usage are based on individual or comparable corpora. In this paper, we explore ways of studying the stability (and changeability) of lexical usage in political discourse across two corpora which are substantially different in structure and size. We present a case study focusing on lexical items associated with political parties in two diachronic corpora of Austrian German, namely a diachronic media corpus (AMC) and a corpus of parliamentary records (ParlAT), and measure the cross-temporal stability of lexical usage over a period of 20 years. We conduct three sets of comparative analyses investigating a) the stability of sets of lexical items associated with the three major political parties over time, b) lexical similarity between parties, and c) the similarity between the lexical choices in parliamentary speeches by members of the parties vis-‘a-vis the media’s reporting on the parties. We employ time series modeling using generalized additive models (GAMs) to compare the lexical similarities and differences between parties within and across corpora. The results show that changes observed in these measures can be meaningfully related to political events during that time.
2012
Search
Fix author
Co-authors
- Andreas Baumann 3
- Julia Neidhardt 3
- Raya Abu Ahmad 1
- Sina Ahmadi 1
- Mikel Alonso 1
- Andrea Bellandi 1
- Yifat Ben Moshe 1
- Asil Cetin 1
- David Cillessen 1
- Thierry Declerck 1
- Oksana Dereza 1
- Theodorus Fransen 1
- Dejan Gabrovšek 1
- András Győrffy 1
- Klaus Hofmann 1
- Jelena Kallas 1
- Zara Kancheva 1
- Sekanina Katharina 1
- Bettina Manuela Johanna Kern 1
- Fahad Khan 1
- Thomas Kolb 1
- Kira Kovalenko 1
- Simon Krek 1
- Margit Langemets 1
- David Lindemann 1
- Veronika Lipp 1
- Dorielle Lonke 1
- Vesna Lušicky 1
- Anna Marakasova 1
- John Philip McCrae 1
- Monica Monachini 1
- Sanni Nimb 1
- Sussi Olsen 1
- Petya Osenova 1
- Bolette Sandford Pedersen 1
- Andrej Perdih 1
- Irene Pisani 1
- Ivaylo Radev 1
- Maya Rudich 1
- Ana Salgado 1
- José-Luis Sancho 1
- Tanneke Schoonheim 1
- László Simon 1
- Kiril Simov 1
- Ranka Stanković 1
- Carole Tiberius 1
- Thomas Troelsgård 1
- Rafael-J. Ureña-Ruiz 1
- Tamás Váradi 1
- Katharina Wünsche 1
- Seung-bin Yim 1
- Jordi Porta Zamorano 1