Konstantin Schulz


2024

pdf bib
SEFLAG: Systematic Evaluation Framework for NLP Models and Datasets in Latin and Ancient Greek
Konstantin Schulz | Florian Deichsler
Proceedings of the 4th International Conference on Natural Language Processing for Digital Humanities

Literary scholars of Latin and Ancient Greek increasingly use natural language processing for their work, but many models and datasets are hard to use due to a lack of sustainable research data management. This paper introduces the Systematic Evaluation Framework for natural language processing models and datasets in Latin and Ancient Greek (SEFLAG), which consistently assesses language resources using common criteria, such as specific evaluation metrics, metadata and risk analysis. The framework, a work in progress in its initial phase, currently covers lemmatization and named entity recognition for both languages, with plans for adding dependency parsing and other tasks. For increased transparency and sustainability, a thorough documentation is included as well as an integration into the HuggingFace ecosystem. The combination of these efforts is designed to support researchers in their search for suitable models.

2022

pdf bib
Modelling Cultural and Socio-Economic Dimensions of Political Bias in German Tweets
Aishwarya Anegundi | Konstantin Schulz | Christian Rauh | Georg Rehm
Proceedings of the 18th Conference on Natural Language Processing (KONVENS 2022)

2020

pdf bib
Intelligenti Pauca - Probing a Novel Alternative to Universal Dependencies for Under-Resourced Languages on Latin
Daniel Couto Vale | Konstantin Schulz
Proceedings of the 19th International Workshop on Treebanks and Linguistic Theories