Krzysztof Wróbel
2022
Transformer-based Part-of-Speech Tagging and Lemmatization for Latin
Krzysztof Wróbel | Krzysztof Nowak
Proceedings of the Second Workshop on Language Technologies for Historical and Ancient Languages
The paper presents a submission to the EvaLatin 2022 shared task. Our system places first for lemmatization, part-of-speech tagging, and morphological tagging in both the closed and open modalities. The results for the cross-genre and cross-time sub-tasks show that the system handles the diachronic and diastratic variation of Latin. The architecture employs state-of-the-art transformer models: XLM-RoBERTa large for part-of-speech and morphological tagging, and ByT5 small for lemmatization. The paper also features a thorough discussion of part-of-speech tagging and lemmatization errors, which shows how system performance may be improved for Classical, Medieval and Neo-Latin texts.
2016
PLUJAGH at SemEval-2016 Task 11: Simple System for Complex Word Identification
Krzysztof Wróbel
Proceedings of the 10th International Workshop on Semantic Evaluation (SemEval-2016)
WiTKoM – virtual sign language translator project
Katarzyna Barczewska | Jakub Galka | Filip Malawski | Mariusz Mąsior | Dorota Szulc | Tomasz Wilczyński | Krzysztof Wróbel
Proceedings of the 19th Annual Conference of the European Association for Machine Translation: Projects/Products