SEFLAG: Systematic Evaluation Framework for NLP Models and Datasets in Latin and Ancient Greek

Konstantin Schulz, Florian Deichsler


Abstract
Literary scholars of Latin and Ancient Greek increasingly use natural language processing for their work, but many models and datasets are hard to use due to a lack of sustainable research data management. This paper introduces the Systematic Evaluation Framework for natural language processing models and datasets in Latin and Ancient Greek (SEFLAG), which consistently assesses language resources using common criteria, such as specific evaluation metrics, metadata and risk analysis. The framework, a work in progress in its initial phase, currently covers lemmatization and named entity recognition for both languages, with plans for adding dependency parsing and other tasks. For increased transparency and sustainability, a thorough documentation is included as well as an integration into the HuggingFace ecosystem. The combination of these efforts is designed to support researchers in their search for suitable models.
Anthology ID:
2024.nlp4dh-1.24
Volume:
Proceedings of the 4th International Conference on Natural Language Processing for Digital Humanities
Month:
November
Year:
2024
Address:
Miami, USA
Editors:
Mika Hämäläinen, Emily Öhman, So Miyagawa, Khalid Alnajjar, Yuri Bizzoni
Venue:
NLP4DH
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
247–258
Language:
URL:
https://aclanthology.org/2024.nlp4dh-1.24
DOI:
Bibkey:
Cite (ACL):
Konstantin Schulz and Florian Deichsler. 2024. SEFLAG: Systematic Evaluation Framework for NLP Models and Datasets in Latin and Ancient Greek. In Proceedings of the 4th International Conference on Natural Language Processing for Digital Humanities, pages 247–258, Miami, USA. Association for Computational Linguistics.
Cite (Informal):
SEFLAG: Systematic Evaluation Framework for NLP Models and Datasets in Latin and Ancient Greek (Schulz & Deichsler, NLP4DH 2024)
Copy Citation:
PDF:
https://aclanthology.org/2024.nlp4dh-1.24.pdf