Unified Annotation of the Stages of the Bulgarian Language. First Steps

Fabio Maion, Tsvetana Dimitrova, Andrej Bojadziev


Abstract
The paper reports on an ongoing work on a proposal of guidelines for unified annotation of the stages in the development of the Bulgarian language from the Middle Ages to the early modern period. It discusses the criteria for the selection of texts and their representation, along with some results of the trial tagging with an existing tagger which was already trained on other texts.
Anthology ID:
2024.clib-1.24
Volume:
Proceedings of the Sixth International Conference on Computational Linguistics in Bulgaria (CLIB 2024)
Month:
September
Year:
2024
Address:
Sofia, Bulgaria
Venue:
CLIB
SIG:
Publisher:
Department of Computational Linguistics, Institute for Bulgarian Language, Bulgarian Academy of Sciences
Note:
Pages:
220–226
Language:
URL:
https://aclanthology.org/2024.clib-1.24
DOI:
Bibkey:
Cite (ACL):
Fabio Maion, Tsvetana Dimitrova, and Andrej Bojadziev. 2024. Unified Annotation of the Stages of the Bulgarian Language. First Steps. In Proceedings of the Sixth International Conference on Computational Linguistics in Bulgaria (CLIB 2024), pages 220–226, Sofia, Bulgaria. Department of Computational Linguistics, Institute for Bulgarian Language, Bulgarian Academy of Sciences.
Cite (Informal):
Unified Annotation of the Stages of the Bulgarian Language. First Steps (Maion et al., CLIB 2024)
Copy Citation:
PDF:
https://aclanthology.org/2024.clib-1.24.pdf