Historical Corpora of Bulgarian Language and Second Position Markers

Tsvetana Dimitrova, Andrej Boyadzhiev


Abstract
This paper demonstrates how historical corpora can be used to study language phenomena. We illustrate the advantages and disadvantages by exploring three of the available corpora containing Old and Middle Bulgarian textual sources, in order to shed light on some aspects of the development of two words of ambiguous class. We discuss the behaviour of these words to outline certain conditions for the diachronic change they have undergone. The three corpora are accessible online (and offline, for downloading search results, XML files, etc.).
Anthology ID:
2014.clib-1.8
Volume:
Proceedings of the First International Conference on Computational Linguistics in Bulgaria (CLIB 2014)
Month:
September
Year:
2014
Address:
Sofia, Bulgaria
Venue:
CLIB
Publisher:
Department of Computational Linguistics, Institute for Bulgarian Language, Bulgarian Academy of Sciences
Pages:
55–63
URL:
https://aclanthology.org/2014.clib-1.8
Cite (ACL):
Tsvetana Dimitrova and Andrej Boyadzhiev. 2014. Historical Corpora of Bulgarian Language and Second Position Markers. In Proceedings of the First International Conference on Computational Linguistics in Bulgaria (CLIB 2014), pages 55–63, Sofia, Bulgaria. Department of Computational Linguistics, Institute for Bulgarian Language, Bulgarian Academy of Sciences.
Cite (Informal):
Historical Corpora of Bulgarian Language and Second Position Markers (Dimitrova & Boyadzhiev, CLIB 2014)
PDF:
https://aclanthology.org/2014.clib-1.8.pdf