A Diachronic Treebank of Russian Spanning More Than a Thousand Years

Aleksandrs Berdicevskis, Hanne Eckhoff


Abstract
We describe the Tromsø Old Russian and Old Church Slavonic Treebank (TOROT) that spans from the earliest Old Church Slavonic to modern Russian texts, covering more than a thousand years of continuous language history. We focus on the latest additions to the treebank, first of all, the modern subcorpus that was created by a high-quality conversion of the existing treebank of contemporary standard Russian (SynTagRus).
Anthology ID:
2020.lrec-1.646
Volume:
Proceedings of the Twelfth Language Resources and Evaluation Conference
Month:
May
Year:
2020
Address:
Marseille, France
Editors:
Nicoletta Calzolari, Frédéric Béchet, Philippe Blache, Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, Hélène Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
SIG:
Publisher:
European Language Resources Association
Note:
Pages:
5251–5256
Language:
English
URL:
https://aclanthology.org/2020.lrec-1.646
DOI:
Bibkey:
Cite (ACL):
Aleksandrs Berdicevskis and Hanne Eckhoff. 2020. A Diachronic Treebank of Russian Spanning More Than a Thousand Years. In Proceedings of the Twelfth Language Resources and Evaluation Conference, pages 5251–5256, Marseille, France. European Language Resources Association.
Cite (Informal):
A Diachronic Treebank of Russian Spanning More Than a Thousand Years (Berdicevskis & Eckhoff, LREC 2020)
Copy Citation:
PDF:
https://aclanthology.org/2020.lrec-1.646.pdf