Siberian Ingrian Finnish: FST and IGTs

Ivan Ubaleht


Abstract
This paper presents the current version of the finite-state transducer for the Siberian Ingrian Finnish. Our finite-state transducer uses two-level morphology. We use LexC and TwolC languages together with HFST tools to develop lexicons and phonological rules, as well as to compile the transducer. The paper also provides a description of the morphological system of Siberian Ingrian Finnish. In addition, we present a collection of interlinear glossed texts in Siberian Ingrian Finnish, provided in a machine-readable format.
Anthology ID:
2025.iwclul-1.15
Volume:
Proceedings of the 10th International Workshop on Computational Linguistics for Uralic Languages
Month:
December
Year:
2025
Address:
Joensuu, Finland
Editors:
Mika Hämäläinen, Michael Rießler, Eiaki V. Morooka, Lev Kharlashkin
Venues:
IWCLUL | WS
SIG:
SIGUR
Publisher:
Association for Computational Linguistics
Note:
Pages:
123–126
Language:
URL:
https://aclanthology.org/2025.iwclul-1.15/
DOI:
Bibkey:
Cite (ACL):
Ivan Ubaleht. 2025. Siberian Ingrian Finnish: FST and IGTs. In Proceedings of the 10th International Workshop on Computational Linguistics for Uralic Languages, pages 123–126, Joensuu, Finland. Association for Computational Linguistics.
Cite (Informal):
Siberian Ingrian Finnish: FST and IGTs (Ubaleht, IWCLUL 2025)
Copy Citation:
PDF:
https://aclanthology.org/2025.iwclul-1.15.pdf