Specialized Monolingual BPE Tokenizers for Uralic Languages Representation in Large Language Models Iaroslav Chelombitko author Aleksey Komissarov author 2024-11 text Proceedings of the 9th International Workshop on Computational Linguistics for Uralic Languages Mika Hämäläinen editor Flammie Pirinen editor Melany Macias editor Mario Crespo Avila editor Association for Computational Linguistics Helsinki, Finland conference publication chelombitko-komissarov-2024-specialized https://aclanthology.org/2024.iwclul-1.11/ 2024-11 89 95