Uralic Language Identification (ULI) 2020 shared task dataset and the Wanca 2017 corpora Tommi Jauhiainen author Heidi Jauhiainen author Niko Partanen author Krister Lindén author 2020-12 text Proceedings of the 7th Workshop on NLP for Similar Languages, Varieties and Dialects Marcos Zampieri editor Preslav Nakov editor Nikola Ljubešić editor Jörg Tiedemann editor Yves Scherrer editor International Committee on Computational Linguistics (ICCL) Barcelona, Spain (Online) conference publication jauhiainen-etal-2020-uralic https://aclanthology.org/2020.vardial-1.16/ 2020-12 173 185