%0 Conference Proceedings %T HeLju@VarDial 2020: Social Media Variety Geolocation with BERT Models %A Scherrer, Yves %A Ljubešić, Nikola %Y Zampieri, Marcos %Y Nakov, Preslav %Y Ljubešić, Nikola %Y Tiedemann, Jörg %Y Scherrer, Yves %S Proceedings of the 7th Workshop on NLP for Similar Languages, Varieties and Dialects %D 2020 %8 December %I International Committee on Computational Linguistics (ICCL) %C Barcelona, Spain (Online) %F scherrer-ljubesic-2020-helju %X This paper describes the Helsinki-Ljubljana contribution to the VarDial shared task on social media variety geolocation. Our solutions are based on the BERT Transformer models, the constrained versions of our models reaching 1st place in two subtasks and 3rd place in one subtask, while our unconstrained models outperform all the constrained systems by a large margin. We show in our analyses that Transformer-based models outperform traditional models by far, and that improvements obtained by pre-training models on large quantities of (mostly standard) text are significant, but not drastic, with single-language models also outperforming multilingual models. Our manual analysis shows that two types of signals are the most crucial for a (mis)prediction: named entities and dialectal features, both of which are handled well by our models. %U https://aclanthology.org/2020.vardial-1.19 %P 202-211