%0 Conference Proceedings %T Proceedings of the 7th Workshop on NLP for Similar Languages, Varieties and Dialects %E Zampieri, Marcos %E Nakov, Preslav %E Ljubešić, Nikola %E Tiedemann, Jörg %E Scherrer, Yves %D 2020 %8 December %I International Committee on Computational Linguistics (ICCL) %C Barcelona, Spain (Online) %F vardial-2020-nlp %U https://aclanthology.org/2020.vardial-1.0 %0 Conference Proceedings %T A Report on the VarDial Evaluation Campaign 2020 %A Gaman, Mihaela %A Hovy, Dirk %A Ionescu, Radu Tudor %A Jauhiainen, Heidi %A Jauhiainen, Tommi %A Lindén, Krister %A Ljubešić, Nikola %A Partanen, Niko %A Purschke, Christoph %A Scherrer, Yves %A Zampieri, Marcos %Y Zampieri, Marcos %Y Nakov, Preslav %Y Ljubešić, Nikola %Y Tiedemann, Jörg %Y Scherrer, Yves %S Proceedings of the 7th Workshop on NLP for Similar Languages, Varieties and Dialects %D 2020 %8 December %I International Committee on Computational Linguistics (ICCL) %C Barcelona, Spain (Online) %F gaman-etal-2020-report %U https://aclanthology.org/2020.vardial-1.1 %P 1-14 %0 Conference Proceedings %T ASR for Non-standardised Languages with Dialectal Variation: the case of Swiss German %A Nigmatulina, Iuliia %A Kew, Tannon %A Samardzic, Tanja %Y Zampieri, Marcos %Y Nakov, Preslav %Y Ljubešić, Nikola %Y Tiedemann, Jörg %Y Scherrer, Yves %S Proceedings of the 7th Workshop on NLP for Similar Languages, Varieties and Dialects %D 2020 %8 December %I International Committee on Computational Linguistics (ICCL) %C Barcelona, Spain (Online) %F nigmatulina-etal-2020-asr %U https://aclanthology.org/2020.vardial-1.2 %P 15-24 %0 Conference Proceedings %T LSDC - A comprehensive dataset for Low Saxon Dialect Classification %A Siewert, Janine %A Scherrer, Yves %A Wieling, Martijn %A Tiedemann, Jörg %Y Zampieri, Marcos %Y Nakov, Preslav %Y Ljubešić, Nikola %Y Tiedemann, Jörg %Y Scherrer, Yves %S Proceedings of the 7th Workshop on NLP for Similar Languages, Varieties and Dialects %D 2020 %8 December %I International Committee on Computational Linguistics (ICCL) %C Barcelona, Spain (Online) %F siewert-etal-2020-lsdc %U https://aclanthology.org/2020.vardial-1.3 %P 25-35 %0 Conference Proceedings %T Machine-oriented NMT Adaptation for Zero-shot NLP tasks: Comparing the Usefulness of Close and Distant Languages %A Tebbifakhr, Amirhossein %A Negri, Matteo %A Turchi, Marco %Y Zampieri, Marcos %Y Nakov, Preslav %Y Ljubešić, Nikola %Y Tiedemann, Jörg %Y Scherrer, Yves %S Proceedings of the 7th Workshop on NLP for Similar Languages, Varieties and Dialects %D 2020 %8 December %I International Committee on Computational Linguistics (ICCL) %C Barcelona, Spain (Online) %F tebbifakhr-etal-2020-machine %U https://aclanthology.org/2020.vardial-1.4 %P 36-46 %0 Conference Proceedings %T Character Alignment in Morphologically Complex Translation Sets for Related Languages %A Gasser, Michael %A Seyoum, Binyam Ephrem %A Kifle, Nazareth Amlesom %Y Zampieri, Marcos %Y Nakov, Preslav %Y Ljubešić, Nikola %Y Tiedemann, Jörg %Y Scherrer, Yves %S Proceedings of the 7th Workshop on NLP for Similar Languages, Varieties and Dialects %D 2020 %8 December %I International Committee on Computational Linguistics (ICCL) %C Barcelona, Spain (Online) %F gasser-etal-2020-character %U https://aclanthology.org/2020.vardial-1.5 %P 47-56 %0 Conference Proceedings %T Bilingual Lexicon Induction across Orthographically-distinct Under-Resourced Dravidian Languages %A Chakravarthi, Bharathi Raja %A Rajasekaran, Navaneethan %A Arcan, Mihael %A McGuinness, Kevin %A E. O’Connor, Noel %A McCrae, John P. %Y Zampieri, Marcos %Y Nakov, Preslav %Y Ljubešić, Nikola %Y Tiedemann, Jörg %Y Scherrer, Yves %S Proceedings of the 7th Workshop on NLP for Similar Languages, Varieties and Dialects %D 2020 %8 December %I International Committee on Computational Linguistics (ICCL) %C Barcelona, Spain (Online) %F chakravarthi-etal-2020-bilingual %U https://aclanthology.org/2020.vardial-1.6 %P 57-69 %0 Conference Proceedings %T Building a Corpus for the Zaza–Gorani Language Family %A Ahmadi, Sina %Y Zampieri, Marcos %Y Nakov, Preslav %Y Ljubešić, Nikola %Y Tiedemann, Jörg %Y Scherrer, Yves %S Proceedings of the 7th Workshop on NLP for Similar Languages, Varieties and Dialects %D 2020 %8 December %I International Committee on Computational Linguistics (ICCL) %C Barcelona, Spain (Online) %F ahmadi-2020-building %U https://aclanthology.org/2020.vardial-1.7 %P 70-78 %0 Conference Proceedings %T Dealing with dialectal variation in the construction of the Basque historical corpus %A Estarrona, Ainara %A Etxeberria, Izaskun %A Etxepare, Ricardo %A Padilla-Moyano, Manuel %A Soraluze, Ander %Y Zampieri, Marcos %Y Nakov, Preslav %Y Ljubešić, Nikola %Y Tiedemann, Jörg %Y Scherrer, Yves %S Proceedings of the 7th Workshop on NLP for Similar Languages, Varieties and Dialects %D 2020 %8 December %I International Committee on Computational Linguistics (ICCL) %C Barcelona, Spain (Online) %F estarrona-etal-2020-dealing %U https://aclanthology.org/2020.vardial-1.8 %P 79-89 %0 Conference Proceedings %T Recycling and Comparing Morphological Annotation Models for Armenian Diachronic-Variational Corpus Processing %A Vidal-Gorène, Chahan %A Khurshudyan, Victoria %A Donabédian-Demopoulos, Anaïd %Y Zampieri, Marcos %Y Nakov, Preslav %Y Ljubešić, Nikola %Y Tiedemann, Jörg %Y Scherrer, Yves %S Proceedings of the 7th Workshop on NLP for Similar Languages, Varieties and Dialects %D 2020 %8 December %I International Committee on Computational Linguistics (ICCL) %C Barcelona, Spain (Online) %F vidal-gorene-etal-2020-recycling %U https://aclanthology.org/2020.vardial-1.9 %P 90-101 %0 Conference Proceedings %T Neural Machine Translation for translating into Croatian and Serbian %A Popović, Maja %A Poncelas, Alberto %A Brkic, Marija %A Way, Andy %Y Zampieri, Marcos %Y Nakov, Preslav %Y Ljubešić, Nikola %Y Tiedemann, Jörg %Y Scherrer, Yves %S Proceedings of the 7th Workshop on NLP for Similar Languages, Varieties and Dialects %D 2020 %8 December %I International Committee on Computational Linguistics (ICCL) %C Barcelona, Spain (Online) %F popovic-etal-2020-neural %U https://aclanthology.org/2020.vardial-1.10 %P 102-113 %0 Conference Proceedings %T A Tokenization System for the Kurdish Language %A Ahmadi, Sina %Y Zampieri, Marcos %Y Nakov, Preslav %Y Ljubešić, Nikola %Y Tiedemann, Jörg %Y Scherrer, Yves %S Proceedings of the 7th Workshop on NLP for Similar Languages, Varieties and Dialects %D 2020 %8 December %I International Committee on Computational Linguistics (ICCL) %C Barcelona, Spain (Online) %F ahmadi-2020-tokenization %U https://aclanthology.org/2020.vardial-1.11 %P 114-127 %0 Conference Proceedings %T Rediscovering the Slavic Continuum in Representations Emerging from Neural Models of Spoken Language Identification %A Abdullah, Badr M. %A Kudera, Jacek %A Avgustinova, Tania %A Möbius, Bernd %A Klakow, Dietrich %Y Zampieri, Marcos %Y Nakov, Preslav %Y Ljubešić, Nikola %Y Tiedemann, Jörg %Y Scherrer, Yves %S Proceedings of the 7th Workshop on NLP for Similar Languages, Varieties and Dialects %D 2020 %8 December %I International Committee on Computational Linguistics (ICCL) %C Barcelona, Spain (Online) %F abdullah-etal-2020-rediscovering %U https://aclanthology.org/2020.vardial-1.12 %P 128-139 %0 Conference Proceedings %T A Four-Dialect Treebank for Occitan: Building Process and Parsing Experiments %A Miletic, Aleksandra %A Bras, Myriam %A Vergez-Couret, Marianne %A Esher, Louise %A Poujade, Clamença %A Sibille, Jean %Y Zampieri, Marcos %Y Nakov, Preslav %Y Ljubešić, Nikola %Y Tiedemann, Jörg %Y Scherrer, Yves %S Proceedings of the 7th Workshop on NLP for Similar Languages, Varieties and Dialects %D 2020 %8 December %I International Committee on Computational Linguistics (ICCL) %C Barcelona, Spain (Online) %F miletic-etal-2020-four %U https://aclanthology.org/2020.vardial-1.13 %P 140-149 %0 Conference Proceedings %T Vulgaris: Analysis of a Corpus for Middle-Age Varieties of Italian Language %A Zugarini, Andrea %A Tiezzi, Matteo %A Maggini, Marco %Y Zampieri, Marcos %Y Nakov, Preslav %Y Ljubešić, Nikola %Y Tiedemann, Jörg %Y Scherrer, Yves %S Proceedings of the 7th Workshop on NLP for Similar Languages, Varieties and Dialects %D 2020 %8 December %I International Committee on Computational Linguistics (ICCL) %C Barcelona, Spain (Online) %F zugarini-etal-2020-vulgaris %U https://aclanthology.org/2020.vardial-1.14 %P 150-159 %0 Conference Proceedings %T Towards Augmenting Lexical Resources for Slang and African American English %A Hwang, Alyssa %A Frey, William R. %A McKeown, Kathleen %Y Zampieri, Marcos %Y Nakov, Preslav %Y Ljubešić, Nikola %Y Tiedemann, Jörg %Y Scherrer, Yves %S Proceedings of the 7th Workshop on NLP for Similar Languages, Varieties and Dialects %D 2020 %8 December %I International Committee on Computational Linguistics (ICCL) %C Barcelona, Spain (Online) %F hwang-etal-2020-towards %U https://aclanthology.org/2020.vardial-1.15 %P 160-172 %0 Conference Proceedings %T Uralic Language Identification (ULI) 2020 shared task dataset and the Wanca 2017 corpora %A Jauhiainen, Tommi %A Jauhiainen, Heidi %A Partanen, Niko %A Lindén, Krister %Y Zampieri, Marcos %Y Nakov, Preslav %Y Ljubešić, Nikola %Y Tiedemann, Jörg %Y Scherrer, Yves %S Proceedings of the 7th Workshop on NLP for Similar Languages, Varieties and Dialects %D 2020 %8 December %I International Committee on Computational Linguistics (ICCL) %C Barcelona, Spain (Online) %F jauhiainen-etal-2020-uralic %U https://aclanthology.org/2020.vardial-1.16 %P 173-185 %0 Conference Proceedings %T Dialect Identification under Domain Shift: Experiments with Discriminating Romanian and Moldavian %A Çöltekin, Çağrı %Y Zampieri, Marcos %Y Nakov, Preslav %Y Ljubešić, Nikola %Y Tiedemann, Jörg %Y Scherrer, Yves %S Proceedings of the 7th Workshop on NLP for Similar Languages, Varieties and Dialects %D 2020 %8 December %I International Committee on Computational Linguistics (ICCL) %C Barcelona, Spain (Online) %F coltekin-2020-dialect %U https://aclanthology.org/2020.vardial-1.17 %P 186-192 %0 Conference Proceedings %T Applying Multilingual and Monolingual Transformer-Based Models for Dialect Identification %A Popa, Cristian %A \textcommabelowStefănescu, Vlad %Y Zampieri, Marcos %Y Nakov, Preslav %Y Ljubešić, Nikola %Y Tiedemann, Jörg %Y Scherrer, Yves %S Proceedings of the 7th Workshop on NLP for Similar Languages, Varieties and Dialects %D 2020 %8 December %I International Committee on Computational Linguistics (ICCL) %C Barcelona, Spain (Online) %F popa-stefanescu-2020-applying %U https://aclanthology.org/2020.vardial-1.18 %P 193-201 %0 Conference Proceedings %T HeLju@VarDial 2020: Social Media Variety Geolocation with BERT Models %A Scherrer, Yves %A Ljubešić, Nikola %Y Zampieri, Marcos %Y Nakov, Preslav %Y Ljubešić, Nikola %Y Tiedemann, Jörg %Y Scherrer, Yves %S Proceedings of the 7th Workshop on NLP for Similar Languages, Varieties and Dialects %D 2020 %8 December %I International Committee on Computational Linguistics (ICCL) %C Barcelona, Spain (Online) %F scherrer-ljubesic-2020-helju %U https://aclanthology.org/2020.vardial-1.19 %P 202-211 %0 Conference Proceedings %T A dual-encoding system for dialect classification %A Rebeja, Petru %A Cristea, Dan %Y Zampieri, Marcos %Y Nakov, Preslav %Y Ljubešić, Nikola %Y Tiedemann, Jörg %Y Scherrer, Yves %S Proceedings of the 7th Workshop on NLP for Similar Languages, Varieties and Dialects %D 2020 %8 December %I International Committee on Computational Linguistics (ICCL) %C Barcelona, Spain (Online) %F rebeja-cristea-2020-dual %U https://aclanthology.org/2020.vardial-1.20 %P 212-219 %0 Conference Proceedings %T Experiments in Language Variety Geolocation and Dialect Identification %A Jauhiainen, Tommi %A Jauhiainen, Heidi %A Lindén, Krister %Y Zampieri, Marcos %Y Nakov, Preslav %Y Ljubešić, Nikola %Y Tiedemann, Jörg %Y Scherrer, Yves %S Proceedings of the 7th Workshop on NLP for Similar Languages, Varieties and Dialects %D 2020 %8 December %I International Committee on Computational Linguistics (ICCL) %C Barcelona, Spain (Online) %F jauhiainen-etal-2020-experiments %U https://aclanthology.org/2020.vardial-1.21 %P 220-231 %0 Conference Proceedings %T Exploring the Power of Romanian BERT for Dialect Identification %A Zaharia, George-Eduard %A Avram, Andrei-Marius %A Cercel, Dumitru-Clementin %A Rebedea, Traian %Y Zampieri, Marcos %Y Nakov, Preslav %Y Ljubešić, Nikola %Y Tiedemann, Jörg %Y Scherrer, Yves %S Proceedings of the 7th Workshop on NLP for Similar Languages, Varieties and Dialects %D 2020 %8 December %I International Committee on Computational Linguistics (ICCL) %C Barcelona, Spain (Online) %F zaharia-etal-2020-exploring %U https://aclanthology.org/2020.vardial-1.22 %P 232-241 %0 Conference Proceedings %T Combining Deep Learning and String Kernels for the Localization of Swiss German Tweets %A Gaman, Mihaela %A Ionescu, Radu Tudor %Y Zampieri, Marcos %Y Nakov, Preslav %Y Ljubešić, Nikola %Y Tiedemann, Jörg %Y Scherrer, Yves %S Proceedings of the 7th Workshop on NLP for Similar Languages, Varieties and Dialects %D 2020 %8 December %I International Committee on Computational Linguistics (ICCL) %C Barcelona, Spain (Online) %F gaman-ionescu-2020-combining %U https://aclanthology.org/2020.vardial-1.23 %P 242-253 %0 Conference Proceedings %T ZHAW-InIT - Social Media Geolocation at VarDial 2020 %A Benites, Fernando %A Hürlimann, Manuela %A von Däniken, Pius %A Cieliebak, Mark %Y Zampieri, Marcos %Y Nakov, Preslav %Y Ljubešić, Nikola %Y Tiedemann, Jörg %Y Scherrer, Yves %S Proceedings of the 7th Workshop on NLP for Similar Languages, Varieties and Dialects %D 2020 %8 December %I International Committee on Computational Linguistics (ICCL) %C Barcelona, Spain (Online) %F benites-etal-2020-zhaw %U https://aclanthology.org/2020.vardial-1.24 %P 254-264 %0 Conference Proceedings %T Discriminating between standard Romanian and Moldavian tweets using filtered character ngrams %A Ceolin, Andrea %A Zhang, Hong %Y Zampieri, Marcos %Y Nakov, Preslav %Y Ljubešić, Nikola %Y Tiedemann, Jörg %Y Scherrer, Yves %S Proceedings of the 7th Workshop on NLP for Similar Languages, Varieties and Dialects %D 2020 %8 December %I International Committee on Computational Linguistics (ICCL) %C Barcelona, Spain (Online) %F ceolin-zhang-2020-discriminating %U https://aclanthology.org/2020.vardial-1.25 %P 265-272 %0 Conference Proceedings %T Challenges in Neural Language Identification: NRC at VarDial 2020 %A Bernier-Colborne, Gabriel %A Goutte, Cyril %Y Zampieri, Marcos %Y Nakov, Preslav %Y Ljubešić, Nikola %Y Tiedemann, Jörg %Y Scherrer, Yves %S Proceedings of the 7th Workshop on NLP for Similar Languages, Varieties and Dialects %D 2020 %8 December %I International Committee on Computational Linguistics (ICCL) %C Barcelona, Spain (Online) %F bernier-colborne-goutte-2020-challenges %U https://aclanthology.org/2020.vardial-1.26 %P 273-282 %0 Conference Proceedings %T Geolocation of Tweets with a BiLSTM Regression Model %A Mishra, Piyush %Y Zampieri, Marcos %Y Nakov, Preslav %Y Ljubešić, Nikola %Y Tiedemann, Jörg %Y Scherrer, Yves %S Proceedings of the 7th Workshop on NLP for Similar Languages, Varieties and Dialects %D 2020 %8 December %I International Committee on Computational Linguistics (ICCL) %C Barcelona, Spain (Online) %F mishra-2020-geolocation %U https://aclanthology.org/2020.vardial-1.27 %P 283-289