%0 Conference Proceedings %T Regression Analysis of Lexical and Morpho-Syntactic Properties of Kiezdeutsch %A Frassinelli, Diego %A Lapesa, Gabriella %A Alatrash, Reem %A Schlechtweg, Dominik %A Schulte im Walde, Sabine %Y Zampieri, Marcos %Y Nakov, Preslav %Y Ljubešić, Nikola %Y Tiedemann, Jörg %Y Scherrer, Yves %Y Jauhiainen, Tommi %S Proceedings of the Eighth Workshop on NLP for Similar Languages, Varieties and Dialects %D 2021 %8 April %I Association for Computational Linguistics %C Kiyv, Ukraine %F frassinelli-etal-2021-regression %X Kiezdeutsch is a variety of German predominantly spoken by teenagers from multi-ethnic urban neighborhoods in casual conversations with their peers. In recent years, the popularity of Kiezdeutsch has increased among young people, independently of their socio-economic origin, and has spread in social media, too. While previous studies have extensively investigated this language variety from a linguistic and qualitative perspective, not much has been done from a quantitative point of view. We perform the first large-scale data-driven analysis of the lexical and morpho-syntactic properties of Kiezdeutsch in comparison with standard German. At the level of results, we confirm predictions of previous qualitative analyses and integrate them with further observations on specific linguistic phenomena such as slang and self-centered speaker attitude. At the methodological level, we provide logistic regression as a framework to perform bottom-up feature selection in order to quantify differences across language varieties. %U https://aclanthology.org/2021.vardial-1.3 %P 21-27