Comparing Methods for Measuring Dialect Similarity in Norwegian

Janne Johannessen, Andre Kåsen, Kristin Hagen, Anders Nøklestad, Joel Priestley


Abstract
The present article presents four experiments with two different methods for measuring dialect similarity in Norwegian: the Levenshtein method and the neural long short term memory (LSTM) autoencoder network, a machine learning algorithm. The visual output in the form of dialect maps is then compared with canonical maps found in the dialect literature. All of this enables us to say that one does not need fine-grained transcriptions of speech to replicate classical classification patterns.
Anthology ID:
2020.lrec-1.658
Volume:
Proceedings of the Twelfth Language Resources and Evaluation Conference
Month:
May
Year:
2020
Address:
Marseille, France
Venue:
LREC
SIG:
Publisher:
European Language Resources Association
Note:
Pages:
5343–5350
Language:
English
URL:
https://aclanthology.org/2020.lrec-1.658
DOI:
Bibkey:
Cite (ACL):
Janne Johannessen, Andre Kåsen, Kristin Hagen, Anders Nøklestad, and Joel Priestley. 2020. Comparing Methods for Measuring Dialect Similarity in Norwegian. In Proceedings of the Twelfth Language Resources and Evaluation Conference, pages 5343–5350, Marseille, France. European Language Resources Association.
Cite (Informal):
Comparing Methods for Measuring Dialect Similarity in Norwegian (Johannessen et al., LREC 2020)
Copy Citation:
PDF:
https://aclanthology.org/2020.lrec-1.658.pdf