A Warm Start and a Clean Crawled Corpus - A Recipe for Good Language Models Vésteinn Snæbjarnarson author Haukur Barri Símonarson author Pétur Orri Ragnarsson author Svanhvít Lilja Ingólfsdóttir author Haukur Jónsson author Vilhjalmur Thorsteinsson author Hafsteinn Einarsson author 2022-06 text Proceedings of the Thirteenth Language Resources and Evaluation Conference Nicoletta Calzolari editor Frédéric Béchet editor Philippe Blache editor Khalid Choukri editor Christopher Cieri editor Thierry Declerck editor Sara Goggi editor Hitoshi Isahara editor Bente Maegaard editor Joseph Mariani editor Hélène Mazo editor Jan Odijk editor Stelios Piperidis editor European Language Resources Association Marseille, France conference publication snaebjarnarson-etal-2022-warm https://aclanthology.org/2022.lrec-1.464/ 2022-06 4356 4366