NoWaC: a large web-based corpus for Norwegian Emiliano Raul Guevara author 2010-06 text Proceedings of the NAACL HLT 2010 Sixth Web as Corpus Workshop Adam Kilgarriff editor Dekang Lin editor Association for Computational Linguistics NAACL-HLT, Los Angeles conference publication guevara-2010-nowac https://aclanthology.org/W10-1501/ 2010-06 1 7