EMILLE, A 67-Million Word Corpus of Indic Languages: Data Collection, Mark-up and Harmonisation Paul Baker author Andrew Hardie author Tony McEnery author Hamish Cunningham author Rob Gaizauskas author 2002-05 text Proceedings of the Third International Conference on Language Resources and Evaluation (LREC’02) Manuel González Rodríguez editor Carmen Paz Suarez Araujo editor European Language Resources Association (ELRA) Las Palmas, Canary Islands - Spain conference publication baker-etal-2002-emille https://aclanthology.org/L02-1319/ 2002-05