%0 Conference Proceedings
%T The OPUS Corpus - Parallel and Free: http://logos.uio.no/opus
%A Tiedemann, Jörg
%A Nygaard, Lars
%Y Lino, Maria Teresa
%Y Xavier, Maria Francisca
%Y Ferreira, Fátima
%Y Costa, Rute
%Y Silva, Raquel
%S Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC’04)
%D 2004
%8 May
%I European Language Resources Association (ELRA)
%C Lisbon, Portugal
%F tiedemann-nygaard-2004-opus
%X The OPUS corpus is a growing collection of translated documents collected from the internet. The current version contains about 30 million words in 60 languages. The entire corpus is sentence aligned and it also contains linguistic markup for certain languages.
%U http://www.lrec-conf.org/proceedings/lrec2004/pdf/320.pdf