Title: Categorizing Web Pages as a Preprocessing Step for Information Extraction
Authors: Viktor Pekar, Richard Evans, Ruslan Mitkov
Date: 2004-05
Type: text (conference publication)
Venue: Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC’04)
Editors: Maria Teresa Lino, Maria Francisca Xavier, Fátima Ferreira, Rute Costa, Raquel Silva
Publisher: European Language Resources Association (ELRA)
Location: Lisbon, Portugal
Identifier: pekar-etal-2004-categorizing
URL: https://aclanthology.org/L04-1328/