STAPI: An Automatic Scraper for Extracting Iterative Title-Text Structure from Web Documents Nan Zhang author Shomir Wilson author Prasenjit Mitra author 2022-06 text Proceedings of the Thirteenth Language Resources and Evaluation Conference Nicoletta Calzolari editor Frédéric Béchet editor Philippe Blache editor Khalid Choukri editor Christopher Cieri editor Thierry Declerck editor Sara Goggi editor Hitoshi Isahara editor Bente Maegaard editor Joseph Mariani editor Hélène Mazo editor Jan Odijk editor Stelios Piperidis editor European Language Resources Association Marseille, France conference publication zhang-etal-2022-stapi https://aclanthology.org/2022.lrec-1.371/ 2022-06 3461 3470