The ELTE.DH Pilot Corpus – Creating a Handcrafted Gigaword Web Corpus with Metadata Balázs Indig author Árpád Knap author Zsófia Sárközi-Lindner author Mária Timári author Gábor Palkó author 2020-05 text eng Proceedings of the 12th Web as Corpus Workshop Adrien Barbaresi editor Felix Bildhauer editor Roland Schäfer editor Egon Stemle editor European Language Resources Association Marseille, France conference publication 979-10-95546-68-9 indig-etal-2020-elte https://aclanthology.org/2020.wac-1.5/ 2020-05 33 41