Title: Pre-training Data Quality and Quantity for a Low-Resource Language: New Corpus and BERT Models for Maltese
Authors: Kurt Micallef, Albert Gatt, Marc Tanti, Lonneke van der Plas, Claudia Borg
Date: 2022-07
Published in: Proceedings of the Third Workshop on Deep Learning for Low-Resource Natural Language Processing
Editors: Colin Cherry, Angela Fan, George Foster, Gholamreza (Reza) Haffari, Shahram Khadivi, Nanyun (Violet) Peng, Xiang Ren, Ehsan Shareghi, Swabha Swayamdipta
Publisher: Association for Computational Linguistics
Location: Hybrid
Genre: conference publication
Citation key: micallef-etal-2022-pre
DOI: 10.18653/v1/2022.deeplo-1.10
URL: https://aclanthology.org/2022.deeplo-1.10/
Pages: 90-101