Multi-armed bandits for resource efficient, online optimization of language model pre-training: the use case of dynamic masking Inigo Urteaga author Moulay Zaidane Draidia author Tomer Lancewicki author Shahram Khadivi author 2023-07 text Findings of the Association for Computational Linguistics: ACL 2023 Anna Rogers editor Jordan Boyd-Graber editor Naoaki Okazaki editor Association for Computational Linguistics Toronto, Canada conference publication urteaga-etal-2023-multi 10.18653/v1/2023.findings-acl.675 https://aclanthology.org/2023.findings-acl.675/ 2023-07 10609 10627