INGENIOUS: Using Informative Data Subsets for Efficient Pre-Training of Language Models H S V N S Kowndinya Renduchintala author Krishnateja Killamsetty author Sumit Bhatia author Milan Aggarwal author Ganesh Ramakrishnan author Rishabh Iyer author Balaji Krishnamurthy author 2023-12 text Findings of the Association for Computational Linguistics: EMNLP 2023 Houda Bouamor editor Juan Pino editor Kalika Bali editor Association for Computational Linguistics Singapore conference publication renduchintala-etal-2023-ingenious 10.18653/v1/2023.findings-emnlp.445 https://aclanthology.org/2023.findings-emnlp.445/ 2023-12 6690 6705