Cache & Distil: Optimising API Calls to Large Language Models Guillem Ramírez author Matthias Lindemann author Alexandra Birch author Ivan Titov author 2024-08 text Findings of the Association for Computational Linguistics: ACL 2024 Lun-Wei Ku editor Andre Martins editor Vivek Srikumar editor Association for Computational Linguistics Bangkok, Thailand conference publication ramirez-etal-2024-cache 10.18653/v1/2024.findings-acl.704 https://aclanthology.org/2024.findings-acl.704/ 2024-08 11838 11853