Crayon: Customized On-Device LLM via Instant Adapter Blending and Edge-Server Hybrid Inference Jihwan Bang author Juntae Lee author Kyuhong Shim author Seunghan Yang author Simyung Chang author 2024-08 text Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) Lun-Wei Ku editor Andre Martins editor Vivek Srikumar editor Association for Computational Linguistics Bangkok, Thailand conference publication bang-etal-2024-crayon 10.18653/v1/2024.acl-long.204 https://aclanthology.org/2024.acl-long.204/ 2024-08 3720 3731