Rongjie Yi
2025
Demystifying Small Language Models for Edge Deployment
Zhenyan Lu
|
Xiang Li
|
Dongqi Cai
|
Rongjie Yi
|
Fangming Liu
|
Wei Liu
|
Jian Luan
|
Xiwen Zhang
|
Nicholas D. Lane
|
Mengwei Xu
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Small language models (SLMs) have emerged as a promising solution for deploying resource-constrained devices, such as smartphones and Web of Things. This work presents the first comprehensive study of over 60 SLMs such as Microsoft Phi and Google Gemma that are publicly accessible. Our findings show that state-of-the-art SLMs outperform 7B models in general tasks, proving their practical viability. However, SLMs’ in-context learning capabilities remain limited, and their efficiency has significant optimization potential. We identify key SLM optimization opportunities, including dynamic task-specific routing, model-hardware co-design, and vocabulary/KV cache compression. Overall, we expect the work to reveal an all-sided landscape of SLMs, benefiting the research community across algorithm, model, system, and hardware levels.
DroidCall: A Dataset for LLM-powered Android Intent Invocation
Weikai Xie
|
Li Zhang
|
Shihe Wang
|
Rongjie Yi
|
Mengwei Xu
Findings of the Association for Computational Linguistics: EMNLP 2025
The growing capabilities of large language models in natural language understanding significantly strengthen existing agentic systems. To power performant on-device mobile agents for better data privacy, we introduce DroidCall, the first training and testing dataset for accurate Android Intent invocation. With a highly flexible and reusable data generation pipeline, we constructed 10k samples in DroidCall. Given a task instruction in natural language, small language models such as Qwen2.5-3B and Gemma2-2B fine-tuned with DroidCall can approach or even surpass the capabilities of GPT-4o for accurate Android intent invocation. We also provide an end-to-end Android app equipped with these fine-tuned models to demonstrate the Android intent invocation process. The code and dataset are available at https://github.com/UbiquitousLearning/DroidCall
Search
Fix author
Co-authors
- Mengwei Xu 2
- Dongqi Cai 1
- Nicholas D. Lane 1
- Xiang Li (李翔) 1
- Fangming Liu 1
- show all...