Octopus: On-device language model for function calling of software APIs

Wei Chen; Zhiyuan Li; Mingyuan Ma

doi:10.18653/v1/2025.naacl-industry.27

Octopus: On-device language model for function calling of software APIs

Abstract

Large Language Models (LLMs) are pivotal for advanced text processing and generation. This study presents a framework to train a series of on-device LLMs optimized for invoking software APIs. Using a curated dataset of 30,000 API function calls from software documentation, we fine-tune LLMs with 2B, 3B, and 7B parameters to enhance their proficiency in API interactions. Our approach improves the understanding of API structures and syntax, leading to significantly better accuracy in API function calls. We also propose a conditional masking technique to enforce correct output formats, reducing errors while maintaining inference speed, specifically tailored for API tasks. The fine-tuned model, Octopus, outperforms GPT-4 in API calling tasks, showcasing advancements in automated software development and API integration. The model checkpoints are publicly available.

Anthology ID:: 2025.naacl-industry.27
Volume:: Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 3: Industry Track)
Month:: April
Year:: 2025
Address:: Albuquerque, New Mexico
Editors:: Weizhu Chen, Yi Yang, Mohammad Kachuee, Xue-Yong Fu
Venue:: NAACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 329–339
Language:
URL:: https://aclanthology.org/2025.naacl-industry.27/
DOI:: 10.18653/v1/2025.naacl-industry.27
Bibkey:
Cite (ACL):: Wei Chen, Zhiyuan Li, and Mingyuan Ma. 2025. Octopus: On-device language model for function calling of software APIs. In Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 3: Industry Track), pages 329–339, Albuquerque, New Mexico. Association for Computational Linguistics.
Cite (Informal):: Octopus: On-device language model for function calling of software APIs (Chen et al., NAACL 2025)
Copy Citation:
PDF:: https://aclanthology.org/2025.naacl-industry.27.pdf

PDF Cite Search Fix data