AscendKernelGen: LLM-Driven Kernel Generation for NPUs

Xinzi Cao; Jianyang Zhai; Pengfei Li; Zhiheng Hu; Cen Yan; Mubingxu; Guanghuan Fang; Bin She; Jiayu Li; Yihan Su; Dongyang Tao; Feidiao Yang; Chang-Dong Wang; Yutong Lu; Weicheng Xue; Bin Zhou; Yonghong Tian

AscendKernelGen: LLM-Driven Kernel Generation for NPUs

Xinzi Cao, Jianyang Zhai, Pengfei Li, Zhiheng Hu, Cen Yan, Mubingxu, Guanghuan Fang, Bin She, Jiayu Li, Yihan Su, Dongyang Tao, Feidiao Yang, Chang-Dong Wang, Yutong Lu, Weicheng Xue, Bin Zhou, Yonghong Tian

Abstract

Neural Processing Units (NPUs) are critical for AI infrastructure, yet developing kernels remains a bottleneck due to the complexity of vendor-specific Domain-Specific Languages (DSLs). While LLMs excel in general coding, they fail to meet the stringent constraints of NPU development, showing a near-zero success rate on complex kernels in our preliminary study. To address these challenges, we present AscendKernelGen, the first comprehensive framework for NPU kernel development, marking a pioneering effort in this field. This framework consists of three interconnected components: (1) Ascend-CoT, the first dataset in the NPU kernel domain that incorporates chain-of-thought reasoning from real-world kernel implementations; (2) KernelGen-LM, a domain-adaptive model trained on this novel dataset using supervised fine-tuning and reinforcement learning; and (3) NPUKernelBench, the first benchmark platform designed to evaluate the compilation, correctness, and performance of generated NPU kernels. Experimental results demonstrate that our approach dramatically bridges the gap in hardware-specific coding: compilation success on complex Level-2 kernels improves from 0% to 95.5% (Pass@10), with 64% functional correctness. AscendKernGen is available at AscendKernGen and NPUKernelBench.

Anthology ID:: 2026.findings-acl.1533
Volume:: Findings of the Association for Computational Linguistics: ACL 2026
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:: Findings
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 30693–30718
Language:
URL:: https://aclanthology.org/2026.findings-acl.1533/
DOI:
Bibkey:
Cite (ACL):: Xinzi Cao, Jianyang Zhai, Pengfei Li, Zhiheng Hu, Cen Yan, Mubingxu, Guanghuan Fang, Bin She, Jiayu Li, Yihan Su, Dongyang Tao, Feidiao Yang, Chang-Dong Wang, Yutong Lu, Weicheng Xue, Bin Zhou, and Yonghong Tian. 2026. AscendKernelGen: LLM-Driven Kernel Generation for NPUs. In Findings of the Association for Computational Linguistics: ACL 2026, pages 30693–30718, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: AscendKernelGen: LLM-Driven Kernel Generation for NPUs (Cao et al., Findings 2026)
Copy Citation:
PDF:: https://aclanthology.org/2026.findings-acl.1533.pdf
Checklist:: 2026.findings-acl.1533.checklist.pdf

PDF Cite Search Checklist Fix data