ArkRepoBench: A Repository-Level Code Completion Benchmark for HarmonyOS Development

Yanlin Wang; Bowen Zhang; Yanli Wang; Daya Guo; Terry Yue Zhuo; Jiachi Chen; Mingwei Liu; Xingong Zhang; Zibin Zheng

ArkRepoBench: A Repository-Level Code Completion Benchmark for HarmonyOS Development

Yanlin Wang, Bowen Zhang, Yanli Wang, Daya Guo, Terry Yue Zhuo, Jiachi Chen, Mingwei Liu, Xingong Zhang, Zibin Zheng

Abstract

ArkTS is the primary programming language for Huawei’s HarmonyOS ecosystem, which has expanded across smartphones, tablets, and IoT devices. While large language models have demonstrated strong code generation capabilities for mainstream languages, their performance on ArkTS remains largely unexplored. To address this gap, we introduce ArkRepoBench, the first repository-level code completion benchmark for ArkTS to our knowledge, 7,519 samples from 20 official HarmonyOS repositories. The benchmark covers multiple difficulty levels and further categorizes completion instances into Single-File, Cross-File Independent, and Cross-File Dependent settings based on dependency analysis, distinguishing the mere presence of cross-file context from its actual necessity. Our experiments show that: (1) ArkTS completion consistently underperforms mainstream languages under our experimental settings, suggesting language-specific challenges associated with this emerging language; (2) open-source 7B models achieve performance comparable to close-source models; (3) cross-file context is a double-edged sword, with sparse retrieval(Jaccard) outperforming dense methods on ArkTS; and (4) HarmonyOS API documentation consistently improves performance, suggesting the benefits of domain-specific enhancements in low-resource settings.

Anthology ID:: 2026.findings-acl.969
Volume:: Findings of the Association for Computational Linguistics: ACL 2026
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:: Findings
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 19409–19429
Language:
URL:: https://aclanthology.org/2026.findings-acl.969/
DOI:
Bibkey:
Cite (ACL):: Yanlin Wang, Bowen Zhang, Yanli Wang, Daya Guo, Terry Yue Zhuo, Jiachi Chen, Mingwei Liu, Xingong Zhang, and Zibin Zheng. 2026. ArkRepoBench: A Repository-Level Code Completion Benchmark for HarmonyOS Development. In Findings of the Association for Computational Linguistics: ACL 2026, pages 19409–19429, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: ArkRepoBench: A Repository-Level Code Completion Benchmark for HarmonyOS Development (Wang et al., Findings 2026)
Copy Citation:
PDF:: https://aclanthology.org/2026.findings-acl.969.pdf
Checklist:: 2026.findings-acl.969.checklist.pdf

PDF Cite Search Checklist Fix data