CASS: Nvidia to AMD Transpilation with Data, Models, and Benchmark

Ahmed Heakl; Gustavo Bertolo Stahl; Sarim Hashmi; Seung Hun Eddie Han; Mukul Ranjan; Arina Kharlamova; Salman Khan; Abdulrahman Mahmoud

Abstract

Cross-architecture GPU code transpilation is essential for unlocking low-level hardware portability, yet no scalable solution exists. We introduce CASS, the first dataset and model suite for source- and assembly-level GPU translation (CUDA ↔ HIP, SASS ↔ RDNA3). CASS contains 60k verified host-device code pairs, enabling learning-based translation across both ISA and runtime boundaries. We generate each sample using our automated pipeline that scrapes, translates, compiles, and aligns GPU programs across vendor stacks. Leveraging CASS, we train a suite of domain-specific translation models that achieve 88.2% accuracy on CUDA → HIP and 69.1% on SASS → RDNA3, outperforming commercial baselines including GPT-5.1, Claude-4.5, and Hipify by wide margins. Generated code matches native performance in 85% of cases, preserving both runtime and memory behavior. To support rigorous evaluation, we introduce CASS-Bench, a curated benchmark spanning 18 GPU domains with ground-truth execution. All data, models, and evaluation tools will be released as open source to support progress in GPU compiler tooling, binary compatibility, and LLM-guided code translation.

Anthology ID: 2026.acl-long.1592
Volume: Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month: July
Year: 2026
Address: San Diego, California, United States
Editors: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue: ACL
Publisher: Association for Computational Linguistics
Pages: 34489–34508
URL: https://aclanthology.org/2026.acl-long.1592/
Cite (ACL): Ahmed Heakl, Gustavo Bertolo Stahl, Sarim Hashmi, Seung Hun Eddie Han, Mukul Ranjan, Arina Kharlamova, Salman Khan, and Abdulrahman Mahmoud. 2026. CASS: Nvidia to AMD Transpilation with Data, Models, and Benchmark. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 34489–34508, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal): CASS: Nvidia to AMD Transpilation with Data, Models, and Benchmark (Heakl et al., ACL 2026)
PDF: https://aclanthology.org/2026.acl-long.1592.pdf
Checklist: 2026.acl-long.1592.checklist.pdf
