CoopQ: Cooperative Game Inspired Layerwise Mixed Precision Quantization for LLMs

Junchen Zhao; Ali Derakhshan; Jayden Hyman; Junhao Dong; Sangeetha Abdu Jyothi; Ian Harris

CoopQ: Cooperative Game Inspired Layerwise Mixed Precision Quantization for LLMs

Junchen Zhao, Ali Derakhshan, Jayden Hyman, Junhao Dong, Sangeetha Abdu Jyothi, Ian Harris

Abstract

Large Language Models (LLMs) promise impressive capabilities, yet their multi-billion parameter scale makes on-device or low-resource deployment prohibitive. Mixed precision quantization offers a compelling solution, but existing methods struggle when the average precision drops below four bits, as they rely on isolated, layer-specific metrics that overlook critical inter-layer interactions affecting overall performance. To address these limitations, we first frame the mixed-precision quantization problem as a cooperative game among layers and introduce Shapley-based Progressive Quantization Estimation (SPQE) to efficiently obtain accurate Shapley estimates of layer sensitivities and inter-layer interactions. Leveraging the SPQE estimates, we propose Cooperative Game Inspired Mixed-Precision Quantization (CoopQ) which translates these Shapley estimates into a binary quadratic optimization formulation, assigning either 2 or 4-bit precision to layers under strict memory constraints. Comprehensive experiments conducted on Llama-3, Gemma-2, and Qwen models across three independent PTQ backends (Quanto, HQQ, GPTQ) demonstrate CoopQ’s scalability and consistently superior performance compared to methods relying solely on isolated metrics. Across average precisions spanning 4 bit down to 2 bit, CoopQ cuts Perplexity by 20 – 80 % relative to the best baseline, with the margin growing as the bit-width tightens.

Anthology ID:: 2026.findings-acl.373
Volume:: Findings of the Association for Computational Linguistics: ACL 2026
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:: Findings
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 7566–7578
Language:
URL:: https://aclanthology.org/2026.findings-acl.373/
DOI:
Bibkey:
Cite (ACL):: Junchen Zhao, Ali Derakhshan, Jayden Hyman, Junhao Dong, Sangeetha Abdu Jyothi, and Ian Harris. 2026. CoopQ: Cooperative Game Inspired Layerwise Mixed Precision Quantization for LLMs. In Findings of the Association for Computational Linguistics: ACL 2026, pages 7566–7578, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: CoopQ: Cooperative Game Inspired Layerwise Mixed Precision Quantization for LLMs (Zhao et al., Findings 2026)
Copy Citation:
PDF:: https://aclanthology.org/2026.findings-acl.373.pdf
Checklist:: 2026.findings-acl.373.checklist.pdf

PDF Cite Search Checklist Fix data