Lightweight Haar Wavelet Subband Pruning for LLMs

Jiang Li; Pengfei Cao (鹏飞 曹); Chenxi Zhou; Tian Lan; Xiangdong Su; Kang Liu; Jun Zhao; Guanglai Gao

Lightweight Haar Wavelet Subband Pruning for LLMs

Jiang Li, Pengfei Cao, Chenxi Zhou, Tian Lan, Xiangdong Su, Kang Liu, Jun Zhao, Guanglai Gao

Abstract

Large language models (LLMs) reach state-of-the-art performance across many NLP tasks, but their large parameter counts introduce heavy computational and memory overhead, which complicates deployment in resource-constrained settings. Pruning is a standard compression strategy that induces sparsity to lower these costs. However, most pruning methods for LLMs depend on calibration data and expensive weight updates, which limits practical scalability. To address these limitations, we introduce Haar Wavelet Subband Pruning (), a post-training framework that requires no calibration data and no weight updates. applies a two-dimensional Haar wavelet transform to each weight matrix and decomposes it into four frequency subbands. It then assigns a uniform sparsity ratio to all subbands so that both low- and high-frequency components are retained in a balanced manner. Our theoretical analysis shows that the subband design of provides a deterministic per-subband retention guarantee, which helps mitigate the potential bias of global magnitude pruning toward dominant frequency components. Experiments on the LLaMA, OPT and Qwen model families show that achieves competitive accuracy relative to strong pruning baselines while substantially reducing pruning time. Compared with magnitude pruning, which serves as a simple calibration-free baseline, generally achieves better downstream performance across a wide range of sparsity levels and model scales.

Anthology ID:: 2026.findings-acl.798
Volume:: Findings of the Association for Computational Linguistics: ACL 2026
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:: Findings
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 16242–16259
Language:
URL:: https://aclanthology.org/2026.findings-acl.798/
DOI:
Bibkey:
Cite (ACL):: Jiang Li, Pengfei Cao, Chenxi Zhou, Tian Lan, Xiangdong Su, Kang Liu, Jun Zhao, and Guanglai Gao. 2026. Lightweight Haar Wavelet Subband Pruning for LLMs. In Findings of the Association for Computational Linguistics: ACL 2026, pages 16242–16259, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: Lightweight Haar Wavelet Subband Pruning for LLMs (Li et al., Findings 2026)
Copy Citation:
PDF:: https://aclanthology.org/2026.findings-acl.798.pdf
Checklist:: 2026.findings-acl.798.checklist.pdf

PDF Cite Search Checklist Fix data