Zouying Cao
2024
Head-wise Shareable Attention for Large Language Models
Zouying Cao
|
Yifei Yang
|
Hai Zhao
Findings of the Association for Computational Linguistics: EMNLP 2024
LaCo: Large Language Model Pruning via Layer Collapse
Yifei Yang
|
Zouying Cao
|
Hai Zhao
Findings of the Association for Computational Linguistics: EMNLP 2024
Search