@inproceedings{saritas-yildiz-2025-reproduction,
title = "A Reproduction Study: The Kernel {PCA} Interpretation of Self-Attention Fails Under Scrutiny",
author = "Sar{\i}ta{\c{s}}, Karahan and
Y{\i}ld{\i}z, {\c{C}}a{\u{g}}atay",
editor = "Zhao, Jin and
Wang, Mingyang and
Liu, Zhu",
booktitle = "Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 4: Student Research Workshop)",
month = jul,
year = "2025",
address = "Vienna, Austria",
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/2025.acl-srw.11/",
doi = "10.18653/v1/2025.acl-srw.11",
pages = "173--185",
ISBN = "979-8-89176-254-1",
abstract = "In this reproduction study, we revisit recent claims that self-attention implements kernel principal component analysis (KPCA) (Teo and Nguyen, 2024), positing that (i) value vectors $V$ capture the eigenvectors of the Gram matrix of the keys, and (ii) that self-attention projects queries onto the principal component axes of the key matrix $K$ in a feature space. Our analysis reveals three critical inconsistencies: (1) No alignment exists between learned self-attention value vectors and what is proposed in the KPCA perspective, with average similarity metrics (optimal cosine similarity $\leq 0.32$, linear CKA (Centered Kernel Alignment) $\leq 0.11$, kernel CKA $\leq 0.32$) indicating negligible correspondence; (2) Reported decreases in reconstruction loss $J_\text{proj}$, arguably justifying the claim that the self-attentionminimizes the projection error of KPCA, are misinterpreted, as the quantities involved differ by orders of magnitude ($\sim 10^3$); (3) Gram matrix eigenvalue statistics, introduced to justify that $V$ captures the eigenvector of the gram matrix, are irreproducible without undocumented implementation-specific adjustments. Across 10 transformer architectures, we conclude that the KPCA interpretation of self-attention lacks empirical support."
}
<?xml version="1.0" encoding="UTF-8"?>
<modsCollection xmlns="http://www.loc.gov/mods/v3">
<mods ID="saritas-yildiz-2025-reproduction">
<titleInfo>
<title>A Reproduction Study: The Kernel PCA Interpretation of Self-Attention Fails Under Scrutiny</title>
</titleInfo>
<name type="personal">
<namePart type="given">Karahan</namePart>
<namePart type="family">Sarıtaş</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Çağatay</namePart>
<namePart type="family">Yıldız</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<originInfo>
<dateIssued>2025-07</dateIssued>
</originInfo>
<typeOfResource>text</typeOfResource>
<relatedItem type="host">
<titleInfo>
<title>Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 4: Student Research Workshop)</title>
</titleInfo>
<name type="personal">
<namePart type="given">Jin</namePart>
<namePart type="family">Zhao</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Mingyang</namePart>
<namePart type="family">Wang</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Zhu</namePart>
<namePart type="family">Liu</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<originInfo>
<publisher>Association for Computational Linguistics</publisher>
<place>
<placeTerm type="text">Vienna, Austria</placeTerm>
</place>
</originInfo>
<genre authority="marcgt">conference publication</genre>
<identifier type="isbn">979-8-89176-254-1</identifier>
</relatedItem>
<abstract>In this reproduction study, we revisit recent claims that self-attention implements kernel principal component analysis (KPCA) (Teo and Nguyen, 2024), positing that (i) value vectors V capture the eigenvectors of the Gram matrix of the keys, and (ii) self-attention projects queries onto the principal component axes of the key matrix K in a feature space. Our analysis reveals three critical inconsistencies: (1) No alignment exists between learned self-attention value vectors and what is proposed in the KPCA perspective, with average similarity metrics (optimal cosine similarity ≤ 0.32, linear CKA (Centered Kernel Alignment) ≤ 0.11, kernel CKA ≤ 0.32) indicating negligible correspondence; (2) Reported decreases in reconstruction loss J_proj, arguably justifying the claim that self-attention minimizes the projection error of KPCA, are misinterpreted, as the quantities involved differ by orders of magnitude (~10³); (3) Gram matrix eigenvalue statistics, introduced to justify that V captures the eigenvectors of the Gram matrix, are irreproducible without undocumented implementation-specific adjustments. Across 10 transformer architectures, we conclude that the KPCA interpretation of self-attention lacks empirical support.</abstract>
<identifier type="citekey">saritas-yildiz-2025-reproduction</identifier>
<identifier type="doi">10.18653/v1/2025.acl-srw.11</identifier>
<location>
<url>https://aclanthology.org/2025.acl-srw.11/</url>
</location>
<part>
<date>2025-07</date>
<extent unit="page">
<start>173</start>
<end>185</end>
</extent>
</part>
</mods>
</modsCollection>
%0 Conference Proceedings
%T A Reproduction Study: The Kernel PCA Interpretation of Self-Attention Fails Under Scrutiny
%A Sarıtaş, Karahan
%A Yıldız, Çağatay
%Y Zhao, Jin
%Y Wang, Mingyang
%Y Liu, Zhu
%S Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 4: Student Research Workshop)
%D 2025
%8 July
%I Association for Computational Linguistics
%C Vienna, Austria
%@ 979-8-89176-254-1
%F saritas-yildiz-2025-reproduction
%X In this reproduction study, we revisit recent claims that self-attention implements kernel principal component analysis (KPCA) (Teo and Nguyen, 2024), positing that (i) value vectors V capture the eigenvectors of the Gram matrix of the keys, and (ii) self-attention projects queries onto the principal component axes of the key matrix K in a feature space. Our analysis reveals three critical inconsistencies: (1) No alignment exists between learned self-attention value vectors and what is proposed in the KPCA perspective, with average similarity metrics (optimal cosine similarity ≤ 0.32, linear CKA (Centered Kernel Alignment) ≤ 0.11, kernel CKA ≤ 0.32) indicating negligible correspondence; (2) Reported decreases in reconstruction loss J_proj, arguably justifying the claim that self-attention minimizes the projection error of KPCA, are misinterpreted, as the quantities involved differ by orders of magnitude (~10³); (3) Gram matrix eigenvalue statistics, introduced to justify that V captures the eigenvectors of the Gram matrix, are irreproducible without undocumented implementation-specific adjustments. Across 10 transformer architectures, we conclude that the KPCA interpretation of self-attention lacks empirical support.
%R 10.18653/v1/2025.acl-srw.11
%U https://aclanthology.org/2025.acl-srw.11/
%U https://doi.org/10.18653/v1/2025.acl-srw.11
%P 173-185
Markdown (Informal)
[A Reproduction Study: The Kernel PCA Interpretation of Self-Attention Fails Under Scrutiny](https://aclanthology.org/2025.acl-srw.11/) (Sarıtaş & Yıldız, ACL 2025)
ACL
Karahan Sarıtaş and Çağatay Yıldız. 2025. A Reproduction Study: The Kernel PCA Interpretation of Self-Attention Fails Under Scrutiny. In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 4: Student Research Workshop), pages 173–185, Vienna, Austria. Association for Computational Linguistics.
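
For readers who want to probe the abstract's alignment claim themselves, below is a minimal sketch of linear CKA (Centered Kernel Alignment), one of the similarity metrics the paper reports (linear CKA ≤ 0.11 between learned value vectors and their KPCA-predicted counterparts). This is an illustrative implementation of the standard metric (Kornblith et al., 2019), not the authors' released code; the matrix names and shapes are hypothetical.

```python
import numpy as np

def linear_cka(X: np.ndarray, Y: np.ndarray) -> float:
    """Linear CKA between two representation matrices of shape (n_samples, dim)."""
    # Center each feature dimension.
    X = X - X.mean(axis=0, keepdims=True)
    Y = Y - Y.mean(axis=0, keepdims=True)
    # ||Y^T X||_F^2 / (||X^T X||_F * ||Y^T Y||_F)
    cross = np.linalg.norm(Y.T @ X, "fro") ** 2
    norm_x = np.linalg.norm(X.T @ X, "fro")
    norm_y = np.linalg.norm(Y.T @ Y, "fro")
    return cross / (norm_x * norm_y)

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    V_learned = rng.standard_normal((197, 64))  # hypothetical value vectors of one head
    V_kpca = rng.standard_normal((197, 64))     # hypothetical KPCA-derived counterpart
    # Unrelated random matrices give a CKA near 0; identical matrices give 1.
    print(f"linear CKA: {linear_cka(V_learned, V_kpca):.3f}")
```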