Mechanistic Analysis Of Universality: Numerical Comparison Circuits Across Transformer Architectures

Arya Bhardia; Julian Ramirez; Siddhanta Verma; Karen Mkrtchyan

Mechanistic Analysis Of Universality: Numerical Comparison Circuits Across Transformer Architectures

Arya Bhardia, Julian Ramirez, Siddhanta Verma, Karen Mkrtchyan

Abstract

Transformer language models reliably achieve high accuracy on many reasoning tasks; however, their internal mechanisms are not fully understood. Mechanistic interpretability seeks to remedy this gap by identifying task circuits within individual models, but it is unclear whether such circuits generalize across model families and scales. In this work, we study the universality of circuits through the lens of numerical comparisons, a simple and controlled task that enables clean and causal interventions. We conduct experiments on a set of transformer models spanning different families and sizes from 1.7b to 9b parameters. We find that models within the Qwen family exhibit a highly consistent circuit structure across architecture and scale, featuring localized attention heads that write a task relevant signal. In contrast, models from other families show qualitatively different implementations, where task relevant information emerges much earlier and is distributed across components as opposed to being concentrated within a small set of attention heads. These results serve as evidence that task behavior similarities do not imply mechanistic universality and highlight the necessity for cross model comparisons to claim generalization of internal circuits.

Anthology ID:: 2026.acl-srw.84
Volume:: Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (ACL 2026)
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Santosh T.Y.S.S., Juan Diego Rodriguez, Ona de Gibert
Venue:: ACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 951–967
Language:
URL:: https://aclanthology.org/2026.acl-srw.84/
DOI:
Bibkey:
Cite (ACL):: Arya Bhardia, Julian Ramirez, Siddhanta Verma, and Karen Mkrtchyan. 2026. Mechanistic Analysis Of Universality: Numerical Comparison Circuits Across Transformer Architectures. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (ACL 2026), pages 951–967, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: Mechanistic Analysis Of Universality: Numerical Comparison Circuits Across Transformer Architectures (Bhardia et al., ACL 2026)
Copy Citation:
PDF:: https://aclanthology.org/2026.acl-srw.84.pdf

PDF Cite Search Fix data