Indic-TunedLens: Interpreting Multilingual Models in Indian Languages

Mihir Panchal; Deeksha Varshney; Mamta; Asif Ekbal

Indic-TunedLens: Interpreting Multilingual Models in Indian Languages

Mihir Panchal, Deeksha Varshney, Mamta ., Asif Ekbal

Abstract

Multilingual large language models (LLMs) are increasingly deployed in linguistically diverse regions like India, yet most interpretability tools remain tailored to English. Prior work reveals that LLMs often operate in English centric representation spaces, making cross lingual interpretability a pressing concern. We introduce Indic-TunedLens, a novel interpretability framework specifically for Indian languages that learns shared affine transformations. Unlike the standard Logit Lens, which directly decodes intermediate activations, Indic-TunedLens adjusts hidden states for each target language, aligning them with the target output distributions to enable more faithful decoding of model representations. We evaluate our framework on 10 Indian languages using the MMLU benchmark and find that it significantly improves over SOTA interpretability methods, especially for morphologically rich, low resource languages. Our results provide crucial insights into the layer-wise semantic encoding of multilingual transformers. Our model is available at https://huggingface.co/spaces/MihirRajeshPanchal/IndicTunedLens. Our code is available at https://github.com/MihirRajeshPanchal/IndicTunedLens.

Anthology ID:: 2026.vardial-1.14
Volume:: Proceedings of the 13th Workshop on NLP for Similar Languages, Varieties and Dialects
Month:: March
Year:: 2026
Address:: Rabat, Morocco
Venues:: VarDial | WS
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 172–185
Language:
URL:: https://aclanthology.org/2026.vardial-1.14/
DOI:
Bibkey:
Cite (ACL):: Mihir Panchal, Deeksha Varshney, Mamta ., and Asif Ekbal. 2026. Indic-TunedLens: Interpreting Multilingual Models in Indian Languages. In Proceedings of the 13th Workshop on NLP for Similar Languages, Varieties and Dialects, pages 172–185, Rabat, Morocco. Association for Computational Linguistics.
Cite (Informal):: Indic-TunedLens: Interpreting Multilingual Models in Indian Languages (Panchal et al., VarDial 2026)
Copy Citation:
PDF:: https://aclanthology.org/2026.vardial-1.14.pdf

PDF Cite Search Fix data