Investigating More Explainable and Partition-Free Compositionality Estimation for LLMs: A Rule-Generation Perspective

Ziyao Xu; Cong Wang; Houfeng Wang

Investigating More Explainable and Partition-Free Compositionality Estimation for LLMs: A Rule-Generation Perspective

Abstract

Compositional generalization tests are often used to estimate the compositionality of LLMs. However, such tests have the following limitations: (1) they only focus on the output results without considering LLMs’ understanding of sample compositionality, resulting in explainability defects; (2) they rely on dataset partition to form the test set with combinations unseen in the training set, suffering from combination leakage issues. In this work, we propose a novel rule-generation perspective for compositionality estimation for LLMs. It requires LLMs to generate a program as rules for dataset mapping and provides estimates of the compositionality of LLMs using complexity-based theory. The perspective addresses the limitations of compositional generalization tests and provides a new way to analyze the compositionality characterization of LLMs. We conduct experiments and analysis of existing advanced LLMs based on this perspective on a string-to-grid task, and find various compositionality characterizations and compositionality deficiencies exhibited by LLMs.

Anthology ID:: 2026.acl-long.409
Volume:: Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:: ACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 9043–9060
Language:
URL:: https://aclanthology.org/2026.acl-long.409/
DOI:
Bibkey:
Cite (ACL):: Ziyao Xu, Cong Wang, and Houfeng Wang. 2026. Investigating More Explainable and Partition-Free Compositionality Estimation for LLMs: A Rule-Generation Perspective. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 9043–9060, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: Investigating More Explainable and Partition-Free Compositionality Estimation for LLMs: A Rule-Generation Perspective (Xu et al., ACL 2026)
Copy Citation:
PDF:: https://aclanthology.org/2026.acl-long.409.pdf
Checklist:: 2026.acl-long.409.checklist.pdf

PDF Cite Search Checklist Fix data