LexSym: Compositionality as Lexical Symmetry

Ekin Akyurek, Jacob Andreas


Abstract
In tasks like semantic parsing, instruction following, and question answering, standard deep networks fail to generalize compositionally from small datasets. Many existing approaches overcome this limitation with model architectures that enforce a compositional process of sentence interpretation. In this paper, we present a domain-general and model-agnostic formulation of compositionality as a constraint on symmetries of data distributions rather than models. Informally, we prove that whenever a task can be solved by a compositional model, there is a corresponding data augmentation scheme — a procedure for transforming examples into other well-formed examples — that imparts compositional inductive bias on any model trained to solve the same task. We describe a procedure called LexSym that discovers these transformations automatically, then applies them to training data for ordinary neural sequence models. Unlike existing compositional data augmentation procedures, LexSym can be deployed agnostically across text, structured data, and even images. It matches or surpasses state-of-the-art, task-specific models on COGS semantic parsing, SCAN and Alchemy instruction following, and CLEVR-CoGenT visual question answering datasets.
Anthology ID:
2023.acl-long.38
Volume:
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:
July
Year:
2023
Address:
Toronto, Canada
Editors:
Anna Rogers, Jordan Boyd-Graber, Naoaki Okazaki
Venue:
ACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
639–657
Language:
URL:
https://aclanthology.org/2023.acl-long.38
DOI:
10.18653/v1/2023.acl-long.38
Award:
 Area Chair Award (Semantics: Lexical)
Bibkey:
Cite (ACL):
Ekin Akyurek and Jacob Andreas. 2023. LexSym: Compositionality as Lexical Symmetry. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 639–657, Toronto, Canada. Association for Computational Linguistics.
Cite (Informal):
LexSym: Compositionality as Lexical Symmetry (Akyurek & Andreas, ACL 2023)
Copy Citation:
PDF:
https://aclanthology.org/2023.acl-long.38.pdf