Pretraining on Interactions for Learning Grounded Affordance Representations

Jack Merullo, Dylan Ebert, Carsten Eickhoff, Ellie Pavlick


Abstract
Lexical semantics and cognitive science point to affordances (i.e. the actions that objects support) as critical for understanding and representing nouns and verbs. However, study of these semantic features has not yet been integrated with the “foundation” models that currently dominate language representation research. We hypothesize that predictive modeling of object state over time will result in representations that encode object affordance information “for free”. We train a neural network to predict objects’ trajectories in a simulated interaction and show that our network’s latent representations differentiate between both observed and unobserved affordances. We find that models trained using 3D simulations outperform conventional 2D computer vision models trained on a similar task, and, on initial inspection, that differences between concepts correspond to expected features (e.g., roll entails rotation). Our results suggest a way in which modern deep learning approaches to grounded language learning can be integrated with traditional formal semantic notions of lexical representations.
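The pretraining idea described above — predict an object's future state from its current state, then reuse the network's hidden layer as a representation — can be illustrated with a minimal sketch. This is not the authors' implementation: the trajectories here are synthetic 2D constant-velocity paths rather than 3D simulated interactions, and the model is a tiny one-hidden-layer MLP; all names and sizes are illustrative assumptions.

```python
# Illustrative sketch (not the paper's code): pretrain a small network to
# predict an object's next state from its current state; the hidden-layer
# activations then serve as a latent representation of the object's dynamics.
import numpy as np

rng = np.random.default_rng(0)

def make_trajectories(n=64, T=20):
    # Synthetic stand-in for simulated interactions: constant-velocity
    # 2D trajectories, state = (x, y) position.  Shape: (n, T, 2).
    pos0 = rng.normal(size=(n, 2))
    vel = rng.normal(scale=0.1, size=(n, 2))
    steps = np.arange(T)[None, :, None]
    return pos0[:, None, :] + vel[:, None, :] * steps

def train_predictor(traj, hidden=8, lr=0.05, epochs=200):
    # One-hidden-layer MLP trained by full-batch gradient descent on the
    # next-state prediction objective: state_t -> state_{t+1}.
    X = traj[:, :-1, :].reshape(-1, 2)   # inputs: states at t
    Y = traj[:, 1:, :].reshape(-1, 2)    # targets: states at t+1
    W1 = rng.normal(scale=0.1, size=(2, hidden))
    W2 = rng.normal(scale=0.1, size=(hidden, 2))
    losses = []
    for _ in range(epochs):
        H = np.tanh(X @ W1)              # latent representation
        pred = H @ W2
        err = pred - Y
        losses.append(float((err ** 2).mean()))
        # Backpropagation for the two-layer MLP with MSE loss.
        dW2 = H.T @ err / len(X)
        dH = (err @ W2.T) * (1 - H ** 2)
        dW1 = X.T @ dH / len(X)
        W1 -= lr * dW1
        W2 -= lr * dW2
    return W1, W2, losses

traj = make_trajectories()
W1, W2, losses = train_predictor(traj)
print(f"prediction loss: {losses[0]:.4f} -> {losses[-1]:.4f}")
```

In the paper's setting, representations analogous to `np.tanh(X @ W1)` are probed for affordance information (e.g., whether an object can roll); here the sketch only shows that the self-supervised next-state objective alone drives learning, with no affordance labels used during training.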
Anthology ID:
2022.starsem-1.23
Volume:
Proceedings of the 11th Joint Conference on Lexical and Computational Semantics
Month:
July
Year:
2022
Address:
Seattle, Washington
Editors:
Vivi Nastase, Ellie Pavlick, Mohammad Taher Pilehvar, Jose Camacho-Collados, Alessandro Raganato
Venue:
*SEM
SIG:
SIGLEX
Publisher:
Association for Computational Linguistics
Pages:
258–277
URL:
https://aclanthology.org/2022.starsem-1.23
DOI:
10.18653/v1/2022.starsem-1.23
Cite (ACL):
Jack Merullo, Dylan Ebert, Carsten Eickhoff, and Ellie Pavlick. 2022. Pretraining on Interactions for Learning Grounded Affordance Representations. In Proceedings of the 11th Joint Conference on Lexical and Computational Semantics, pages 258–277, Seattle, Washington. Association for Computational Linguistics.
Cite (Informal):
Pretraining on Interactions for Learning Grounded Affordance Representations (Merullo et al., *SEM 2022)
PDF:
https://aclanthology.org/2022.starsem-1.23.pdf
Code:
jmerullo/affordances