Grounded and well-rounded: a methodological approach to the study of cross-modal and cross-lingual grounding

Timothee Mickus, Elaine Zosa, Denis Paperno


Abstract
Grounding has been argued to be a crucial component towards the development of more complete and truly semantically competent artificial intelligence systems. Literature has divided into two camps: While some argue that grounding allows for qualitatively different generalizations, others believe it can be compensated by mono-modal data quantity. Limited empirical evidence has emerged for or against either position, which we argue is due to the methodological challenges that come with studying grounding and its effects on NLP systems. In this paper, we establish a methodological framework for studying what the effects are—if any—of providing models with richer input sources than text-only. The crux of it lies in the construction of comparable samples of populations of models trained on different input modalities, so that we can tease apart the qualitative effects of different input sources from quantifiable model performances. Experiments using this framework reveal qualitative differences in model behavior between cross-modally grounded, cross-lingually grounded, and ungrounded models, which we measure both at a global dataset level as well as for specific word representations, depending on how concrete their semantics is.
Anthology ID:
2023.findings-emnlp.736
Volume:
Findings of the Association for Computational Linguistics: EMNLP 2023
Month:
December
Year:
2023
Address:
Singapore
Editors:
Houda Bouamor, Juan Pino, Kalika Bali
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
11031–11042
Language:
URL:
https://aclanthology.org/2023.findings-emnlp.736
DOI:
10.18653/v1/2023.findings-emnlp.736
Bibkey:
Cite (ACL):
Timothee Mickus, Elaine Zosa, and Denis Paperno. 2023. Grounded and well-rounded: a methodological approach to the study of cross-modal and cross-lingual grounding. In Findings of the Association for Computational Linguistics: EMNLP 2023, pages 11031–11042, Singapore. Association for Computational Linguistics.
Cite (Informal):
Grounded and well-rounded: a methodological approach to the study of cross-modal and cross-lingual grounding (Mickus et al., Findings 2023)
Copy Citation:
PDF:
https://aclanthology.org/2023.findings-emnlp.736.pdf