Cultural Concept Adaptation on Multimodal Reasoning

Zhi Li; Yin Zhang

doi:10.18653/v1/2023.emnlp-main.18

Cultural Concept Adaptation on Multimodal Reasoning

Abstract

Developing cultural adaptation methods is important, which can improve the model performance on the low-resource ones and provide more equitable opportunities for everyone to benefit from advanced technology. Past methods primarily focused on multilingual and multimodal capabilities, and the improvement of multicultural competence is still an unexplored problem. This is largely due to the difficulty of data scarcity and expensive annotation. In this paper, we navigate this uncharted territory by leveraging high-resource cultures to facilitate comprehension of low-resource ones. We first introduce an annotation-free method for cultural-concept adaptation and construct a concept mapping set. To facilitate the model’s comprehension of cultural-concept mappings, we propose a new multimodal data augmentation called CultureMixup. This approach employs a three-tier code-switching strategy on textual sentences. Additionally, it uses a cultural concept-based mixup method for the images. This combination effectively generates new data instances across culture, phrase, word, and image levels. For visually grounded reasoning across languages and cultures, experimental results on five languages show that our method consistently improves performance for four existing multilingual and multimodal models on both zero-shot and few-shot settings.

Anthology ID:: 2023.emnlp-main.18
Volume:: Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing
Month:: December
Year:: 2023
Address:: Singapore
Editors:: Houda Bouamor, Juan Pino, Kalika Bali
Venue:: EMNLP
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 262–276
Language:
URL:: https://aclanthology.org/2023.emnlp-main.18/
DOI:: 10.18653/v1/2023.emnlp-main.18
Bibkey:
Cite (ACL):: Zhi Li and Yin Zhang. 2023. Cultural Concept Adaptation on Multimodal Reasoning. In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, pages 262–276, Singapore. Association for Computational Linguistics.
Cite (Informal):: Cultural Concept Adaptation on Multimodal Reasoning (Li & Zhang, EMNLP 2023)
Copy Citation:
PDF:: https://aclanthology.org/2023.emnlp-main.18.pdf
Video:: https://aclanthology.org/2023.emnlp-main.18.mp4

PDF Cite Search Video Fix data