@inproceedings{yin-etal-2025-feature,
title = "Feature Decomposition-Augmentation Network for Multimodal Sentiment Analysis",
author = "Yin, Dapeng and
Hou, Bingxuan and
Gao, Mengna and
Zhu, Shuyue and
Wang, Junli",
editor = "Inui, Kentaro and
Sakti, Sakriani and
Wang, Haofen and
Wong, Derek F. and
Bhattacharyya, Pushpak and
Banerjee, Biplab and
Ekbal, Asif and
Chakraborty, Tanmoy and
Singh, Dhirendra Pratap",
booktitle = "Proceedings of the 14th International Joint Conference on Natural Language Processing and the 4th Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics",
month = dec,
year = "2025",
address = "Mumbai, India",
publisher = "The Asian Federation of Natural Language Processing and The Association for Computational Linguistics",
url = "https://aclanthology.org/2025.ijcnlp-long.6/",
pages = "86--98",
ISBN = "979-8-89176-298-5",
abstract = "Multimodal sentiment analysis identifies human emotional tendencies by analyzing text, visual, and auditory modalities. In most studies, the textual modality is usually considered to contain the most emotional information and is regarded as the dominant modality. Existing methods mostly map auxiliary modalities into a semantic space close to the dominant modality, which overly relies on the dominant modality. In this work, we propose a Feature Decomposition-Augmentation (FeaDA) framework, which aims to elevate the role of auxiliary modalities in multimodal data fusion. We first design a projector to decompose auxiliary modalities into partial features, which contain features for emotion judgment, and then utilize these decomposed features to guide the fusion process with KL loss, thereby enhancing the status of auxiliary modality fusion. To verify the effectiveness of our method, we conducted experiments on the CMU-MOSI, CMU-MOSEI, and CH-SIMS datasets. The experimental results show that our FeaDA framework outperforms mutilmodal sentiment analysis methods of the same type in main metrics. Our code is available at https://github.com/PowerLittleYin/FeaDA-main."
}<?xml version="1.0" encoding="UTF-8"?>
<modsCollection xmlns="http://www.loc.gov/mods/v3">
<mods ID="yin-etal-2025-feature">
<titleInfo>
<title>Feature Decomposition-Augmentation Network for Multimodal Sentiment Analysis</title>
</titleInfo>
<name type="personal">
<namePart type="given">Dapeng</namePart>
<namePart type="family">Yin</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Bingxuan</namePart>
<namePart type="family">Hou</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Mengna</namePart>
<namePart type="family">Gao</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Shuyue</namePart>
<namePart type="family">Zhu</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Junli</namePart>
<namePart type="family">Wang</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<originInfo>
<dateIssued>2025-12</dateIssued>
</originInfo>
<typeOfResource>text</typeOfResource>
<relatedItem type="host">
<titleInfo>
<title>Proceedings of the 14th International Joint Conference on Natural Language Processing and the 4th Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics</title>
</titleInfo>
<name type="personal">
<namePart type="given">Kentaro</namePart>
<namePart type="family">Inui</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Sakriani</namePart>
<namePart type="family">Sakti</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Haofen</namePart>
<namePart type="family">Wang</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Derek</namePart>
<namePart type="given">F</namePart>
<namePart type="family">Wong</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Pushpak</namePart>
<namePart type="family">Bhattacharyya</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Biplab</namePart>
<namePart type="family">Banerjee</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Asif</namePart>
<namePart type="family">Ekbal</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Tanmoy</namePart>
<namePart type="family">Chakraborty</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Dhirendra</namePart>
<namePart type="given">Pratap</namePart>
<namePart type="family">Singh</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<originInfo>
<publisher>The Asian Federation of Natural Language Processing and The Association for Computational Linguistics</publisher>
<place>
<placeTerm type="text">Mumbai, India</placeTerm>
</place>
</originInfo>
<genre authority="marcgt">conference publication</genre>
<identifier type="isbn">979-8-89176-298-5</identifier>
</relatedItem>
<abstract>Multimodal sentiment analysis identifies human emotional tendencies by analyzing text, visual, and auditory modalities. In most studies, the textual modality is usually considered to contain the most emotional information and is regarded as the dominant modality. Existing methods mostly map auxiliary modalities into a semantic space close to the dominant modality, which overly relies on the dominant modality. In this work, we propose a Feature Decomposition-Augmentation (FeaDA) framework, which aims to elevate the role of auxiliary modalities in multimodal data fusion. We first design a projector to decompose auxiliary modalities into partial features, which contain features for emotion judgment, and then utilize these decomposed features to guide the fusion process with KL loss, thereby enhancing the status of auxiliary modality fusion. To verify the effectiveness of our method, we conducted experiments on the CMU-MOSI, CMU-MOSEI, and CH-SIMS datasets. The experimental results show that our FeaDA framework outperforms mutilmodal sentiment analysis methods of the same type in main metrics. Our code is available at https://github.com/PowerLittleYin/FeaDA-main.</abstract>
<identifier type="citekey">yin-etal-2025-feature</identifier>
<location>
<url>https://aclanthology.org/2025.ijcnlp-long.6/</url>
</location>
<part>
<date>2025-12</date>
<extent unit="page">
<start>86</start>
<end>98</end>
</extent>
</part>
</mods>
</modsCollection>
%0 Conference Proceedings
%T Feature Decomposition-Augmentation Network for Multimodal Sentiment Analysis
%A Yin, Dapeng
%A Hou, Bingxuan
%A Gao, Mengna
%A Zhu, Shuyue
%A Wang, Junli
%Y Inui, Kentaro
%Y Sakti, Sakriani
%Y Wang, Haofen
%Y Wong, Derek F.
%Y Bhattacharyya, Pushpak
%Y Banerjee, Biplab
%Y Ekbal, Asif
%Y Chakraborty, Tanmoy
%Y Singh, Dhirendra Pratap
%S Proceedings of the 14th International Joint Conference on Natural Language Processing and the 4th Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics
%D 2025
%8 December
%I The Asian Federation of Natural Language Processing and The Association for Computational Linguistics
%C Mumbai, India
%@ 979-8-89176-298-5
%F yin-etal-2025-feature
%X Multimodal sentiment analysis identifies human emotional tendencies by analyzing text, visual, and auditory modalities. In most studies, the textual modality is usually considered to contain the most emotional information and is regarded as the dominant modality. Existing methods mostly map auxiliary modalities into a semantic space close to the dominant modality, which overly relies on the dominant modality. In this work, we propose a Feature Decomposition-Augmentation (FeaDA) framework, which aims to elevate the role of auxiliary modalities in multimodal data fusion. We first design a projector to decompose auxiliary modalities into partial features, which contain features for emotion judgment, and then utilize these decomposed features to guide the fusion process with KL loss, thereby enhancing the status of auxiliary modality fusion. To verify the effectiveness of our method, we conducted experiments on the CMU-MOSI, CMU-MOSEI, and CH-SIMS datasets. The experimental results show that our FeaDA framework outperforms multimodal sentiment analysis methods of the same type in main metrics. Our code is available at https://github.com/PowerLittleYin/FeaDA-main.
%U https://aclanthology.org/2025.ijcnlp-long.6/
%P 86-98
Markdown (Informal)
[Feature Decomposition-Augmentation Network for Multimodal Sentiment Analysis](https://aclanthology.org/2025.ijcnlp-long.6/) (Yin et al., IJCNLP-AACL 2025)
ACL
- Dapeng Yin, Bingxuan Hou, Mengna Gao, Shuyue Zhu, and Junli Wang. 2025. Feature Decomposition-Augmentation Network for Multimodal Sentiment Analysis. In Proceedings of the 14th International Joint Conference on Natural Language Processing and the 4th Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics, pages 86–98, Mumbai, India. The Asian Federation of Natural Language Processing and The Association for Computational Linguistics.
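The abstract describes two components: a projector that decomposes an auxiliary modality (audio or visual) into partial features carrying emotion-relevant information, and a KL-divergence loss that uses those decomposed features to guide multimodal fusion. The sketch below is a minimal, hypothetical PyTorch illustration of that idea, not the authors' implementation (the official FeaDA code is at https://github.com/PowerLittleYin/FeaDA-main); all module names, dimensions, the number of partial features, and the exact placement of the KL term are assumptions made for exposition.

```python
# Illustrative sketch only; see the linked repository for the official FeaDA code.
import torch
import torch.nn as nn
import torch.nn.functional as F

class AuxiliaryProjector(nn.Module):
    """Hypothetical projector that decomposes an auxiliary-modality embedding
    into several partial features, per the idea described in the abstract."""
    def __init__(self, in_dim=128, part_dim=64, num_parts=4):
        super().__init__()
        self.heads = nn.ModuleList([
            nn.Sequential(nn.Linear(in_dim, part_dim), nn.ReLU())
            for _ in range(num_parts)
        ])

    def forward(self, x):
        # x: (batch, in_dim) -> (batch, num_parts, part_dim)
        return torch.stack([head(x) for head in self.heads], dim=1)

def kl_guidance_loss(fused_logits, partial_logits):
    """KL term that pulls the fused prediction toward the prediction derived
    from the decomposed auxiliary features (guidance distribution)."""
    log_p = F.log_softmax(fused_logits, dim=-1)        # fused distribution (log-probs)
    q = F.softmax(partial_logits.detach(), dim=-1)     # guidance distribution (probs)
    return F.kl_div(log_p, q, reduction="batchmean")

# Toy usage with random tensors (shapes are placeholders).
proj = AuxiliaryProjector()
audio = torch.randn(8, 128)
parts = proj(audio)                      # (8, 4, 64) partial features
fused_logits = torch.randn(8, 3)         # e.g. output of a multimodal fusion head
partial_logits = torch.randn(8, 3)       # e.g. output of a head over `parts`
loss = kl_guidance_loss(fused_logits, partial_logits)
print(loss.item())
```

In this sketch the guidance distribution is detached so the KL term shapes only the fused branch; whether FeaDA applies the constraint in this direction, or on other intermediate representations, is not specified by the abstract and would need to be checked against the released code.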