%0 Conference Proceedings
%T COIN: Conversational Interactive Networks for Emotion Recognition in Conversation
%A Zhang, Haidong
%A Chai, Yekun
%Y Zadeh, Amir
%Y Morency, Louis-Philippe
%Y Liang, Paul Pu
%Y Ross, Candace
%Y Salakhutdinov, Ruslan
%Y Poria, Soujanya
%Y Cambria, Erik
%Y Shi, Kelly
%S Proceedings of the Third Workshop on Multimodal Artificial Intelligence
%D 2021
%8 June
%I Association for Computational Linguistics
%C Mexico City, Mexico
%F zhang-chai-2021-coin
%X Emotion recognition in conversation has received considerable attention recently because of its practical industrial applications. Existing methods tend to overlook the immediate mutual interaction between different speakers in the speaker-utterance level, or apply single speaker-agnostic RNN for utterances from different speakers. We propose COIN, a conversational interactive model to mitigate this problem by applying state mutual interaction within history contexts. In addition, we introduce a stacked global interaction module to capture the contextual and inter-dependency representation in a hierarchical manner. To improve the robustness and generalization during training, we generate adversarial examples by applying the minor perturbations on multimodal feature inputs, unveiling the benefits of adversarial examples for emotion detection. The proposed model empirically achieves the current state-of-the-art results on the IEMOCAP benchmark dataset.
%R 10.18653/v1/2021.maiworkshop-1.3
%U https://aclanthology.org/2021.maiworkshop-1.3
%U https://doi.org/10.18653/v1/2021.maiworkshop-1.3
%P 12-18