%0 Conference Proceedings
%T Proceedings of the Third Workshop on Multimodal Artificial Intelligence
%E Zadeh, Amir
%E Morency, Louis-Philippe
%E Liang, Paul Pu
%E Ross, Candace
%E Salakhutdinov, Ruslan
%E Poria, Soujanya
%E Cambria, Erik
%E Shi, Kelly
%D 2021
%8 June
%I Association for Computational Linguistics
%C Mexico City, Mexico
%F maiworkshop-2021-multimodal
%U https://aclanthology.org/2021.maiworkshop-1.0

%0 Conference Proceedings
%T Multimodal Weighted Fusion of Transformers for Movie Genre Classification
%A Rodríguez Bribiesca, Isaac
%A López Monroy, Adrián Pastor
%A Montes-y-Gómez, Manuel
%Y Zadeh, Amir
%Y Morency, Louis-Philippe
%Y Liang, Paul Pu
%Y Ross, Candace
%Y Salakhutdinov, Ruslan
%Y Poria, Soujanya
%Y Cambria, Erik
%Y Shi, Kelly
%S Proceedings of the Third Workshop on Multimodal Artificial Intelligence
%D 2021
%8 June
%I Association for Computational Linguistics
%C Mexico City, Mexico
%F rodriguez-bribiesca-etal-2021-multimodal
%R 10.18653/v1/2021.maiworkshop-1.1
%U https://aclanthology.org/2021.maiworkshop-1.1
%U https://doi.org/10.18653/v1/2021.maiworkshop-1.1
%P 1-5

%0 Conference Proceedings
%T On Randomized Classification Layers and Their Implications in Natural Language Generation
%A Shalev, Gal-Lev
%A Shalev, Gabi
%A Keshet, Joseph
%Y Zadeh, Amir
%Y Morency, Louis-Philippe
%Y Liang, Paul Pu
%Y Ross, Candace
%Y Salakhutdinov, Ruslan
%Y Poria, Soujanya
%Y Cambria, Erik
%Y Shi, Kelly
%S Proceedings of the Third Workshop on Multimodal Artificial Intelligence
%D 2021
%8 June
%I Association for Computational Linguistics
%C Mexico City, Mexico
%F shalev-etal-2021-randomized
%R 10.18653/v1/2021.maiworkshop-1.2
%U https://aclanthology.org/2021.maiworkshop-1.2
%U https://doi.org/10.18653/v1/2021.maiworkshop-1.2
%P 6-11

%0 Conference Proceedings
%T COIN: Conversational Interactive Networks for Emotion Recognition in Conversation
%A Zhang, Haidong
%A Chai, Yekun
%Y Zadeh, Amir
%Y Morency, Louis-Philippe
%Y Liang, Paul Pu
%Y Ross, Candace
%Y Salakhutdinov, Ruslan
%Y Poria, Soujanya
%Y Cambria, Erik
%Y Shi, Kelly
%S Proceedings of the Third Workshop on Multimodal Artificial Intelligence
%D 2021
%8 June
%I Association for Computational Linguistics
%C Mexico City, Mexico
%F zhang-chai-2021-coin
%R 10.18653/v1/2021.maiworkshop-1.3
%U https://aclanthology.org/2021.maiworkshop-1.3
%U https://doi.org/10.18653/v1/2021.maiworkshop-1.3
%P 12-18

%0 Conference Proceedings
%T A First Look: Towards Explainable TextVQA Models via Visual and Textual Explanations
%A Nagaraj Rao, Varun
%A Zhen, Xingjian
%A Hovsepian, Karen
%A Shen, Mingwei
%Y Zadeh, Amir
%Y Morency, Louis-Philippe
%Y Liang, Paul Pu
%Y Ross, Candace
%Y Salakhutdinov, Ruslan
%Y Poria, Soujanya
%Y Cambria, Erik
%Y Shi, Kelly
%S Proceedings of the Third Workshop on Multimodal Artificial Intelligence
%D 2021
%8 June
%I Association for Computational Linguistics
%C Mexico City, Mexico
%F nagaraj-rao-etal-2021-first
%R 10.18653/v1/2021.maiworkshop-1.4
%U https://aclanthology.org/2021.maiworkshop-1.4
%U https://doi.org/10.18653/v1/2021.maiworkshop-1.4
%P 19-29

%0 Conference Proceedings
%T Multi Task Learning based Framework for Multimodal Classification
%A Zeng, Danting
%Y Zadeh, Amir
%Y Morency, Louis-Philippe
%Y Liang, Paul Pu
%Y Ross, Candace
%Y Salakhutdinov, Ruslan
%Y Poria, Soujanya
%Y Cambria, Erik
%Y Shi, Kelly
%S Proceedings of the Third Workshop on Multimodal Artificial Intelligence
%D 2021
%8 June
%I Association for Computational Linguistics
%C Mexico City, Mexico
%F zeng-2021-multi
%R 10.18653/v1/2021.maiworkshop-1.5
%U https://aclanthology.org/2021.maiworkshop-1.5
%U https://doi.org/10.18653/v1/2021.maiworkshop-1.5
%P 30-35

%0 Conference Proceedings
%T Validity-Based Sampling and Smoothing Methods for Multiple Reference Image Captioning
%A Nagasawa, Shunta
%A Watanabe, Yotaro
%A Iyatomi, Hitoshi
%Y Zadeh, Amir
%Y Morency, Louis-Philippe
%Y Liang, Paul Pu
%Y Ross, Candace
%Y Salakhutdinov, Ruslan
%Y Poria, Soujanya
%Y Cambria, Erik
%Y Shi, Kelly
%S Proceedings of the Third Workshop on Multimodal Artificial Intelligence
%D 2021
%8 June
%I Association for Computational Linguistics
%C Mexico City, Mexico
%F nagasawa-etal-2021-validity
%R 10.18653/v1/2021.maiworkshop-1.6
%U https://aclanthology.org/2021.maiworkshop-1.6
%U https://doi.org/10.18653/v1/2021.maiworkshop-1.6
%P 36-41

%0 Conference Proceedings
%T Modality-specific Distillation
%A Jin, Woojeong
%A Sanjabi, Maziar
%A Nie, Shaoliang
%A Tan, Liang
%A Ren, Xiang
%A Firooz, Hamed
%Y Zadeh, Amir
%Y Morency, Louis-Philippe
%Y Liang, Paul Pu
%Y Ross, Candace
%Y Salakhutdinov, Ruslan
%Y Poria, Soujanya
%Y Cambria, Erik
%Y Shi, Kelly
%S Proceedings of the Third Workshop on Multimodal Artificial Intelligence
%D 2021
%8 June
%I Association for Computational Linguistics
%C Mexico City, Mexico
%F jin-etal-2021-modality
%R 10.18653/v1/2021.maiworkshop-1.7
%U https://aclanthology.org/2021.maiworkshop-1.7
%U https://doi.org/10.18653/v1/2021.maiworkshop-1.7
%P 42-53

%0 Conference Proceedings
%T Cold Start Problem For Automated Live Video Comments
%A Wu, Hao
%A Pitie, François
%A Jones, Gareth
%Y Zadeh, Amir
%Y Morency, Louis-Philippe
%Y Liang, Paul Pu
%Y Ross, Candace
%Y Salakhutdinov, Ruslan
%Y Poria, Soujanya
%Y Cambria, Erik
%Y Shi, Kelly
%S Proceedings of the Third Workshop on Multimodal Artificial Intelligence
%D 2021
%8 June
%I Association for Computational Linguistics
%C Mexico City, Mexico
%F wu-etal-2021-cold
%R 10.18653/v1/2021.maiworkshop-1.8
%U https://aclanthology.org/2021.maiworkshop-1.8
%U https://doi.org/10.18653/v1/2021.maiworkshop-1.8
%P 54-62

%0 Conference Proceedings
%T ¡Qué maravilla! Multimodal Sarcasm Detection in Spanish: a Dataset and a Baseline
%A Alnajjar, Khalid
%A Hämäläinen, Mika
%Y Zadeh, Amir
%Y Morency, Louis-Philippe
%Y Liang, Paul Pu
%Y Ross, Candace
%Y Salakhutdinov, Ruslan
%Y Poria, Soujanya
%Y Cambria, Erik
%Y Shi, Kelly
%S Proceedings of the Third Workshop on Multimodal Artificial Intelligence
%D 2021
%8 June
%I Association for Computational Linguistics
%C Mexico City, Mexico
%F alnajjar-hamalainen-2021-que
%R 10.18653/v1/2021.maiworkshop-1.9
%U https://aclanthology.org/2021.maiworkshop-1.9
%U https://doi.org/10.18653/v1/2021.maiworkshop-1.9
%P 63-68

%0 Conference Proceedings
%T A Package for Learning on Tabular and Text Data with Transformers
%A Gu, Ken
%A Budhkar, Akshay
%Y Zadeh, Amir
%Y Morency, Louis-Philippe
%Y Liang, Paul Pu
%Y Ross, Candace
%Y Salakhutdinov, Ruslan
%Y Poria, Soujanya
%Y Cambria, Erik
%Y Shi, Kelly
%S Proceedings of the Third Workshop on Multimodal Artificial Intelligence
%D 2021
%8 June
%I Association for Computational Linguistics
%C Mexico City, Mexico
%F gu-budhkar-2021-package
%R 10.18653/v1/2021.maiworkshop-1.10
%U https://aclanthology.org/2021.maiworkshop-1.10
%U https://doi.org/10.18653/v1/2021.maiworkshop-1.10
%P 69-73

%0 Conference Proceedings
%T Semantic Aligned Multi-modal Transformer for Vision-Language Understanding: A Preliminary Study on Visual QA
%A Ding, Han
%A Li, Li Erran
%A Hu, Zhiting
%A Xu, Yi
%A Hakkani-Tur, Dilek
%A Du, Zheng
%A Zeng, Belinda
%Y Zadeh, Amir
%Y Morency, Louis-Philippe
%Y Liang, Paul Pu
%Y Ross, Candace
%Y Salakhutdinov, Ruslan
%Y Poria, Soujanya
%Y Cambria, Erik
%Y Shi, Kelly
%S Proceedings of the Third Workshop on Multimodal Artificial Intelligence
%D 2021
%8 June
%I Association for Computational Linguistics
%C Mexico City, Mexico
%F ding-etal-2021-semantic
%R 10.18653/v1/2021.maiworkshop-1.11
%U https://aclanthology.org/2021.maiworkshop-1.11
%U https://doi.org/10.18653/v1/2021.maiworkshop-1.11
%P 74-78

%0 Conference Proceedings
%T GraghVQA: Language-Guided Graph Neural Networks for Graph-based Visual Question Answering
%A Liang, Weixin
%A Jiang, Yanhao
%A Liu, Zixuan
%Y Zadeh, Amir
%Y Morency, Louis-Philippe
%Y Liang, Paul Pu
%Y Ross, Candace
%Y Salakhutdinov, Ruslan
%Y Poria, Soujanya
%Y Cambria, Erik
%Y Shi, Kelly
%S Proceedings of the Third Workshop on Multimodal Artificial Intelligence
%D 2021
%8 June
%I Association for Computational Linguistics
%C Mexico City, Mexico
%F liang-etal-2021-graghvqa
%R 10.18653/v1/2021.maiworkshop-1.12
%U https://aclanthology.org/2021.maiworkshop-1.12
%U https://doi.org/10.18653/v1/2021.maiworkshop-1.12
%P 79-86

%0 Conference Proceedings
%T Learning to Select Question-Relevant Relations for Visual Question Answering
%A Lee, Jaewoong
%A Lee, Heejoon
%A Lee, Hwanhee
%A Jung, Kyomin
%Y Zadeh, Amir
%Y Morency, Louis-Philippe
%Y Liang, Paul Pu
%Y Ross, Candace
%Y Salakhutdinov, Ruslan
%Y Poria, Soujanya
%Y Cambria, Erik
%Y Shi, Kelly
%S Proceedings of the Third Workshop on Multimodal Artificial Intelligence
%D 2021
%8 June
%I Association for Computational Linguistics
%C Mexico City, Mexico
%F lee-etal-2021-learning
%R 10.18653/v1/2021.maiworkshop-1.13
%U https://aclanthology.org/2021.maiworkshop-1.13
%U https://doi.org/10.18653/v1/2021.maiworkshop-1.13
%P 87-96