Multimodal Incremental Transformer with Visual Grounding for Visual Dialogue Generation

Feilong Chen, Fandong Meng, Xiuyi Chen, Peng Li, Jie Zhou


Anthology ID:
2021.findings-acl.38
Volume:
Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021
Month:
August
Year:
2021
Address:
Online
Editors:
Chengqing Zong, Fei Xia, Wenjie Li, Roberto Navigli
Venue:
Findings
Publisher:
Association for Computational Linguistics
Pages:
436–446
URL:
https://aclanthology.org/2021.findings-acl.38
DOI:
10.18653/v1/2021.findings-acl.38
Cite (ACL):
Feilong Chen, Fandong Meng, Xiuyi Chen, Peng Li, and Jie Zhou. 2021. Multimodal Incremental Transformer with Visual Grounding for Visual Dialogue Generation. In Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021, pages 436–446, Online. Association for Computational Linguistics.
Cite (Informal):
Multimodal Incremental Transformer with Visual Grounding for Visual Dialogue Generation (Chen et al., Findings 2021)
PDF:
https://aclanthology.org/2021.findings-acl.38.pdf
Code:
zyang-ur/onestage_grounding
Data:
VisDial, Visual Genome