Category-Based Strategy-Driven Question Generator for Visual Dialogue

Shi Yanan; Tan Yanxin; Feng Fangxiang; Zheng Chunping; Wang Xiaojie

Category-Based Strategy-Driven Question Generator for Visual Dialogue

Shi Yanan, Tan Yanxin, Feng Fangxiang, Zheng Chunping, Wang Xiaojie

Abstract

GuessWhat?! is a task-oriented visual dialogue task which has two players a guesser and anoracle. Guesser aims to locate the object supposed by oracle by asking several Yes/No questions which are answered by oracle. How to ask proper questions is crucial to achieve the final goal of the whole task. Previous methods generally use an word-level generator which is hard to grasp the dialogue-level questioning strategy. They often generate repeated or useless questions. This paper proposes a sentence-level category-based strategy-driven question generator(CSQG) to explicitly provide a category based questioning strategy for the generator. First we encode the image and the dialogue history to decide the category of the next question to be generated. Thenthe question is generated with the helps of category-based dialogue strategy as well as encoding of both the image and dialogue history. The evaluation on large-scale visual dialogue dataset GuessWhat?! shows that our method can help guesser achieve 51.71% success rate which is the state-of-the-art on the supervised training methods.

Anthology ID:: 2021.ccl-1.89
Volume:: Proceedings of the 20th Chinese National Conference on Computational Linguistics
Month:: August
Year:: 2021
Address:: Huhhot, China
Editors:: Sheng Li (李生), Maosong Sun (孙茂松), Yang Liu (刘洋), Hua Wu (吴华), Kang Liu (刘康), Wanxiang Che (车万翔), Shizhu He (何世柱), Gaoqi Rao (饶高琦)
Venue:: CCL
SIG:
Publisher:: Chinese Information Processing Society of China
Note:
Pages:: 1000–1011
Language:: English
URL:: https://aclanthology.org/2021.ccl-1.89/
DOI:
Bibkey:
Cite (ACL):: Shi Yanan, Tan Yanxin, Feng Fangxiang, Zheng Chunping, and Wang Xiaojie. 2021. Category-Based Strategy-Driven Question Generator for Visual Dialogue. In Proceedings of the 20th Chinese National Conference on Computational Linguistics, pages 1000–1011, Huhhot, China. Chinese Information Processing Society of China.
Cite (Informal):: Category-Based Strategy-Driven Question Generator for Visual Dialogue (Yanan et al., CCL 2021)
Copy Citation:
PDF:: https://aclanthology.org/2021.ccl-1.89.pdf

PDF Cite Search Fix data