Multimodal Hierarchical Reinforcement Learning Policy for Task-Oriented Visual Dialog Jiaping Zhang author Tiancheng Zhao author Zhou Yu author 2018-07 text Proceedings of the 19th Annual SIGdial Meeting on Discourse and Dialogue Kazunori Komatani editor Diane Litman editor Kai Yu editor Alex Papangelis editor Lawrence Cavedon editor Mikio Nakano editor Association for Computational Linguistics Melbourne, Australia conference publication zhang-etal-2018-multimodal 10.18653/v1/W18-5015 https://aclanthology.org/W18-5015/ 2018-07 140 150