Learning When Not to Answer: a Ternary Reward Structure for Reinforcement Learning Based Question Answering

Learning When Not to Answer: a Ternary Reward Structure for Reinforcement Learning Based Question Answering Fréderic Godin author Anjishnu Kumar author Arpit Mittal author 2019-06 text Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 2 (Industry Papers) Anastassia Loukina editor Michelle Morales editor Rohit Kumar editor Association for Computational Linguistics Minneapolis, Minnesota conference publication godin-etal-2019-learning 10.18653/v1/N19-2016 https://aclanthology.org/N19-2016/ 2019-06 122 129