Maximum Margin Reward Networks for Learning from Explicit and Implicit Supervision Haoruo Peng author Ming-Wei Chang author Wen-tau Yih author 2017-09 text Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing Martha Palmer editor Rebecca Hwa editor Sebastian Riedel editor Association for Computational Linguistics Copenhagen, Denmark conference publication peng-etal-2017-maximum 10.18653/v1/D17-1252 https://aclanthology.org/D17-1252/ 2017-09 2368 2378