Title: Counterfactual Learning from Bandit Feedback under Deterministic Logging: A Case Study in Statistical Machine Translation
Authors: Carolin Lawrence, Artem Sokolov, Stefan Riezler
Date: 2017-09
Type: text (conference publication)
Published in: Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing
Editors: Martha Palmer, Rebecca Hwa, Sebastian Riedel
Publisher: Association for Computational Linguistics
Location: Copenhagen, Denmark
Pages: 2566-2576
Anthology ID: lawrence-etal-2017-counterfactual
DOI: 10.18653/v1/D17-1272
URL: https://aclanthology.org/D17-1272/