Understanding the Impact of Experiment Design for Evaluating Dialogue System Output Sashank Santhanam author Samira Shaikh author 2020-07 text Proceedings of the Fourth Widening Natural Language Processing Workshop Rossana Cunha editor Samira Shaikh editor Erika Varis editor Ryan Georgi editor Alicia Tsai editor Antonios Anastasopoulos editor Khyathi Raghavi Chandu editor Association for Computational Linguistics Seattle, USA conference publication santhanam-shaikh-2020-understanding 10.18653/v1/2020.winlp-1.33 https://aclanthology.org/2020.winlp-1.33/ 2020-07 124 127