Twenty Years of Confusion in Human Evaluation: NLG Needs Evaluation Sheets and Standardised Definitions David M Howcroft author Anya Belz author Miruna-Adriana Clinciu author Dimitra Gkatzia author Sadid A Hasan author Saad Mahamood author Simon Mille author Emiel van Miltenburg author Sashank Santhanam author Verena Rieser author 2020-12 text Proceedings of the 13th International Conference on Natural Language Generation Brian Davis editor Yvette Graham editor John Kelleher editor Yaji Sripada editor Association for Computational Linguistics Dublin, Ireland conference publication howcroft-etal-2020-twenty 10.18653/v1/2020.inlg-1.23 https://aclanthology.org/2020.inlg-1.23/ 2020-12 169 182