%0 Conference Proceedings %T Twenty Years of Confusion in Human Evaluation: NLG Needs Evaluation Sheets and Standardised Definitions %A Howcroft, David M. %A Belz, Anya %A Clinciu, Miruna-Adriana %A Gkatzia, Dimitra %A Hasan, Sadid A. %A Mahamood, Saad %A Mille, Simon %A van Miltenburg, Emiel %A Santhanam, Sashank %A Rieser, Verena %Y Davis, Brian %Y Graham, Yvette %Y Kelleher, John %Y Sripada, Yaji %S Proceedings of the 13th International Conference on Natural Language Generation %D 2020 %8 December %I Association for Computational Linguistics %C Dublin, Ireland %F howcroft-etal-2020-twenty %R 10.18653/v1/2020.inlg-1.23 %U https://aclanthology.org/2020.inlg-1.23/ %U https://doi.org/10.18653/v1/2020.inlg-1.23 %P 169-182