How reproducible is best-worst scaling for human evaluation? A reproduction of ‘Data-to-text Generation with Macro Planning’ Emiel van Miltenburg author Anouck Braggaar author Nadine Braun author Debby Damen author Martijn Goudbeek author Chris van der Lee author Frédéric Tomas author Emiel Krahmer author 2023-09 text Proceedings of the 3rd Workshop on Human Evaluation of NLP Systems Anya Belz editor Maja Popović editor Ehud Reiter editor Craig Thomson editor João Sedoc editor INCOMA Ltd., Shoumen, Bulgaria Varna, Bulgaria conference publication van-miltenburg-etal-2023-reproducible https://aclanthology.org/2023.humeval-1.7/ 2023-09 75 88