Evaluation of Response Generation Models: Shouldn’t It Be Shareable and Replicable?

Evaluation of Response Generation Models: Shouldn’t It Be Shareable and Replicable? Seyed Mahed Mousavi author Gabriel Roccabruna author Michela Lorandi author Simone Caldarella author Giuseppe Riccardi author 2022-12 text Proceedings of the Second Workshop on Natural Language Generation, Evaluation, and Metrics (GEM) Antoine Bosselut editor Khyathi Chandu editor Kaustubh Dhole editor Varun Gangal editor Sebastian Gehrmann editor Yacine Jernite editor Jekaterina Novikova editor Laura Perez-Beltrachini editor Association for Computational Linguistics Abu Dhabi, United Arab Emirates (Hybrid) conference publication mousavi-etal-2022-evaluation 10.18653/v1/2022.gem-1.12 https://aclanthology.org/2022.gem-1.12/ 2022-12 136 147