Weixin Liang, James Zou, and Zhou Yu. 2020. Beyond User Self-Reported Likert Scale Ratings: A Comparison Model for Automatic Dialog Evaluation. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pages 1363-1374, Online, July 2020. Edited by Dan Jurafsky, Joyce Chai, Natalie Schluter, and Joel Tetreault. Association for Computational Linguistics. Conference publication. Citation key: liang-etal-2020-beyond. DOI: 10.18653/v1/2020.acl-main.126. URL: https://aclanthology.org/2020.acl-main.126/