Human vs. Muppet: A Conservative Estimate of Human Performance on the GLUE Benchmark

Authors: Nikita Nangia, Samuel R. Bowman
Published in: Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pages 4566-4575
Editors: Anna Korhonen, David Traum, Lluís Màrquez
Publisher: Association for Computational Linguistics
Location: Florence, Italy
Date: July 2019
Type: conference publication
DOI: 10.18653/v1/P19-1449
URL: https://aclanthology.org/P19-1449/
Anthology ID: nangia-bowman-2019-human