Dan Hendrycks, Xiaoyuan Liu, Eric Wallace, Adam Dziedzic, Rishabh Krishnan, and Dawn Song. 2020. Pretrained Transformers Improve Out-of-Distribution Robustness. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pages 2744-2751, Online, July 2020. Edited by Dan Jurafsky, Joyce Chai, Natalie Schluter, and Joel Tetreault. Association for Computational Linguistics. Conference publication. DOI: 10.18653/v1/2020.acl-main.244. URL: https://aclanthology.org/2020.acl-main.244/. Citation key: hendrycks-etal-2020-pretrained.