SWAG: A Large-Scale Adversarial Dataset for Grounded Commonsense Inference

Rowan Zellers; Yonatan Bisk; Roy Schwartz; Yejin Choi

doi:10.18653/v1/D18-1009

Abstract

Given a partial description like “she opened the hood of the car,” humans can reason about the situation and anticipate what might come next (“then, she examined the engine”). In this paper, we introduce the task of grounded commonsense inference, unifying natural language inference and commonsense reasoning. We present SWAG, a new dataset with 113k multiple choice questions about a rich spectrum of grounded situations. To address the recurring challenges of the annotation artifacts and human biases found in many existing datasets, we propose Adversarial Filtering (AF), a novel procedure that constructs a de-biased dataset by iteratively training an ensemble of stylistic classifiers, and using them to filter the data. To account for the aggressive adversarial filtering, we use state-of-the-art language models to massively oversample a diverse set of potential counterfactuals. Empirical results demonstrate that while humans can solve the resulting inference problems with high accuracy (88%), various competitive models struggle on our task. We provide comprehensive analysis that indicates significant opportunities for future research.
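The filtering loop described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the record layout (`real`, `pool`, `negatives`), the function names, the half/half split, and the single word-frequency scorer standing in for the ensemble of stylistic classifiers are all assumptions made for illustration. It also assumes each pool holds at least `num_negatives` distinct endings, none equal to the real one.

```python
import random
from collections import Counter


def train_scorer(examples):
    """Toy stand-in for a stylistic classifier (assumption, not the paper's model).

    Returns a function where a higher score means "looks more machine-generated".
    """
    real, fake = Counter(), Counter()
    for ex in examples:
        real.update(ex["real"].split())
        for neg in ex["negatives"]:
            fake.update(neg.split())
    n_real = max(sum(real.values()), 1)
    n_fake = max(sum(fake.values()), 1)

    def score(text):
        words = text.split()
        # Relative frequency among generated endings minus among real endings.
        total = sum(fake[w] / n_fake - real[w] / n_real for w in words)
        return total / max(len(words), 1)

    return score


def adversarial_filter(dataset, num_negatives=3, iterations=10, seed=0):
    """Iteratively swap easy-to-detect negatives for harder ones from the pool.

    Mutates and returns `dataset`, a list of dicts with keys "real" and "pool".
    """
    rng = random.Random(seed)
    for ex in dataset:
        ex["negatives"] = rng.sample(ex["pool"], num_negatives)

    for _ in range(iterations):
        # Fresh random split each round: fit on one half, filter the other.
        order = list(range(len(dataset)))
        rng.shuffle(order)
        half = len(order) // 2
        train = [dataset[i] for i in order[:half]]
        held_out = [dataset[i] for i in order[half:]]
        score = train_scorer(train)

        for ex in held_out:
            unused = [c for c in ex["pool"] if c not in ex["negatives"]]
            if not unused:
                continue
            easiest = max(ex["negatives"], key=score)  # most obviously generated
            hardest = min(unused, key=score)           # most real-looking candidate
            if score(hardest) < score(easiest):
                ex["negatives"][ex["negatives"].index(easiest)] = hardest
    return dataset
```

Each record would pair one human-written ending with an oversampled pool of generated candidates; after filtering, `negatives` holds the distractors that the toy scorer found hardest to tell apart from real text.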

Anthology ID: D18-1009
Volume: Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
Month: October-November
Year: 2018
Address: Brussels, Belgium
Editors: Ellen Riloff, David Chiang, Julia Hockenmaier, Jun’ichi Tsujii
Venue: EMNLP
SIG: SIGDAT
Publisher: Association for Computational Linguistics
Pages: 93–104
URL: https://aclanthology.org/D18-1009/
DOI: 10.18653/v1/D18-1009
Cite (ACL): Rowan Zellers, Yonatan Bisk, Roy Schwartz, and Yejin Choi. 2018. SWAG: A Large-Scale Adversarial Dataset for Grounded Commonsense Inference. In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, pages 93–104, Brussels, Belgium. Association for Computational Linguistics.
Cite (Informal): SWAG: A Large-Scale Adversarial Dataset for Grounded Commonsense Inference (Zellers et al., EMNLP 2018)
PDF: https://aclanthology.org/D18-1009.pdf
Attachment: D18-1009.Attachment.zip
Video: https://aclanthology.org/D18-1009.mp4
