CRAFT: A Benchmark for Causal Reasoning About Forces and inTeractions

Tayfun Ates; M. Ateşoğlu; Çağatay Yiğit; Ilker Kesen; Mert Kobas; Erkut Erdem; Aykut Erdem; Tilbe Goksun; Deniz Yuret

doi:10.18653/v1/2022.findings-acl.205

CRAFT: A Benchmark for Causal Reasoning About Forces and inTeractions

Tayfun Ates, M. Ateşoğlu, Çağatay Yiğit, Ilker Kesen, Mert Kobas, Erkut Erdem, Aykut Erdem, Tilbe Goksun, Deniz Yuret

Abstract

Humans are able to perceive, understand and reason about causal events. Developing models with similar physical and causal understanding capabilities is a long-standing goal of artificial intelligence. As a step towards this direction, we introduce CRAFT, a new video question answering dataset that requires causal reasoning about physical forces and object interactions. It contains 58K video and question pairs that are generated from 10K videos from 20 different virtual environments, containing various objects in motion that interact with each other and the scene. Two question categories in CRAFT include previously studied descriptive and counterfactual questions. Additionally, inspired by the Force Dynamics Theory in cognitive linguistics, we introduce a new causal question category that involves understanding the causal interactions between objects through notions like cause, enable, and prevent. Our results show that even though the questions in CRAFT are easy for humans, the tested baseline models, including existing state-of-the-art methods, do not yet deal with the challenges posed in our benchmark.

Anthology ID:: 2022.findings-acl.205
Volume:: Findings of the Association for Computational Linguistics: ACL 2022
Month:: May
Year:: 2022
Address:: Dublin, Ireland
Editors:: Smaranda Muresan, Preslav Nakov, Aline Villavicencio
Venue:: Findings
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 2602–2627
Language:
URL:: https://aclanthology.org/2022.findings-acl.205/
DOI:: 10.18653/v1/2022.findings-acl.205
Bibkey:
Cite (ACL):: Tayfun Ates, M. Ateşoğlu, Çağatay Yiğit, Ilker Kesen, Mert Kobas, Erkut Erdem, Aykut Erdem, Tilbe Goksun, and Deniz Yuret. 2022. CRAFT: A Benchmark for Causal Reasoning About Forces and inTeractions. In Findings of the Association for Computational Linguistics: ACL 2022, pages 2602–2627, Dublin, Ireland. Association for Computational Linguistics.
Cite (Informal):: CRAFT: A Benchmark for Causal Reasoning About Forces and inTeractions (Ates et al., Findings 2022)
Copy Citation:
PDF:: https://aclanthology.org/2022.findings-acl.205.pdf
Video:: https://aclanthology.org/2022.findings-acl.205.mp4

PDF Cite Search Video Fix data