Natalie Shapira, Mosh Levy, Seyed Hossein Alavi, Xuhui Zhou, Yejin Choi, Yoav Goldberg, Maarten Sap, and Vered Shwartz. 2024. Clever Hans or Neural Theory of Mind? Stress Testing Social Reasoning in Large Language Models. In Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics (Volume 1: Long Papers), pages 2257-2273, St. Julian’s, Malta, March 2024. Edited by Yvette Graham and Matthew Purver. Association for Computational Linguistics. Conference publication. Citation key: shapira-etal-2024-clever. DOI: 10.18653/v1/2024.eacl-long.138. URL: https://aclanthology.org/2024.eacl-long.138/