FoREST: Frame of Reference Evaluation in Spatial Reasoning Tasks

Tanawan Premsri; Parisa Kordjamshidi

doi:10.18653/v1/2025.emnlp-main.1772

FoREST: Frame of Reference Evaluation in Spatial Reasoning Tasks

Abstract

Spatial reasoning is a fundamental aspect of human intelligence. One key concept in spatial cognition is the Frame of Reference (FoR), which identifies the perspective of spatial expressions. Despite its significance, FoR has received limited attention in AI models that need spatial intelligence. There is a lack of dedicated benchmarks and in-depth evaluation of large language models (LLMs) in this area. To address this issue, we introduce the Frame of Reference Evaluation in Spatial Reasoning Tasks (FoREST) benchmark, designed to assess FoR comprehension in LLMs. We evaluate LLMs on answering questions that require FoR comprehension and layout generation in text-to-image models using FoREST. Our results reveal a notable performance gap across different FoR classes in various LLMs, affecting their ability to generate accurate layouts for text-to-image generation. This highlights critical shortcomings in FoR comprehension. To improve FoR understanding, we propose Spatial-Guided prompting, which improves LLMs’ ability to extract essential spatial concepts. Our proposed method improves overall performance across spatial reasoning tasks.

Anthology ID:: 2025.emnlp-main.1772
Volume:: Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
Month:: November
Year:: 2025
Address:: Suzhou, China
Editors:: Christos Christodoulopoulos, Tanmoy Chakraborty, Carolyn Rose, Violet Peng
Venue:: EMNLP
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 34977–35003
Language:
URL:: https://aclanthology.org/2025.emnlp-main.1772/
DOI:: 10.18653/v1/2025.emnlp-main.1772
Bibkey:
Cite (ACL):: Tanawan Premsri and Parisa Kordjamshidi. 2025. FoREST: Frame of Reference Evaluation in Spatial Reasoning Tasks. In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, pages 34977–35003, Suzhou, China. Association for Computational Linguistics.
Cite (Informal):: FoREST: Frame of Reference Evaluation in Spatial Reasoning Tasks (Premsri & Kordjamshidi, EMNLP 2025)
Copy Citation:
PDF:: https://aclanthology.org/2025.emnlp-main.1772.pdf
Checklist:: 2025.emnlp-main.1772.checklist.pdf

PDF Cite Search Checklist Fix data