KnowShiftQA: How Robust are RAG Systems when Textbook Knowledge Shifts in K-12 Education?

Tianshi Zheng; Weihan Li; Jiaxin Bai; Weiqi Wang; Yangqiu Song

doi:10.18653/v1/2025.acl-short.16

KnowShiftQA: How Robust are RAG Systems when Textbook Knowledge Shifts in K-12 Education?

Tianshi Zheng, Weihan Li, Jiaxin Bai, Weiqi Wang, Yangqiu Song

Abstract

Retrieval-Augmented Generation (RAG) systems show remarkable potential as question answering tools in the K-12 Education domain, where knowledge is typically queried within the restricted scope of authoritative textbooks. However, discrepancies between these textbooks and the parametric knowledge inherent in Large Language Models (LLMs) can undermine the effectiveness of RAG systems. To systematically investigate RAG system robustness against such knowledge discrepancies, we introduce KnowShiftQA. This novel question answering dataset simulates these discrepancies by applying deliberate hypothetical knowledge updates to both answers and source documents, reflecting how textbook knowledge can shift. KnowShiftQA comprises 3,005 questions across five subjects, designed with a comprehensive question typology focusing on context utilization and knowledge integration. Our extensive experiments on retrieval and question answering performance reveal that most RAG systems suffer a substantial performance drop when faced with these knowledge discrepancies. Furthermore, questions requiring the integration of contextual (textbook) knowledge with parametric (LLM) knowledge pose a significant challenge to current LLMs.

Anthology ID:: 2025.acl-short.16
Volume:: Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)
Month:: July
Year:: 2025
Address:: Vienna, Austria
Editors:: Wanxiang Che, Joyce Nabende, Ekaterina Shutova, Mohammad Taher Pilehvar
Venue:: ACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 183–195
Language:
URL:: https://aclanthology.org/2025.acl-short.16/
DOI:: 10.18653/v1/2025.acl-short.16
Bibkey:
Cite (ACL):: Tianshi Zheng, Weihan Li, Jiaxin Bai, Weiqi Wang, and Yangqiu Song. 2025. KnowShiftQA: How Robust are RAG Systems when Textbook Knowledge Shifts in K-12 Education?. In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), pages 183–195, Vienna, Austria. Association for Computational Linguistics.
Cite (Informal):: KnowShiftQA: How Robust are RAG Systems when Textbook Knowledge Shifts in K-12 Education? (Zheng et al., ACL 2025)
Copy Citation:
PDF:: https://aclanthology.org/2025.acl-short.16.pdf

PDF Cite Search Fix data