Evaluating Humanities Theory Alignment in Large Language Models: Incremental Prompting and Statistical Assessment

Axel Pichler; Janis Pagel

Abstract

We propose a top-down, theory-driven, black-box method to evaluate the extent to which an LLM’s observable input–output behavior aligns with established theories in the humanities and cultural studies. We instantiate it on three humanities theories: Davidson’s truth-conditional semantics, Lewis’s truth in fiction, and Iser’s concept of textual gaps. Core assumptions of these theories are reconstructed into testable behavioral rules and assessed via controlled classification tasks with systematic prompt comparisons and significance testing. Our experiments show that theory-uninformed classification prompts generally outperform theory-enriched prompts in the Lewis and Iser settings, while theory-informed prompts help in the Davidson task. Gemini Flash consistently achieves the highest scores across tasks and corpora, and the Iser gap detection task remains substantially harder than binary truth-conditional judgments. Statistical tests confirm robust prompt effects and the failure of basic prompts. However, model behavior under incremental theory exposure is unstable and architecture-dependent.

Anthology ID: 2026.latechclfl-1.27
Volume: Proceedings of the 10th Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature 2026
Month: March
Year: 2026
Address: Rabat, Morocco
Editors: Diego Alves, Yuri Bizzoni, Stefania Degaetano-Ortlieb, Anna Kazantseva, Janis Pagel, Stan Szpakowicz
Venues: LaTeCH-CLfL | WS
SIG: SIGHUM
Publisher: Association for Computational Linguistics
Pages: 280–294
URL: https://aclanthology.org/2026.latechclfl-1.27/
Cite (ACL): Axel Pichler and Janis Pagel. 2026. Evaluating Humanities Theory Alignment in Large Language Models: Incremental Prompting and Statistical Assessment. In Proceedings of the 10th Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature 2026, pages 280–294, Rabat, Morocco. Association for Computational Linguistics.
Cite (Informal): Evaluating Humanities Theory Alignment in Large Language Models: Incremental Prompting and Statistical Assessment (Pichler & Pagel, LaTeCH-CLfL 2026)
PDF: https://aclanthology.org/2026.latechclfl-1.27.pdf
Supplementary material: 2026.latechclfl-1.27.SupplementaryMaterial.txt