@inproceedings{limpijankit-etal-2025-counterfactual,
title = "Counterfactual Simulatability of {LLM} Explanations for Generation Tasks",
author = "Limpijankit, Marvin and
Chen, Yanda and
Subbiah, Melanie and
Deas, Nicholas and
McKeown, Kathleen",
editor = "Flek, Lucie and
Narayan, Shashi and
Phương, L{\^e} Hồng and
Pei, Jiahuan",
booktitle = "Proceedings of the 18th International Natural Language Generation Conference",
month = oct,
year = "2025",
address = "Hanoi, Vietnam",
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/2025.inlg-main.38/",
pages = "659--683",
abstract = "LLMs can be unpredictable, as even slight alterations to the prompt can cause the output to change in unexpected ways. Thus, the ability of models to accurately explain their behavior is critical, especially in high-stakes settings. Counterfactual simulatability measures how well an explanation allows users to infer the model{'}s output on related counterfactuals and has been previously studied for yes/no question answering. We provide a general framework for extending this method to generation tasks, using news summarization and medical suggestion as example use cases. We find that while LLM explanations do enable users to better predict their outputs on counterfactuals in the summarization setting, there is significant room for improvement for medical suggestion. Furthermore, our results suggest that evaluating counterfactual simulatability may be more appropriate for skill-based tasks as opposed to knowledge-based tasks."
}

<?xml version="1.0" encoding="UTF-8"?>
<modsCollection xmlns="http://www.loc.gov/mods/v3">
  <mods ID="limpijankit-etal-2025-counterfactual">
    <titleInfo>
      <title>Counterfactual Simulatability of LLM Explanations for Generation Tasks</title>
    </titleInfo>
    <name type="personal">
      <namePart type="given">Marvin</namePart>
      <namePart type="family">Limpijankit</namePart>
      <role>
        <roleTerm authority="marcrelator" type="text">author</roleTerm>
      </role>
    </name>
    <name type="personal">
      <namePart type="given">Yanda</namePart>
      <namePart type="family">Chen</namePart>
      <role>
        <roleTerm authority="marcrelator" type="text">author</roleTerm>
      </role>
    </name>
    <name type="personal">
      <namePart type="given">Melanie</namePart>
      <namePart type="family">Subbiah</namePart>
      <role>
        <roleTerm authority="marcrelator" type="text">author</roleTerm>
      </role>
    </name>
    <name type="personal">
      <namePart type="given">Nicholas</namePart>
      <namePart type="family">Deas</namePart>
      <role>
        <roleTerm authority="marcrelator" type="text">author</roleTerm>
      </role>
    </name>
    <name type="personal">
      <namePart type="given">Kathleen</namePart>
      <namePart type="family">McKeown</namePart>
      <role>
        <roleTerm authority="marcrelator" type="text">author</roleTerm>
      </role>
    </name>
    <originInfo>
      <dateIssued>2025-10</dateIssued>
    </originInfo>
    <typeOfResource>text</typeOfResource>
    <relatedItem type="host">
      <titleInfo>
        <title>Proceedings of the 18th International Natural Language Generation Conference</title>
      </titleInfo>
      <name type="personal">
        <namePart type="given">Lucie</namePart>
        <namePart type="family">Flek</namePart>
        <role>
          <roleTerm authority="marcrelator" type="text">editor</roleTerm>
        </role>
      </name>
      <name type="personal">
        <namePart type="given">Shashi</namePart>
        <namePart type="family">Narayan</namePart>
        <role>
          <roleTerm authority="marcrelator" type="text">editor</roleTerm>
        </role>
      </name>
      <name type="personal">
        <namePart type="given">Lê</namePart>
        <namePart type="given">Hồng</namePart>
        <namePart type="family">Phương</namePart>
        <role>
          <roleTerm authority="marcrelator" type="text">editor</roleTerm>
        </role>
      </name>
      <name type="personal">
        <namePart type="given">Jiahuan</namePart>
        <namePart type="family">Pei</namePart>
        <role>
          <roleTerm authority="marcrelator" type="text">editor</roleTerm>
        </role>
      </name>
      <originInfo>
        <publisher>Association for Computational Linguistics</publisher>
        <place>
          <placeTerm type="text">Hanoi, Vietnam</placeTerm>
        </place>
      </originInfo>
      <genre authority="marcgt">conference publication</genre>
    </relatedItem>
    <abstract>LLMs can be unpredictable, as even slight alterations to the prompt can cause the output to change in unexpected ways. Thus, the ability of models to accurately explain their behavior is critical, especially in high-stakes settings. Counterfactual simulatability measures how well an explanation allows users to infer the model’s output on related counterfactuals and has been previously studied for yes/no question answering. We provide a general framework for extending this method to generation tasks, using news summarization and medical suggestion as example use cases. We find that while LLM explanations do enable users to better predict their outputs on counterfactuals in the summarization setting, there is significant room for improvement for medical suggestion. Furthermore, our results suggest that evaluating counterfactual simulatability may be more appropriate for skill-based tasks as opposed to knowledge-based tasks.</abstract>
    <identifier type="citekey">limpijankit-etal-2025-counterfactual</identifier>
    <location>
      <url>https://aclanthology.org/2025.inlg-main.38/</url>
    </location>
    <part>
      <date>2025-10</date>
      <extent unit="page">
        <start>659</start>
        <end>683</end>
      </extent>
    </part>
  </mods>
</modsCollection>

%0 Conference Proceedings
%T Counterfactual Simulatability of LLM Explanations for Generation Tasks
%A Limpijankit, Marvin
%A Chen, Yanda
%A Subbiah, Melanie
%A Deas, Nicholas
%A McKeown, Kathleen
%Y Flek, Lucie
%Y Narayan, Shashi
%Y Phương, Lê Hồng
%Y Pei, Jiahuan
%S Proceedings of the 18th International Natural Language Generation Conference
%D 2025
%8 October
%I Association for Computational Linguistics
%C Hanoi, Vietnam
%F limpijankit-etal-2025-counterfactual
%X LLMs can be unpredictable, as even slight alterations to the prompt can cause the output to change in unexpected ways. Thus, the ability of models to accurately explain their behavior is critical, especially in high-stakes settings. Counterfactual simulatability measures how well an explanation allows users to infer the model’s output on related counterfactuals and has been previously studied for yes/no question answering. We provide a general framework for extending this method to generation tasks, using news summarization and medical suggestion as example use cases. We find that while LLM explanations do enable users to better predict their outputs on counterfactuals in the summarization setting, there is significant room for improvement for medical suggestion. Furthermore, our results suggest that evaluating counterfactual simulatability may be more appropriate for skill-based tasks as opposed to knowledge-based tasks.
%U https://aclanthology.org/2025.inlg-main.38/
%P 659-683

Markdown (Informal)

[Counterfactual Simulatability of LLM Explanations for Generation Tasks](https://aclanthology.org/2025.inlg-main.38/) (Limpijankit et al., INLG 2025)

ACL

Marvin Limpijankit, Yanda Chen, Melanie Subbiah, Nicholas Deas, and Kathleen McKeown. 2025. Counterfactual Simulatability of LLM Explanations for Generation Tasks. In Proceedings of the 18th International Natural Language Generation Conference, pages 659–683, Hanoi, Vietnam. Association for Computational Linguistics.