Alex Terentowicz

2025

How (un)faithful are explainable LLM-based NLG metrics?
Alex Terentowicz | Mateusz Lango | Ondrej Dusek
Proceedings of the 18th International Natural Language Generation Conference

Explainable NLG metrics are becoming a popular research topic; however, the faithfulness of the explanations they provide is typically not evaluated. In this work, we propose a testbed for assessing the faithfulness of span-based metrics by performing controlled perturbations of their explanations and observing changes in the final score. We show that several popular LLM evaluators do not consistently produce faithful explanations.

Co-authors

Ondřej Dušek 1
Mateusz Lango 1

Venues

inlg1

Fix author