Interpretable Coreference Resolution Evaluation Using Explicit Semantics

Bruno Gatti; Giuliano Martinelli; Roberto Navigli

Interpretable Coreference Resolution Evaluation Using Explicit Semantics

Bruno Gatti, Giuliano Martinelli, Roberto Navigli

Abstract

Coreference resolution is typically evaluated using aggregate statistical metrics such as CoNLL-F1, which measure structural overlap between predicted and gold clusters. While widely used, these metrics offer limited diagnostic insights, penalizing errors without revealing whether a system struggles with specific semantic categories, such as people, locations, or events, and making it difficult to interpret model capabilities or derive actionable improvements. We address this gap by introducing a semantically-enhanced evaluation framework for coreference resolution. Our approach overlays Concept and Named Entity Recognition (CNER) onto coreference outputs, assigning semantic labels to nominal mentions and propagating them to entire coreference clusters. This enables the computation of typed scores aimed at evaluating mention extraction and linking capabilities stratified by semantic class. Across our experiments on OntoNotes, LitBank, and PreCo, we show that our framework uncovers systematic weaknesses that remain obscured by aggregate metrics. Furthermore, we show that these diagnostics can be used to design targeted, low-cost data augmentation strategies, achieving measurable out-of-domain improvements.

Anthology ID:: 2026.acl-long.2126
Volume:: Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:: ACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 45854–45872
Language:
URL:: https://aclanthology.org/2026.acl-long.2126/
DOI:
Bibkey:
Cite (ACL):: Bruno Gatti, Giuliano Martinelli, and Roberto Navigli. 2026. Interpretable Coreference Resolution Evaluation Using Explicit Semantics. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 45854–45872, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: Interpretable Coreference Resolution Evaluation Using Explicit Semantics (Gatti et al., ACL 2026)
Copy Citation:
PDF:: https://aclanthology.org/2026.acl-long.2126.pdf
Checklist:: 2026.acl-long.2126.checklist.pdf

PDF Cite Search Checklist Fix data