PCR4ALL: A Comprehensive Evaluation Benchmark for Pronoun Coreference Resolution in English

Xinran Zhao, Hongming Zhang, Yangqiu Song


Abstract
Pronoun Coreference Resolution (PCR) is the task of resolving pronominal expressions to all the mentions they refer to. Correctly resolving pronouns typically involves complex inference over both linguistic knowledge and general world knowledge. Recently, with the help of pre-trained language representation models, the community has made significant progress on various PCR tasks. However, as most existing work focuses on developing PCR models for specific datasets and measuring accuracy or F1 alone, it is still unclear whether current PCR systems are reliable in real applications. Motivated by this, we propose PCR4ALL, a new benchmark and toolbox that evaluates and analyzes the performance of PCR systems from different perspectives (i.e., knowledge source, domain, data size, frequency, relevance, and polarity). Experiments demonstrate notable performance differences when the models are examined from different angles. We hope that PCR4ALL can motivate the community to pay more attention to solving the overall PCR problem and to understanding performance comprehensively. All data and code are available at: https://github.com/HKUST-KnowComp/PCR4ALL.
Anthology ID:
2022.lrec-1.641
Volume:
Proceedings of the Thirteenth Language Resources and Evaluation Conference
Month:
June
Year:
2022
Address:
Marseille, France
Editors:
Nicoletta Calzolari, Frédéric Béchet, Philippe Blache, Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, Hélène Mazo, Jan Odijk, Stelios Piperidis
Venue:
LREC
Publisher:
European Language Resources Association
Pages:
5963–5973
URL:
https://aclanthology.org/2022.lrec-1.641
Cite (ACL):
Xinran Zhao, Hongming Zhang, and Yangqiu Song. 2022. PCR4ALL: A Comprehensive Evaluation Benchmark for Pronoun Coreference Resolution in English. In Proceedings of the Thirteenth Language Resources and Evaluation Conference, pages 5963–5973, Marseille, France. European Language Resources Association.
Cite (Informal):
PCR4ALL: A Comprehensive Evaluation Benchmark for Pronoun Coreference Resolution in English (Zhao et al., LREC 2022)
PDF:
https://aclanthology.org/2022.lrec-1.641.pdf
Code:
hkust-knowcomp/pcr4all
Data:
CoNLL-2012, Definite Pronoun Resolution Dataset, WSC, WinoGrande