Anaphora Resolution with the ARRAU Corpus

Massimo Poesio, Yulia Grishina, Varada Kolhatkar, Nafise Moosavi, Ina Roesiger, Adam Roussel, Fabian Simonjetz, Alexandra Uma, Olga Uryupina, Juntao Yu, Heike Zinsmeister


Abstract
The ARRAU corpus is an anaphorically annotated corpus of English providing rich linguistic information about anaphora resolution. The most distinctive feature of the corpus is the annotation of a wide range of anaphoric relations, including bridging references and discourse deixis in addition to identity (coreference). Other distinctive features include treating all NPs as markables, including non-referring NPs, and the annotation of a variety of morphosyntactic and semantic mention and entity attributes, including the genericity status of the entities referred to by markables. The corpus, however, has not been extensively used for anaphora resolution research so far. In this paper, we discuss three datasets extracted from the ARRAU corpus to support the three subtasks of the CRAC 2018 Shared Task (identity anaphora resolution over ARRAU-style markables, bridging reference resolution, and discourse deixis); the evaluation scripts assessing system performance on those datasets; and preliminary results on these three tasks that may serve as baselines for subsequent research on these phenomena.
Anthology ID:
W18-0702
Volume:
Proceedings of the First Workshop on Computational Models of Reference, Anaphora and Coreference
Month:
June
Year:
2018
Address:
New Orleans, Louisiana
Editors:
Massimo Poesio, Vincent Ng, Maciej Ogrodniczuk
Venue:
CRAC
Publisher:
Association for Computational Linguistics
Pages:
11–22
URL:
https://aclanthology.org/W18-0702
DOI:
10.18653/v1/W18-0702
Cite (ACL):
Massimo Poesio, Yulia Grishina, Varada Kolhatkar, Nafise Moosavi, Ina Roesiger, Adam Roussel, Fabian Simonjetz, Alexandra Uma, Olga Uryupina, Juntao Yu, and Heike Zinsmeister. 2018. Anaphora Resolution with the ARRAU Corpus. In Proceedings of the First Workshop on Computational Models of Reference, Anaphora and Coreference, pages 11–22, New Orleans, Louisiana. Association for Computational Linguistics.
Cite (Informal):
Anaphora Resolution with the ARRAU Corpus (Poesio et al., CRAC 2018)
PDF:
https://aclanthology.org/W18-0702.pdf
Data
Penn Treebank