qxoRef 1.0: A coreference corpus and mention-pair baseline for coreference resolution in Conchucos Quechua

Elizabeth Pankratz


Abstract
This paper introduces qxoRef 1.0, the first coreference corpus to be developed for a Quechuan language, and describes a baseline mention-pair coreference resolution system developed for this corpus. The evaluation of this system illustrates that earlier steps in the NLP pipeline, in particular syntactic parsing, should be in place before a complex task like coreference resolution can truly succeed. qxoRef 1.0 is freely available under a CC-BY-NC-SA 4.0 license.
Anthology ID: 2021.americasnlp-1.1
Volume: Proceedings of the First Workshop on Natural Language Processing for Indigenous Languages of the Americas
Month: June
Year: 2021
Address: Online
Venues: AmericasNLP | NAACL
Publisher: Association for Computational Linguistics
Pages: 1–9
URL: https://aclanthology.org/2021.americasnlp-1.1
DOI: 10.18653/v1/2021.americasnlp-1.1
Cite (ACL): Elizabeth Pankratz. 2021. qxoRef 1.0: A coreference corpus and mention-pair baseline for coreference resolution in Conchucos Quechua. In Proceedings of the First Workshop on Natural Language Processing for Indigenous Languages of the Americas, pages 1–9, Online. Association for Computational Linguistics.
Cite (Informal): qxoRef 1.0: A coreference corpus and mention-pair baseline for coreference resolution in Conchucos Quechua (Pankratz, AmericasNLP 2021)
PDF: https://aclanthology.org/2021.americasnlp-1.1.pdf
Code: epankratz/qxoref