Assessing the Impact of Incremental Error Detection and Correction. A Case Study on the Italian Universal Dependency Treebank

Chiara Alzetta, Felice Dell’Orletta, Simonetta Montemagni, Maria Simi, Giulia Venturi


Abstract
Detection and correction of errors and inconsistencies in “gold treebanks” are becoming more and more central topics of corpus annotation. The paper illustrates a new incremental method for enhancing treebanks, with particular emphasis on the extension of error patterns across different textual genres and registers. Impact and role of corrections have been assessed in a dependency parsing experiment carried out with four different parsers, whose results are promising. For both evaluation datasets, the performance of parsers increases, in terms of the standard LAS and UAS measures and of a more focused measure taking into account only relations involved in error patterns, and at the level of individual dependencies.
Anthology ID:
W18-6001
Volume:
Proceedings of the Second Workshop on Universal Dependencies (UDW 2018)
Month:
November
Year:
2018
Address:
Brussels, Belgium
Editors:
Marie-Catherine de Marneffe, Teresa Lynn, Sebastian Schuster
Venue:
UDW
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
1–7
Language:
URL:
https://aclanthology.org/W18-6001
DOI:
10.18653/v1/W18-6001
Bibkey:
Cite (ACL):
Chiara Alzetta, Felice Dell’Orletta, Simonetta Montemagni, Maria Simi, and Giulia Venturi. 2018. Assessing the Impact of Incremental Error Detection and Correction. A Case Study on the Italian Universal Dependency Treebank. In Proceedings of the Second Workshop on Universal Dependencies (UDW 2018), pages 1–7, Brussels, Belgium. Association for Computational Linguistics.
Cite (Informal):
Assessing the Impact of Incremental Error Detection and Correction. A Case Study on the Italian Universal Dependency Treebank (Alzetta et al., UDW 2018)
Copy Citation:
PDF:
https://aclanthology.org/W18-6001.pdf