Noelia Sánchez


2016

pdf bib
Metrical Annotation of a Large Corpus of Spanish Sonnets: Representation, Scansion and Evaluation
Borja Navarro | María Ribes Lafoz | Noelia Sánchez
Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16)

In order to analyze metrical and semantics aspects of poetry in Spanish with computational techniques, we have developed a large corpus annotated with metrical information. In this paper we will present and discuss the development of this corpus: the formal representation of metrical patterns, the semi-automatic annotation process based on a new automatic scansion system, the main annotation problems, and the evaluation, in which an inter-annotator agreement of 96% has been obtained. The corpus is open and available.