Metrical Annotation of a Large Corpus of Spanish Sonnets: Representation, Scansion and Evaluation

Borja Navarro, María Ribes Lafoz, Noelia Sánchez


Abstract
To analyze metrical and semantic aspects of Spanish poetry with computational techniques, we have developed a large corpus annotated with metrical information. In this paper we present and discuss the development of this corpus: the formal representation of metrical patterns, the semi-automatic annotation process based on a new automatic scansion system, the main annotation problems, and the evaluation, in which an inter-annotator agreement of 96% was obtained. The corpus is open and available.
Anthology ID: L16-1691
Volume: Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16)
Month: May
Year: 2016
Address: Portorož, Slovenia
Editors: Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Sara Goggi, Marko Grobelnik, Bente Maegaard, Joseph Mariani, Helene Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue: LREC
Publisher: European Language Resources Association (ELRA)
Pages: 4360–4364
URL: https://aclanthology.org/L16-1691
Cite (ACL): Borja Navarro, María Ribes Lafoz, and Noelia Sánchez. 2016. Metrical Annotation of a Large Corpus of Spanish Sonnets: Representation, Scansion and Evaluation. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16), pages 4360–4364, Portorož, Slovenia. European Language Resources Association (ELRA).
Cite (Informal): Metrical Annotation of a Large Corpus of Spanish Sonnets: Representation, Scansion and Evaluation (Navarro et al., LREC 2016)
PDF: https://aclanthology.org/L16-1691.pdf