A review of Spanish corpora annotated with negation

Salud María Jiménez-Zafra, Roser Morante, Maite Martin, L. Alfonso Ureña-López


Abstract
The availability of corpora annotated with negation information is essential to develop negation processing systems in any language. However, there is a lack of these corpora even for languages like English, and when there are corpora available they are small and the annotations are not always compatible across corpora. In this paper we review the existing corpora annotated with negation in Spanish with the purpose of first, gathering the information to make it available for other researchers and, second, analyzing how compatible are the corpora and how has the linguistic phenomenon been addressed. Our final aim is to develop a supervised negation processing system for Spanish, for which we need training and test data. Our analysis shows that it will not be possible to merge the small corpora existing for Spanish due to lack of compatibility in the annotations.
Anthology ID:
C18-1078
Volume:
Proceedings of the 27th International Conference on Computational Linguistics
Month:
August
Year:
2018
Address:
Santa Fe, New Mexico, USA
Editors:
Emily M. Bender, Leon Derczynski, Pierre Isabelle
Venue:
COLING
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
915–924
Language:
URL:
https://aclanthology.org/C18-1078
DOI:
Bibkey:
Cite (ACL):
Salud María Jiménez-Zafra, Roser Morante, Maite Martin, and L. Alfonso Ureña-López. 2018. A review of Spanish corpora annotated with negation. In Proceedings of the 27th International Conference on Computational Linguistics, pages 915–924, Santa Fe, New Mexico, USA. Association for Computational Linguistics.
Cite (Informal):
A review of Spanish corpora annotated with negation (Jiménez-Zafra et al., COLING 2018)
Copy Citation:
PDF:
https://aclanthology.org/C18-1078.pdf
Code
 sjzafra/spanish_negation_corpora