VerbLexPor: a lexical resource with semantic roles for Portuguese

Leonardo Zilio, Maria José Bocorny Finatto, Aline Villavicencio


Abstract
This paper presents VerbLexPor, a lexical resource for Portuguese consisting of sentences annotated with semantic roles. The sentences were extracted from two domains: Cardiology research papers and newspaper articles. Both corpora were analyzed with the PALAVRAS parser and then processed with a subcategorization frame extractor, so that each sentence containing at least one main verb was stored in a database together with its syntactic organization. A linguist annotated the sentences manually using an annotation interface. Both the annotated and the non-annotated data were exported to an XML format and are available for download. The non-annotated data were exported as well because they carry syntactic information from the parser annotation, which could be useful to other researchers. The two corpora were annotated separately, so sentences from the Cardiology corpus and from the newspaper corpus can be accessed independently. The full resource contains more than seven thousand semantically annotated sentences, covering 192 different verbs and more than 15 thousand individual arguments and adjuncts.
Anthology ID:
L16-1422
Volume:
Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16)
Month:
May
Year:
2016
Address:
Portorož, Slovenia
Editors:
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Sara Goggi, Marko Grobelnik, Bente Maegaard, Joseph Mariani, Helene Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
Publisher:
European Language Resources Association (ELRA)
Pages:
2656–2661
URL:
https://aclanthology.org/L16-1422
Cite (ACL):
Leonardo Zilio, Maria José Bocorny Finatto, and Aline Villavicencio. 2016. VerbLexPor: a lexical resource with semantic roles for Portuguese. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16), pages 2656–2661, Portorož, Slovenia. European Language Resources Association (ELRA).
Cite (Informal):
VerbLexPor: a lexical resource with semantic roles for Portuguese (Zilio et al., LREC 2016)
PDF:
https://aclanthology.org/L16-1422.pdf
Data
FrameNet