Creation of a corpus with semantic role labels for Hungarian

Attila Novák, László Laki, Borbála Novák, Andrea Dömötör, Noémi Ligeti-Nagy, Ágnes Kalivoda


Abstract
This article presents ongoing research whose immediate goal is to create a Hungarian corpus annotated with semantic role labels, which can be used to train a parser-based system capable of formulating relevant questions about the text it processes. We briefly describe the objectives of our research and our efforts at eliminating errors in the Hungarian Universal Dependencies corpus, which serves as the base of our annotation; at creating a Hungarian verbal argument database annotated with thematic roles; at classifying adjuncts; and at matching verbal argument frames to specific occurrences of verbs and participles in the corpus.
Anthology ID:
W19-4026
Volume:
Proceedings of the 13th Linguistic Annotation Workshop
Month:
August
Year:
2019
Address:
Florence, Italy
Editors:
Annemarie Friedrich, Deniz Zeyrek, Jet Hoek
Venue:
LAW
SIG:
SIGANN
Publisher:
Association for Computational Linguistics
Pages:
220–229
URL:
https://aclanthology.org/W19-4026
DOI:
10.18653/v1/W19-4026
Cite (ACL):
Attila Novák, László Laki, Borbála Novák, Andrea Dömötör, Noémi Ligeti-Nagy, and Ágnes Kalivoda. 2019. Creation of a corpus with semantic role labels for Hungarian. In Proceedings of the 13th Linguistic Annotation Workshop, pages 220–229, Florence, Italy. Association for Computational Linguistics.
Cite (Informal):
Creation of a corpus with semantic role labels for Hungarian (Novák et al., LAW 2019)
PDF:
https://aclanthology.org/W19-4026.pdf
Data
Universal Dependencies