PARSEME Meets Universal Dependencies: Getting on the Same Page in Representing Multiword Expressions

Agata Savary, Sara Stymne, Verginica Barbu Mititelu, Nathan Schneider, Carlos Ramisch, Joakim Nivre


Abstract
Multiword expressions (MWEs) are challenging and pervasive phenomena whose idiosyncratic properties show notably at the levels of lexicon, morphology, and syntax. Thus, they should best be annotated jointly with morphosyntax. We discuss two multilingual initiatives, Universal Dependencies and PARSEME, addressing these annotation layers in cross-lingually unified ways. We compare the annotation principles of these initiatives with respect to MWEs, and we put forward a roadmap towards their gradual unification. The expected outcomes are more consistent treebanking and higher universality in modeling idiosyncrasy.
Anthology ID:
2023.nejlt-1.2
Volume:
Northern European Journal of Language Technology, Volume 9
Month:
Year:
2023
Address:
Linköping, Sweden
Editor:
Leon Derczynski
Venue:
NEJLT
SIG:
Publisher:
Linköping University Electronic Press
Note:
Pages:
Language:
URL:
https://aclanthology.org/2023.nejlt-1.2
DOI:
https://doi.org/10.3384/nejlt.2000-1533.2023.4453
Bibkey:
Cite (ACL):
Agata Savary, Sara Stymne, Verginica Barbu Mititelu, Nathan Schneider, Carlos Ramisch, and Joakim Nivre. 2023. PARSEME Meets Universal Dependencies: Getting on the Same Page in Representing Multiword Expressions. In Northern European Journal of Language Technology, Volume 9, Linköping, Sweden. Linköping University Electronic Press.
Cite (Informal):
PARSEME Meets Universal Dependencies: Getting on the Same Page in Representing Multiword Expressions (Savary et al., NEJLT 2023)
Copy Citation:
PDF:
https://aclanthology.org/2023.nejlt-1.2.pdf