Language Mixture to Develop Accurate Galician Dependency Parsers: An Exploration of Its Effects

Xabier Irastortza-Urbieta; José M. García Miguel; Marcos Garcia

Language Mixture to Develop Accurate Galician Dependency Parsers: An Exploration of Its Effects

Xabier Irastortza-Urbieta, José M. García-Miguel, Marcos Garcia

Abstract

The development of accurate syntactic parsers remains a challenge for low-resource languages. To overcome it, the literature has proposed leveraging syntactic annotations from typologically related languages. This work investigates the viability and adequacy of this approach for Galician, evaluating the use of annotations from major Romance languages as source data. Our methodology extends beyond standard automatic evaluation to incorporate a detailed error analysis, which precisely quantifies the effects of multilingual training and assesses the practical scalability of the method. The results establish the necessity of embedding models for effective cross-lingual transfer and demonstrate that even languages not particularly close can yield adequate parsers. This work confirms the benefits of cross-lingual data augmentation while delineating its scalability limits. Furthermore, the error analysis identifies specific, typologically conditioned grammatical dependencies that remain persistent challenges for accurate dependency parsing.

Anthology ID:: 2026.vardial-1.5
Volume:: Proceedings of the 13th Workshop on NLP for Similar Languages, Varieties and Dialects
Month:: March
Year:: 2026
Address:: Rabat, Morocco
Venues:: VarDial | WS
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 58–69
Language:
URL:: https://aclanthology.org/2026.vardial-1.5/
DOI:
Bibkey:
Cite (ACL):: Xabier Irastortza-Urbieta, José M. García-Miguel, and Marcos Garcia. 2026. Language Mixture to Develop Accurate Galician Dependency Parsers: An Exploration of Its Effects. In Proceedings of the 13th Workshop on NLP for Similar Languages, Varieties and Dialects, pages 58–69, Rabat, Morocco. Association for Computational Linguistics.
Cite (Informal):: Language Mixture to Develop Accurate Galician Dependency Parsers: An Exploration of Its Effects (Irastortza-Urbieta et al., VarDial 2026)
Copy Citation:
PDF:: https://aclanthology.org/2026.vardial-1.5.pdf

PDF Cite Search Fix data