CMU-01 at the SIGMORPHON 2019 Shared Task on Crosslinguality and Context in Morphology

Aditi Chaudhary, Elizabeth Salesky, Gayatri Bhat, David R. Mortensen, Jaime Carbonell, Yulia Tsvetkov


Abstract
This paper presents the submission by the CMU-01 team to the SIGMORPHON 2019 task 2 of Morphological Analysis and Lemmatization in Context. This task requires us to produce the lemma and morpho-syntactic description of each token in a sequence, for 107 treebanks. We approach this task with a hierarchical neural conditional random field (CRF) model which predicts each coarse-grained feature (eg. POS, Case, etc.) independently. However, most treebanks are under-resourced, thus making it challenging to train deep neural models for them. Hence, we propose a multi-lingual transfer training regime where we transfer from multiple related languages that share similar typology.
Anthology ID:
W19-4208
Volume:
Proceedings of the 16th Workshop on Computational Research in Phonetics, Phonology, and Morphology
Month:
August
Year:
2019
Address:
Florence, Italy
Editors:
Garrett Nicolai, Ryan Cotterell
Venue:
ACL
SIG:
SIGMORPHON
Publisher:
Association for Computational Linguistics
Note:
Pages:
57–70
Language:
URL:
https://aclanthology.org/W19-4208
DOI:
10.18653/v1/W19-4208
Bibkey:
Cite (ACL):
Aditi Chaudhary, Elizabeth Salesky, Gayatri Bhat, David R. Mortensen, Jaime Carbonell, and Yulia Tsvetkov. 2019. CMU-01 at the SIGMORPHON 2019 Shared Task on Crosslinguality and Context in Morphology. In Proceedings of the 16th Workshop on Computational Research in Phonetics, Phonology, and Morphology, pages 57–70, Florence, Italy. Association for Computational Linguistics.
Cite (Informal):
CMU-01 at the SIGMORPHON 2019 Shared Task on Crosslinguality and Context in Morphology (Chaudhary et al., ACL 2019)
Copy Citation:
PDF:
https://aclanthology.org/W19-4208.pdf
Code
 Aditi138/MorphologicalAnalysis