KU-CST at the SIGMORPHON 2020 Task 2 on Unsupervised Morphological Paradigm Completion

Manex Agirrezabal, Jürgen Wedekind


Abstract
We present a model for the unsupervised discovery of morphological paradigms. The goal of this model is to induce morphological paradigms from the Bible (raw text) and a list of lemmas. We have created a model that splits each lemma into a stem and a suffix, and then we try to create a plausible suffix list by considering lemma pairs. Our model was not able to outperform the official baseline, and there is still room for improvement, but we believe that the ideas presented here are worth considering.
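The stem/suffix idea in the abstract can be illustrated with a small sketch. This is not the authors' implementation: it assumes, purely for illustration, that a lemma pair is split at its longest common prefix and that suffix candidates are ranked by how often they occur. The function names `split_pair` and `candidate_suffixes`, the `min_stem` threshold, and the toy word list are all hypothetical.

```python
import os
from collections import Counter
from itertools import combinations


def split_pair(lemma_a, lemma_b, min_stem=3):
    """Split two lemmas at their longest common prefix (assumed heuristic).

    Returns (stem, suffix_a, suffix_b), or None when the shared prefix
    is shorter than min_stem characters.
    """
    stem = os.path.commonprefix([lemma_a, lemma_b])
    if len(stem) < min_stem:
        return None
    return stem, lemma_a[len(stem):], lemma_b[len(stem):]


def candidate_suffixes(lemmas, min_stem=3):
    """Count the suffixes produced by every lemma pair that shares a stem."""
    counts = Counter()
    for a, b in combinations(sorted(set(lemmas)), 2):
        split = split_pair(a, b, min_stem)
        if split is None:
            continue
        _, suffix_a, suffix_b = split
        counts.update([suffix_a, suffix_b])
    return counts


# Toy input, not taken from the shared task data.
words = ["walker", "walking", "talker", "talking"]
print(candidate_suffixes(words).most_common())
```

Comparing all pairs is quadratic in the number of lemmas, which is acceptable for a toy list but would likely need indexing by prefix for a full lemma list.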
Anthology ID:
2020.sigmorphon-1.11
Volume:
Proceedings of the 17th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology
Month:
July
Year:
2020
Address:
Online
Editors:
Garrett Nicolai, Kyle Gorman, Ryan Cotterell
Venue:
SIGMORPHON
SIG:
SIGMORPHON
Publisher:
Association for Computational Linguistics
Pages:
111–116
URL:
https://aclanthology.org/2020.sigmorphon-1.11
DOI:
10.18653/v1/2020.sigmorphon-1.11
Cite (ACL):
Manex Agirrezabal and Jürgen Wedekind. 2020. KU-CST at the SIGMORPHON 2020 Task 2 on Unsupervised Morphological Paradigm Completion. In Proceedings of the 17th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology, pages 111–116, Online. Association for Computational Linguistics.
Cite (Informal):
KU-CST at the SIGMORPHON 2020 Task 2 on Unsupervised Morphological Paradigm Completion (Agirrezabal & Wedekind, SIGMORPHON 2020)
PDF:
https://aclanthology.org/2020.sigmorphon-1.11.pdf