Unsupervised Paradigm Clustering Using Transformation Rules

Changbing Yang, Garrett Nicolai, Miikka Silfverberg


Abstract
This paper describes the submission of the CU-UBC team for the SIGMORPHON 2021 Shared Task 2: Unsupervised morphological paradigm clustering. Our system generates paradigms using morphological transformation rules which are discovered from raw data. We experiment with two methods for discovering rules. Our first approach generates prefix and suffix transformations between similar strings. Secondly, we experiment with more general rules which can apply transformations inside the input strings in addition to prefix and suffix transformations. We find that the best overall performance is delivered by prefix and suffix rules but more general transformation rules perform better for languages with templatic morphology and very high morpheme-to-word ratios.
Anthology ID:
2021.sigmorphon-1.11
Volume:
Proceedings of the 18th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology
Month:
August
Year:
2021
Address:
Online
Editors:
Garrett Nicolai, Kyle Gorman, Ryan Cotterell
Venue:
SIGMORPHON
SIG:
SIGMORPHON
Publisher:
Association for Computational Linguistics
Note:
Pages:
98–106
Language:
URL:
https://aclanthology.org/2021.sigmorphon-1.11
DOI:
10.18653/v1/2021.sigmorphon-1.11
Bibkey:
Cite (ACL):
Changbing Yang, Garrett Nicolai, and Miikka Silfverberg. 2021. Unsupervised Paradigm Clustering Using Transformation Rules. In Proceedings of the 18th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology, pages 98–106, Online. Association for Computational Linguistics.
Cite (Informal):
Unsupervised Paradigm Clustering Using Transformation Rules (Yang et al., SIGMORPHON 2021)
Copy Citation:
PDF:
https://aclanthology.org/2021.sigmorphon-1.11.pdf