Adaptor Grammars for Unsupervised Paradigm Clustering

Kate McCurdy, Sharon Goldwater, Adam Lopez


Abstract
This work describes the Edinburgh submission to the SIGMORPHON 2021 Shared Task 2 on unsupervised morphological paradigm clustering. Given raw text input, the task was to assign each token to a cluster with other tokens from the same paradigm. We use Adaptor Grammar segmentations combined with frequency-based heuristics to predict paradigm clusters. Our system achieved the highest average F1 score across 9 test languages, placing first out of 15 submissions.
Anthology ID:
2021.sigmorphon-1.9
Volume:
Proceedings of the 18th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology
Month:
August
Year:
2021
Address:
Online
Editors:
Garrett Nicolai, Kyle Gorman, Ryan Cotterell
Venue:
SIGMORPHON
SIG:
SIGMORPHON
Publisher:
Association for Computational Linguistics
Note:
Pages:
82–89
Language:
URL:
https://aclanthology.org/2021.sigmorphon-1.9
DOI:
10.18653/v1/2021.sigmorphon-1.9
Bibkey:
Cite (ACL):
Kate McCurdy, Sharon Goldwater, and Adam Lopez. 2021. Adaptor Grammars for Unsupervised Paradigm Clustering. In Proceedings of the 18th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology, pages 82–89, Online. Association for Computational Linguistics.
Cite (Informal):
Adaptor Grammars for Unsupervised Paradigm Clustering (McCurdy et al., SIGMORPHON 2021)
Copy Citation:
PDF:
https://aclanthology.org/2021.sigmorphon-1.9.pdf