Approaching Reflex Predictions as a Classification Problem Using Extended Phonological Alignments

Tiago Tresoldi


Abstract
This work describes an implementation of the “extended alignment” model for cognate reflex prediction submitted to the “SIGTYP 2022 Shared Task on the Prediction of Cognate Reflexes”. Similarly to List et al. (2022a), the technique involves an automatic extension of sequence alignments with multilayered vectors that encode informational tiers on both site-specific traits, such as sound classes and distinctive features, and contextual and suprasegmental ones, conveyed by cross-site referrals and replication. The method allows the problem of cognate reflex prediction to be generalized as a classification problem, with models trained using a parallel corpus of cognate sets. A model using random forests is trained and evaluated on the shared task for reflex prediction, and the experimental results are presented and discussed along with some differences from other implementations.
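The idea of extending an alignment with tiers can be illustrated with a minimal sketch. It is not the submitted implementation: the function name `extend_alignment`, the toy `SOUND_CLASS` map, the language labels, and the feature names are all assumptions made for illustration. Each alignment site becomes one training row whose features combine site-specific tiers (the segment and its sound class) with contextual tiers (the left and right neighbours, referenced from other sites), and whose label is the segment attested in the target language.

```python
from typing import Dict, List, Tuple

# Toy sound-class map (hypothetical; real sound-class systems are far richer).
SOUND_CLASS = {
    "p": "P", "b": "P", "f": "P",
    "t": "T", "d": "T",
    "k": "K", "g": "K",
    "a": "V", "e": "V", "i": "V", "o": "V", "u": "V",
    "-": "-",  # alignment gap
}

Row = Tuple[Dict[str, object], str]


def extend_alignment(
    alignment: Dict[str, List[str]], target: str, order: int = 1
) -> List[Row]:
    """Turn one aligned cognate set into per-site (features, label) rows."""
    if len({len(seq) for seq in alignment.values()}) != 1:
        raise ValueError("all aligned sequences must have the same length")

    sources = sorted(lang for lang in alignment if lang != target)
    length = len(alignment[target])
    rows: List[Row] = []
    for i in range(length):
        features: Dict[str, object] = {"position": i}
        for lang in sources:
            seq = alignment[lang]
            # Site-specific tiers: the segment itself and its sound class.
            features[f"{lang}_seg"] = seq[i]
            features[f"{lang}_sc"] = SOUND_CLASS.get(seq[i], "?")
            # Contextual tiers: neighbouring sites, "#" marks a word boundary.
            for k in range(1, order + 1):
                features[f"{lang}_L{k}"] = seq[i - k] if i - k >= 0 else "#"
                features[f"{lang}_R{k}"] = seq[i + k] if i + k < length else "#"
        # The label is the reflex attested in the target language at this site.
        rows.append((features, alignment[target][i]))
    return rows


# One invented cognate set, already aligned ("-" is a gap).
alignment = {
    "A": ["p", "a", "t", "-"],
    "B": ["f", "a", "d", "e"],
    "C": ["p", "a", "t", "e"],
}
rows = extend_alignment(alignment, target="C")
```

Rows produced this way from many cognate sets could be one-hot encoded and passed to any off-the-shelf classifier, such as the random forests mentioned above; predicting a reflex then amounts to classifying each site of a new alignment in turn.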
Anthology ID:
2022.sigtyp-1.11
Volume:
Proceedings of the 4th Workshop on Research in Computational Linguistic Typology and Multilingual NLP
Month:
July
Year:
2022
Address:
Seattle, Washington
Editors:
Ekaterina Vylomova, Edoardo Ponti, Ryan Cotterell
Venue:
SIGTYP
SIG:
SIGTYP
Publisher:
Association for Computational Linguistics
Pages:
86–93
URL:
https://aclanthology.org/2022.sigtyp-1.11
DOI:
10.18653/v1/2022.sigtyp-1.11
Cite (ACL):
Tiago Tresoldi. 2022. Approaching Reflex Predictions as a Classification Problem Using Extended Phonological Alignments. In Proceedings of the 4th Workshop on Research in Computational Linguistic Typology and Multilingual NLP, pages 86–93, Seattle, Washington. Association for Computational Linguistics.
Cite (Informal):
Approaching Reflex Predictions as a Classification Problem Using Extended Phonological Alignments (Tresoldi, SIGTYP 2022)
PDF:
https://aclanthology.org/2022.sigtyp-1.11.pdf
Video:
https://aclanthology.org/2022.sigtyp-1.11.mp4