Representation of Yine [Arawak] Morphology by Finite State Transducer Formalism

Adriano Ingunza Torres, John Miller, Arturo Oncevay, Roberto Zariquiey Biondi


Abstract
We represent the complexity of Yine (Arawak) morphology with a finite state transducer (FST) based morphological analyzer. Yine is a low-resource indigenous polysynthetic Peruvian language spoken by approximately 3,000 people and is classified as ‘definitely endangered’ by UNESCO. We review Yine morphology focusing on morphophonology, possessive constructions and verbal predicates. Then we develop FSTs to model these components proposing techniques to solve challenging problems such as complex patterns of incorporating open and closed category arguments. This is a work in progress and we still have more to do in the development and verification of our analyzer. Our analyzer will serve both as a tool to better document the Yine language and as a component of natural language processing (NLP) applications such as spell checking and correction.
Anthology ID:
2021.americasnlp-1.11
Volume:
Proceedings of the First Workshop on Natural Language Processing for Indigenous Languages of the Americas
Month:
June
Year:
2021
Address:
Online
Editors:
Manuel Mager, Arturo Oncevay, Annette Rios, Ivan Vladimir Meza Ruiz, Alexis Palmer, Graham Neubig, Katharina Kann
Venue:
AmericasNLP
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
102–112
Language:
URL:
https://aclanthology.org/2021.americasnlp-1.11
DOI:
10.18653/v1/2021.americasnlp-1.11
Bibkey:
Cite (ACL):
Adriano Ingunza Torres, John Miller, Arturo Oncevay, and Roberto Zariquiey Biondi. 2021. Representation of Yine [Arawak] Morphology by Finite State Transducer Formalism. In Proceedings of the First Workshop on Natural Language Processing for Indigenous Languages of the Americas, pages 102–112, Online. Association for Computational Linguistics.
Cite (Informal):
Representation of Yine [Arawak] Morphology by Finite State Transducer Formalism (Ingunza Torres et al., AmericasNLP 2021)
Copy Citation:
PDF:
https://aclanthology.org/2021.americasnlp-1.11.pdf
Optional supplementary code:
 2021.americasnlp-1.11.OptionalSupplementaryCode.zip