Deep Active Learning for Morphophonological Processing

Seyed Morteza Mirbostani; Yasaman Boreshban; Salam Khalifa; SeyedAbolghasem Mirroshandel; Owen Rambow

doi:10.18653/v1/2023.acl-short.69

Abstract

Building a system for morphological processing is challenging for morphologically complex languages such as Arabic. Although some deep-learning-based models achieve strong results, they rely on large amounts of annotated data. Building such datasets, especially for some of the lower-resource Arabic dialects, is difficult, time-consuming, and expensive. In addition, some of the annotated data carries little useful information for training machine learning models. Active learning strategies allow the learning algorithm to select the most informative samples for annotation. Little research has focused on applying active learning to morphological inflection and morphophonological processing. In this paper, we propose a deep active learning method for this task. Our experiments on Egyptian Arabic show that with only about 30% of the annotated data, we achieve the same results as the state-of-the-art model trained on the whole dataset.
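
As an illustration of what "selecting the most informative samples" can mean, the sketch below ranks unlabeled items by the entropy of a model's predicted distribution and picks the most uncertain ones for annotation. This is a generic uncertainty-sampling example, not necessarily the selection criterion used in the paper; the function names, the probability values, and the batch size k are all assumptions made for illustration.

```python
import math


def entropy(probs):
    """Shannon entropy (in nats) of one predicted probability distribution."""
    return -sum(p * math.log(p) for p in probs if p > 0)


def select_for_annotation(pool_probs, k):
    """Return indices of the k pool items with the highest predictive entropy."""
    ranked = sorted(
        range(len(pool_probs)),
        key=lambda i: entropy(pool_probs[i]),
        reverse=True,
    )
    return ranked[:k]


# Hypothetical model outputs for four unlabeled items over three classes.
pool_probs = [
    [0.98, 0.01, 0.01],  # confident prediction, low entropy
    [0.40, 0.35, 0.25],  # uncertain
    [0.70, 0.20, 0.10],  # moderately confident
    [0.34, 0.33, 0.33],  # near-uniform, highest entropy
]

# Items chosen to be sent to an annotator in this round.
selected = select_for_annotation(pool_probs, k=2)
print(selected)
```

In a full active learning loop, the selected items would be labeled, added to the training set, and the model retrained before the next selection round.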

Anthology ID: 2023.acl-short.69
Volume: Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)
Month: July
Year: 2023
Address: Toronto, Canada
Editors: Anna Rogers, Jordan Boyd-Graber, Naoaki Okazaki
Venue: ACL
Publisher: Association for Computational Linguistics
Pages: 793–803
URL: https://aclanthology.org/2023.acl-short.69
DOI: 10.18653/v1/2023.acl-short.69
Cite (ACL): Seyed Morteza Mirbostani, Yasaman Boreshban, Salam Khalifa, SeyedAbolghasem Mirroshandel, and Owen Rambow. 2023. Deep Active Learning for Morphophonological Processing. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), pages 793–803, Toronto, Canada. Association for Computational Linguistics.
Cite (Informal): Deep Active Learning for Morphophonological Processing (Mirbostani et al., ACL 2023)
PDF: https://aclanthology.org/2023.acl-short.69.pdf
Video: https://aclanthology.org/2023.acl-short.69.mp4
