2025
pdf
bib
abs
Bilingual resources for Moroccan Sign Language Generation and Standard Arabic Skills Improvement of Deaf Children
Abdelhadi Soudi
|
Corinne Vinopol
|
Kristof Van Laerhoven
Proceedings of the 18th Workshop on Building and Using Comparable Corpora (BUCC)
This paper presents a set of bilingual Standard Arabic (SA)-Moroccan Sign Language (MSL) tools and resources to improve Moroccan Deaf children’s SA skills. An MSL Generator based on rule-based machine translation (MT) is described that enables users and educators of Deaf children, in particular, to enter Arabic text and generate its corresponding MSL translation in both graphic and video format. The generated graphics can be printed and imported into an Arabic reading passage. We have also developed MSL Clip and Create software that includes a bilingual database of 3,000 MSL signs and SA words, a Publisher for the incorporation of MSL graphic support into SA reading passages, and six Templates that create customized bilingual crossword puzzles, word searches, Bingo cards, matching games, flashcards, and fingerspelling scrambles. A crowdsourcing platform for MSL data collection is also described. A major social benefit of the development of these resources is in relation to equity and the status of deaf people in Moroccan society. More appropriate resources for the bilingual education of Deaf children (in MSL and SA) will lead to improved quality of educational services.
2024
pdf
bib
Exploring the Potential of Large Language Models in Adaptive Machine Translation for Generic Text and Subtitles
Abdelhadi Soudi
|
Mohamed Hannani
|
Kristof Van Laerhoven
|
Eleftherios Avramidis
Proceedings of the 17th Workshop on Building and Using Comparable Corpora (BUCC) @ LREC-COLING 2024
pdf
bib
abs
Assessing the Performance of ChatGPT-4, Fine-tuned BERT and Traditional ML Models on Moroccan Arabic Sentiment Analysis
Mohamed Hannani
|
Abdelhadi Soudi
|
Kristof Van Laerhoven
Proceedings of the 4th International Conference on Natural Language Processing for Digital Humanities
Large Language Models (LLMs) have demonstrated impressive capabilities in various natural language processing tasks across different languages. However, their performance in low-resource languages and dialects, such as Moroccan Arabic (MA), requires further investigation. This study evaluates the performance of ChatGPT-4, different fine-tuned BERT models, FastText as text representation, and traditional machine learning models on MA sentiment analysis. Experiments were done on two open source MA datasets: an X(Twitter) Moroccan Arabic corpus (MAC) and a Moroccan Arabic YouTube corpus (MYC) datasets to assess their capabilities on sentiment text classification. We compare the performance of fully fine-tuned and pre-trained Arabic BERT-based models with ChatGPT-4 in zero-shot settings.
2006
pdf
bib
abs
IMORPHĒ: An Inheritance and Equivalence Based Morphology Description Compiler
Violetta Cavalli-Sforza
|
Abdelhadi Soudi
Proceedings of the Fifth International Conference on Language Resources and Evaluation (LREC’06)
IMORPHĒ is a significantly extended version of MORPHE, a morphology description compiler. MORPHEs morphology description language is based on two constructs: 1) a morphological form hierarchy, whose nodes relate and differentiate surface forms in terms of the common and distinguishing inflectional features of lexical items; and 2) transformational rules, attached to leaf nodes of the hierarchy, which generate the surface form of an item from the base form stored in the lexicon. While MORPHEs approach to morphology description is intuitively appealing and was successfully used for generating the morphology of several European languages, its application to Modern Standard Arabic yielded morphological descriptions that were highly complex and redundant. Previous modifications and enhancements attempted to capture more elegantly and concisely different aspects of the complex morphology of Arabic, finding theoretical grounding in Lexeme-Based Morphology. Those extensions are being incorporated in a more flexible and less ad hoc fashion in IMORPHE, which retains the unique features of our previous work but embeds them in an inheritance-based framework in order to achieve even more concise and modular morphology descriptions and greater runtime efficiency, and lays the groundwork for IMORPHE to become an analyzer as well as a generator.
2005
pdf
bib
Memory-Based Morphological Analysis Generation and Part-of-Speech Tagging of Arabic
Erwin Marsi
|
Antal van den Bosch
|
Abdelhadi Soudi
Proceedings of the ACL Workshop on Computational Approaches to Semitic Languages
2004
pdf
bib
Generating an Arabic Full-form Lexicon for Bidirectional Morphology Lookup
Abdelhadi Soudi
|
Andreas Eisele
Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC’04)
pdf
bib
An Emerging Transcontinental Collaborative Research and Education Agenda in Human Language Technologies
Gregory Ernest Monaco
|
Abdelhadi Soudi
Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC’04)
2000
pdf
bib
Arabic Morphology Generation Using a Concatenative Strategy
Violetta Cavalli-Sforza
|
Abdelhadi Soudi
|
Teruko Mitamura
1st Meeting of the North American Chapter of the Association for Computational Linguistics