Driss Namly
2026
QAMAR: A New Fully Verified and Accurate Quranic Arabic Morphological Analysis Resource.
Sara Faqihi | Karim Bouzoubaa | Rachida Tajmout | Driss Namly
Proceedings of the 2nd Workshop on NLP for Languages Using Arabic Script
Sara Faqihi | Karim Bouzoubaa | Rachida Tajmout | Driss Namly
Proceedings of the 2nd Workshop on NLP for Languages Using Arabic Script
Several Quranic morphological corpora have been developed to support Arabic linguistic analysis and NLP applications, yet they often lack full coverage, consistency, or manual verification. We present QAMAR, a morphologically oriented, multi-task corpus derived from the Qur’an. This comprehensive, manually verified resource provides a detailed linguistic layer for every Quranic word, including the Modern Standard Arabic (MSA) equivalent, the stem, the lemma, the root, and the part of speech (POS). QAMAR supports multiple NLP tasks, such as normalization, lemmatization, root extraction, and POS tagging, and serves as a gold-standard reference for Quranic and Arabic NLP research, including corpus-to-corpus evaluation and morphological analyzer benchmarking. The paper details QAMAR’s annotation framework, verification process, and resource structure, and reports comparative analyses with existing Quranic morphological resources and outputs produced by current large language models (LLMs).
Murabaa: A comprehensive Resource Platform for Arabic Morphology
Karim Bouzoubaa | Driss Namly | Hamid Jihad | Rachida Tajmout | Jamal Ezzouaine | Hakima Khamar
Proceedings of the 2nd Workshop on NLP for Languages Using Arabic Script
Karim Bouzoubaa | Driss Namly | Hamid Jihad | Rachida Tajmout | Jamal Ezzouaine | Hakima Khamar
Proceedings of the 2nd Workshop on NLP for Languages Using Arabic Script
Arabic language faces technical and cultural challenges, including a lack of high-quality resources and the prevalence of regional dialects, which hinders the development of effective language processing systems. Therefore, the "Murabaa" platform was developed to transform Arabic linguistic knowledge into integrated digital resources. The platform aims to provide accurate digital content and promote the use of Arabic in various fields to bridge the gap between tradition and modernity by offering integrated linguistic resources for developing advanced research tools. The platform provides eight accurate dictionaries in the form of a website and a web application, contributing to the digitization of knowledge and its representation within the framework of standard lexical markup. In this study, we also conduct a quantitative comparison of the resources against similar ones to assess the quality of the linguistic knowledge they provide.
2021
A description and demonstration of SAFAR framework
Karim Bouzoubaa | Younes Jaafar | Driss Namly | Ridouane Tachicart | Rachida Tajmout | Hakima Khamar | Hamid Jaafar | Lhoussain Aouragh | Abdellah Yousfi
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: System Demonstrations
Karim Bouzoubaa | Younes Jaafar | Driss Namly | Ridouane Tachicart | Rachida Tajmout | Hakima Khamar | Hamid Jaafar | Lhoussain Aouragh | Abdellah Yousfi
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: System Demonstrations
Several tools and resources have been developed to deal with Arabic NLP. However, a homogenous and flexible Arabic environment that gathers these components is rarely available. In this perspective, we introduce SAFAR which is a monolingual framework developed in accordance with software engineering requirements and dedicated to Arabic language, especially, the modern standard Arabic and Moroccan dialect. After one decade of integration and development, SAFAR possesses today more than 50 tools and resources that can be exploited either using its API or using its web interface.