A Derivational ChainBank for Modern Standard Arabic

Reham Marzouk, Sondos Krouna, Nizar Habash


Abstract
We introduce the new concept of an Arabic Derivational Chain Bank (CHAINBANK) to leverage the relationship between form and meaning in modeling Arabic derivational morphology. We constructed a knowledge graph network of abstract patterns and their derivational relations, and aligned it with the lemmas of the CAMELMORPH morphological analyzer database. This process produced chains of derived words’ lemmas linked to their corresponding lemma bases through derivational relations, encompassing 23,333 derivational connections. The CHAINBANK is publicly available.
Anthology ID:
2025.abjadnlp-1.9
Volume:
Proceedings of the 1st Workshop on NLP for Languages Using Arabic Script
Month:
January
Year:
2025
Address:
Abu Dhabi, UAE
Editor:
Mo El-Haj
Venues:
AbjadNLP | WS
Publisher:
Association for Computational Linguistics
Pages:
78–87
URL:
https://aclanthology.org/2025.abjadnlp-1.9/
Cite (ACL):
Reham Marzouk, Sondos Krouna, and Nizar Habash. 2025. A Derivational ChainBank for Modern Standard Arabic. In Proceedings of the 1st Workshop on NLP for Languages Using Arabic Script, pages 78–87, Abu Dhabi, UAE. Association for Computational Linguistics.
Cite (Informal):
A Derivational ChainBank for Modern Standard Arabic (Marzouk et al., AbjadNLP 2025)
PDF:
https://aclanthology.org/2025.abjadnlp-1.9.pdf