J. van Genabith


2006

DCU 250 Arabic Dependency Bank: An LFG Gold Standard Resource for the Arabic Penn Treebank
Yafa Al-Raheb | A. Akrout | J. van Genabith
Proceedings of the International Conference on the Challenge of Arabic for NLP/MT

This paper describes the construction of a dependency bank gold standard for Arabic, the DCU 250 Arabic Dependency Bank (DCU 250), based on the Arabic Penn Treebank Corpus (ATB) (Bies and Maamouri, 2003; Maamouri and Bies, 2004) within the theoretical framework of Lexical Functional Grammar (LFG). For parsing and automatically extracting grammatical and lexical resources from treebanks, it is necessary to evaluate against established gold standard resources. Gold standards for various languages have been developed, but to our knowledge, such a resource has not yet been constructed for Arabic. The construction of the DCU 250 marks the first step towards the creation of an automatic LFG f-structure annotation algorithm for the ATB and towards the extraction of Arabic grammatical and lexical resources.