Sarah Alkuhlani


2014

Large Scale Arabic Error Annotation: Guidelines and Framework
Wajdi Zaghouani | Behrang Mohit | Nizar Habash | Ossama Obeid | Nadi Tomeh | Alla Rozovskaya | Noura Farra | Sarah Alkuhlani | Kemal Oflazer
Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)

We present annotation guidelines and a web-based annotation framework developed as part of an effort to create a manually annotated Arabic corpus of errors and corrections for various text types. Such a corpus will be invaluable for developing Arabic error correction tools, both for training models and as a gold standard for evaluating error correction algorithms. We summarize the guidelines we created. We also describe issues encountered during the training of the annotators, as well as problems that are specific to the Arabic language that arose during the annotation process. Finally, we present the annotation tool that was developed as part of this project, the annotation pipeline, and the quality of the resulting annotations.

2013

Automatic Morphological Enrichment of a Morphologically Underspecified Treebank
Sarah Alkuhlani | Nizar Habash | Ryan Roth
Proceedings of the 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies

2012

Identifying Broken Plurals, Irregular Gender, and Rationality in Arabic Text
Sarah Alkuhlani | Nizar Habash
Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics

2011

A Corpus for Modeling Morpho-Syntactic Agreement in Arabic: Gender, Number and Rationality
Sarah Alkuhlani | Nizar Habash
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies