2024
pdf
bib
abs
AREEj: Arabic Relation Extraction with Evidence
Osama Rakan Al Mraikhat
|
Hadi Hamoud
|
Fadi A. Zaraket
Proceedings of The Second Arabic Natural Language Processing Conference
Relational entity extraction is key in building knowledge graphs. A relational entity has a source, a tail and atype. In this paper, we consider Arabic text and introduce evidence enrichment which intuitivelyinforms models for better predictions. Relational evidence is an expression in the textthat explains how sources and targets relate. %It also provides hints from which models learn. This paper augments the existing relational extraction dataset with evidence annotation to its 2.9-million Arabic relations.We leverage the augmented dataset to build , a relation extraction with evidence model from Arabic documents. The evidence augmentation model we constructed to complete the dataset achieved .82 F1-score (.93 precision, .73 recall). The target outperformed SOTA mREBEL with .72 F1-score (.78 precision, .66 recall).
pdf
bib
abs
DRU at WojoodNER 2024: A Multi-level Method Approach
Hadi Hamoud
|
Chadi Abou Chakra
|
Nancy Hamdan
|
Osama Rakan Al Mraikhat
|
Doha Albared
|
Fadi A. Zaraket
Proceedings of The Second Arabic Natural Language Processing Conference
In this paper, we present our submission for the WojoodNER 2024 Shared Tasks addressing flat and nested sub-tasks (1, 2). We experiment with three different approaches. We train (i) an Arabic fine-tuned version of BLOOMZ-7b-mt, GEMMA-7b, and AraBERTv2 on multi-label token classifications task; (ii) two AraBERTv2 models, on main types and sub-types respectively; and (iii) one model for main types and four for the four sub-types. Based on the Wojood NER 2024 test set results, the three fine-tuned models performed similarly with AraBERTv2 favored (F1: Flat=.8780 Nested=.9040). The five model approach performed slightly better (F1: Flat=.8782 Nested=.9043).
pdf
bib
abs
DRU at WojoodNER 2024: ICL LLM for Arabic NER
Nancy Hamdan
|
Hadi Hamoud
|
Chadi Abou Chakra
|
Osama Rakan Al Mraikhat
|
Doha Albared
|
Fadi A. Zaraket
Proceedings of The Second Arabic Natural Language Processing Conference
This paper details our submission to the WojoodNER Shared Task 2024, leveraging in-context learning with large language models for Arabic Named Entity Recognition. We utilized the Command R model, to perform fine-grained NER on the Wojood-Fine corpus. Our primary approach achieved an F1 score of 0.737 and a recall of 0.756. Post-processing the generated predictions to correct format inconsistencies resulted in an increased recall of 0.759, and a similar F1 score of 0.735. A multi-level prompting method and aggregation of outputs resulted in a lower F1 score of 0.637. Our results demonstrate the potential of ICL for Arabic NER while highlighting challenges related to LLM output consistency.
2017
pdf
bib
Morphology-based Entity and Relational Entity Extraction Framework for Arabic
Amin Jaber
|
Fadi A. Zaraket
Traitement Automatique des Langues, Volume 58, Numéro 3 : Traitement automatique de l'arabe et des langues apparentées [NLP for Arabic and Related Languages]