Raghad Alateeq
2023
Evaluating ChatGPT and Bard AI on Arabic Sentiment Analysis
Abdulmohsen Al-Thubaity
|
Sakhar Alkhereyf
|
Hanan Murayshid
|
Nouf Alshalawi
|
Maha Omirah
|
Raghad Alateeq
|
Rawabi Almutairi
|
Razan Alsuwailem
|
Manal Alhassoun
|
Imaan Alkhanen
Proceedings of ArabicNLP 2023
Large Language Models (LLMs) such as ChatGPT and Bard AI have gained much attention due to their outstanding performance on a range of NLP tasks. These models have demonstrated remarkable proficiency across various languages without the necessity for full supervision. Nevertheless, their performance in low-resource languages and dialects, like Arabic dialects in comparison to English, remains to be investigated. In this paper, we conduct a comprehensive evaluation of three LLMs for Dialectal Arabic Sentiment Analysis: namely, ChatGPT based on GPT-3.5 and GPT-4, and Bard AI. We use a Saudi dialect Twitter dataset to assess their capability in sentiment text classification and generation. For classification, we compare the performance of fully fine-tuned Arabic BERT-based models with the LLMs in few-shot settings. For data generation, we evaluate the quality of the generated new sentiment samples using human and automatic evaluation methods. The experiments reveal that GPT-4 outperforms GPT-3.5 and Bard AI in sentiment analysis classification, rivaling the top-performing fully supervised BERT-based language model. However, in terms of data generation, compared to manually annotated authentic data, these generative models often fall short in producing high-quality Dialectal Arabic text suitable for sentiment analysis.
Search
Co-authors
- Abdulmohsen Al-Thubaity 1
- Sakhar Alkhereyf 1
- Hanan Murayshid 1
- Nouf Alshalawi 1
- Maha Omirah 1
- show all...