Natalia Fedorova
2026
JEEM: Vision-Language Understanding in Four Arabic Dialects
Karima Kadaoui | Hanin Atwany | Hamdan Al-Ali | Abdelrahman Mohamed | Ali Mekky | Sergei Tilga | Natalia Fedorova | Ekaterina Artemova | Hanan Aldarmaki | Yova Kementchedjhieva
Findings of the Association for Computational Linguistics: EACL 2026
Karima Kadaoui | Hanin Atwany | Hamdan Al-Ali | Abdelrahman Mohamed | Ali Mekky | Sergei Tilga | Natalia Fedorova | Ekaterina Artemova | Hanan Aldarmaki | Yova Kementchedjhieva
Findings of the Association for Computational Linguistics: EACL 2026
We introduce JEEM, a benchmark designed to evaluate Vision-Language Models (VLMs) on visual understanding across four Arabic-speaking countries: Jordan, The Emirates, Egypt, and Morocco. JEEM includes the tasks of image captioning and visual question answering, and features culturally rich and regionally diverse content. This dataset aims to assess the ability of VLMs to generalize across dialects and accurately interpret cultural elements in visual contexts. In an evaluation of five prominent open-source Arabic VLMs and GPT-4o, we find that the Arabic VLMs consistently underperform, struggling with both visual understanding and dialect-specific generation. While GPT-4o ranks best in this comparison, the model’s linguistic competence varies across dialects, and its visual understanding capabilities lag behind. This underscores the need for more inclusive models and the value of culturally-diverse evaluation paradigms.
2024
LLMs Simulate Big5 Personality Traits: Further Evidence
Aleksandra Sorokovikova | Sharwin Rezagholi | Natalia Fedorova | Ivan P. Yamshchikov
Proceedings of the 1st Workshop on Personalization of Generative AI Systems (PERSONALIZE 2024)
Aleksandra Sorokovikova | Sharwin Rezagholi | Natalia Fedorova | Ivan P. Yamshchikov
Proceedings of the 1st Workshop on Personalization of Generative AI Systems (PERSONALIZE 2024)
An empirical investigation into the simulation of the Big5 personality traits by large language models (LLMs), namely Llama-2, GPT-4, and Mixtral, is presented. We analyze the personality traits simulated by these models and their stability. This contributes to the broader understanding of the capabilities of LLMs to simulate personality traits and the respective implications for personalized human-computer interaction.
2022
Findings of the WMT’22 Shared Task on Large-Scale Machine Translation Evaluation for African Languages
David Ifeoluwa Adelani | Md Mahfuz Ibn Alam | Antonios Anastasopoulos | Akshita Bhagia | Marta R. Costa-jussà | Jesse Dodge | Fahim Faisal | Christian Federmann | Natalia Fedorova | Francisco Guzmán | Sergey Koshelev | Jean Maillard | Vukosi Marivate | Jonathan Mbuya | Alexandre Mourachko | Safiyyah Saleem | Holger Schwenk | Guillaume Wenzek
Proceedings of the Seventh Conference on Machine Translation (WMT)
David Ifeoluwa Adelani | Md Mahfuz Ibn Alam | Antonios Anastasopoulos | Akshita Bhagia | Marta R. Costa-jussà | Jesse Dodge | Fahim Faisal | Christian Federmann | Natalia Fedorova | Francisco Guzmán | Sergey Koshelev | Jean Maillard | Vukosi Marivate | Jonathan Mbuya | Alexandre Mourachko | Safiyyah Saleem | Holger Schwenk | Guillaume Wenzek
Proceedings of the Seventh Conference on Machine Translation (WMT)
We present the results of the WMT’22 SharedTask on Large-Scale Machine Translation Evaluation for African Languages. The shared taskincluded both a data and a systems track, alongwith additional innovations, such as a focus onAfrican languages and extensive human evaluation of submitted systems. We received 14system submissions from 8 teams, as well as6 data track contributions. We report a largeprogress in the quality of translation for Africanlanguages since the last iteration of this sharedtask: there is an increase of about 7.5 BLEUpoints across 72 language pairs, and the average BLEU scores went from 15.09 to 22.60.
Search
Fix author
Co-authors
- David Ifeoluwa Adelani 1
- Hamdan Al-Ali 1
- Md Mahfuz Ibn Alam 1
- Hanan Aldarmaki 1
- Antonios Anastasopoulos 1
- Ekaterina Artemova 1
- Hanin Atwany 1
- Akshita Bhagia 1
- Marta R. Costa-jussà 1
- Jesse Dodge 1
- Fahim Faisal 1
- Christian Federmann 1
- Francisco Guzmán 1
- Karima Kadaoui 1
- Yova Kementchedjhieva 1
- Sergey Koshelev 1
- Jean Maillard 1
- Vukosi Marivate 1
- Jonathan Mbuya 1
- Ali Mekky 1
- Abdelrahman Mohamed 1
- Alexandre Mourachko 1
- Sharwin Rezagholi 1
- Safiyyah Saleem 1
- Holger Schwenk 1
- Aleksandra Sorokovikova 1
- Sergei Tilga 1
- Guillaume Wenzek 1
- Ivan P. Yamshchikov 1