@inproceedings{stanislas-rocksane-compaore-etal-2026-neural,
    title = "Neural Machine Translation for {French}--{Moor{\'e}}: Adapting Large Language Models to Low-Resource Languages",
    author = "Stanislas Rocksane Compaore, Walker and
      Ouattara, Maimouna and
      Kafando, Rodrique and
      Bissyand{\'e}, Tegawend{\'e} F. and
      Kabore, Abdoul Kader and
      Sabane, Aminata",
    editor = "Hettiarachchi, Hansi and
      Ranasinghe, Tharindu and
      Plum, Alistair and
      Rayson, Paul and
      Mitkov, Ruslan and
      Gaber, Mohamed and
      Premasiri, Damith and
      Tan, Fiona Anting and
      Uyangodage, Lasitha",
    booktitle = "Proceedings of the Second Workshop on Language Models for Low-Resource Languages ({LoResLM} 2026)",
    month = mar,
    year = "2026",
    address = "Rabat, Morocco",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2026.loreslm-1.53/",
    pages = "615--622",
    isbn = "979-8-89176-377-7",
    internal-note = "NOTE(review): first author exported as family={Stanislas Rocksane COMPAORE}, given={Walker}; the split looks garbled (family is plausibly just Compaore) -- confirm against the published paper before re-splitting",
    abstract = "This work focuses on neural machine translation between French and Moor{\'e}, leveraging the capabilities of Large Language Models (LLMs) in a low-resource language context. Moor{\'e} is a local language widely spoken in Burkina Faso but remains underrepresented in digital resources. Alongside Moor{\'e}, French, now a working language, remains widely used in administration, education, justice, etc. The coexistence of these two languages creates a growing demand for effective translation tools. However, Moor{\'e}, like many low-resource languages, poses significant challenges for machine translation due to the scarcity of parallel corpora and its complex morphology. The main objective of this work is to adapt LLMs for French{--}Moor{\'e} translation. Three pre-trained models were selected: No Language Left Behind (NLLB-200), mBART50, and AfroLM. A corpus of approximately 83,000 validated sentence pairs was compiled from an initial collection of 97,060 pairs through pre-processing, semantic filtering, and human evaluation. Specific adaptations to tokenizers and model architectures were applied to improve translation quality. The results show that the fine-tuned NLLB model outperforms the others, highlighting the importance of native language support. mBART50 achieves comparable performance after fine-tuning, while AfroLM remains less effective. Despite existing limitations, this study demonstrates the potential of fine-tuned LLMs for African low-resource languages."
}
<?xml version="1.0" encoding="UTF-8"?>
<modsCollection xmlns="http://www.loc.gov/mods/v3">
<mods ID="stanislas-rocksane-compaore-etal-2026-neural">
<titleInfo>
<title>Neural Machine Translation for French–Mooré: Adapting Large Language Models to Low-Resource Languages</title>
</titleInfo>
<name type="personal">
<namePart type="given">Walker</namePart>
<namePart type="family">Stanislas Rocksane Compaore</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Maimouna</namePart>
<namePart type="family">Ouattara</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Rodrique</namePart>
<namePart type="family">Kafando</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Tegawendé</namePart>
<namePart type="given">F</namePart>
<namePart type="family">Bissyandé</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Abdoul</namePart>
<namePart type="given">Kader</namePart>
<namePart type="family">Kabore</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Aminata</namePart>
<namePart type="family">Sabane</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<originInfo>
<dateIssued>2026-03</dateIssued>
</originInfo>
<typeOfResource>text</typeOfResource>
<relatedItem type="host">
<titleInfo>
<title>Proceedings of the Second Workshop on Language Models for Low-Resource Languages (LoResLM 2026)</title>
</titleInfo>
<name type="personal">
<namePart type="given">Hansi</namePart>
<namePart type="family">Hettiarachchi</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Tharindu</namePart>
<namePart type="family">Ranasinghe</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Alistair</namePart>
<namePart type="family">Plum</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Paul</namePart>
<namePart type="family">Rayson</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Ruslan</namePart>
<namePart type="family">Mitkov</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Mohamed</namePart>
<namePart type="family">Gaber</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Damith</namePart>
<namePart type="family">Premasiri</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Fiona</namePart>
<namePart type="given">Anting</namePart>
<namePart type="family">Tan</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Lasitha</namePart>
<namePart type="family">Uyangodage</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<originInfo>
<publisher>Association for Computational Linguistics</publisher>
<place>
<placeTerm type="text">Rabat, Morocco</placeTerm>
</place>
</originInfo>
<genre authority="marcgt">conference publication</genre>
<identifier type="isbn">979-8-89176-377-7</identifier>
</relatedItem>
<abstract>This work focuses on neural machine translation between French and Mooré, leveraging the capabilities of Large Language Models (LLMs) in a low-resource language context. Mooré is a local language widely spoken in Burkina Faso but remains underrepresented in digital resources. Alongside Mooré, French, now a working language, remains widely used in administration, education, justice, etc. The coexistence of these two languages creates a growing demand for effective translation tools. However, Mooré, like many low-resource languages, poses significant challenges for machine translation due to the scarcity of parallel corpora and its complex morphology. The main objective of this work is to adapt LLMs for French–Mooré translation. Three pre-trained models were selected: No Language Left Behind (NLLB-200), mBART50, and AfroLM. A corpus of approximately 83,000 validated sentence pairs was compiled from an initial collection of 97,060 pairs through pre-processing, semantic filtering, and human evaluation. Specific adaptations to tokenizers and model architectures were applied to improve translation quality. The results show that the fine-tuned NLLB model outperforms the others, highlighting the importance of native language support. mBART50 achieves comparable performance after fine-tuning, while AfroLM remains less effective. Despite existing limitations, this study demonstrates the potential of fine-tuned LLMs for African low-resource languages.</abstract>
<identifier type="citekey">stanislas-rocksane-compaore-etal-2026-neural</identifier>
<location>
<url>https://aclanthology.org/2026.loreslm-1.53/</url>
</location>
<part>
<date>2026-03</date>
<extent unit="page">
<start>615</start>
<end>622</end>
</extent>
</part>
</mods>
</modsCollection>
%0 Conference Proceedings
%T Neural Machine Translation for French–Mooré: Adapting Large Language Models to Low-Resource Languages
%A Stanislas Rocksane Compaore, Walker
%A Ouattara, Maimouna
%A Kafando, Rodrique
%A Bissyandé, Tegawendé F.
%A Kabore, Abdoul Kader
%A Sabane, Aminata
%Y Hettiarachchi, Hansi
%Y Ranasinghe, Tharindu
%Y Plum, Alistair
%Y Rayson, Paul
%Y Mitkov, Ruslan
%Y Gaber, Mohamed
%Y Premasiri, Damith
%Y Tan, Fiona Anting
%Y Uyangodage, Lasitha
%S Proceedings of the Second Workshop on Language Models for Low-Resource Languages (LoResLM 2026)
%D 2026
%8 March
%I Association for Computational Linguistics
%C Rabat, Morocco
%@ 979-8-89176-377-7
%F stanislas-rocksane-compaore-etal-2026-neural
%X This work focuses on neural machine translation between French and Mooré, leveraging the capabilities of Large Language Models (LLMs) in a low-resource language context. Mooré is a local language widely spoken in Burkina Faso but remains underrepresented in digital resources. Alongside Mooré, French, now a working language, remains widely used in administration, education, justice, etc. The coexistence of these two languages creates a growing demand for effective translation tools. However, Mooré, like many low-resource languages, poses significant challenges for machine translation due to the scarcity of parallel corpora and its complex morphology. The main objective of this work is to adapt LLMs for French–Mooré translation. Three pre-trained models were selected: No Language Left Behind (NLLB-200), mBART50, and AfroLM. A corpus of approximately 83,000 validated sentence pairs was compiled from an initial collection of 97,060 pairs through pre-processing, semantic filtering, and human evaluation. Specific adaptations to tokenizers and model architectures were applied to improve translation quality. The results show that the fine-tuned NLLB model outperforms the others, highlighting the importance of native language support. mBART50 achieves comparable performance after fine-tuning, while AfroLM remains less effective. Despite existing limitations, this study demonstrates the potential of fine-tuned LLMs for African low-resource languages.
%U https://aclanthology.org/2026.loreslm-1.53/
%P 615-622
Markdown (Informal)
[Neural Machine Translation for French–Mooré: Adapting Large Language Models to Low-Resource Languages](https://aclanthology.org/2026.loreslm-1.53/) (Stanislas Rocksane Compaore et al., LoResLM 2026)
ACL