A Manual Evaluation Method of Neural MT for Indigenous Languages

Linda Wiechetek, Flammie Pirinen, Per Kummervold


Abstract
Indigenous language expertise is not encoded in written text in the same way as it is for languages with a long literary tradition. On the contrary, in many cases it is mostly preserved orally. Therefore, evaluating neural MT systems solely with an algorithm that learns from written texts is not adequate to measure the quality of a system used by the language community. Extensive use of tools based on a large amount of non-native language can even contribute to language change in a way that the language community does not desire. It can also pollute the internet with automatically created texts that outweigh native texts. We propose a manual evaluation method that focuses on flow and content separately; in addition, we use existing rule-based NLP to evaluate other factors such as spelling, grammar and grammatical richness. Our main conclusion is that the language expertise of a native speaker is necessary to properly evaluate a given system. We test the method by manually evaluating two neural MT tools for an indigenous low-resource language, in an experiment on two different neural translations to and from North Sámi, an indigenous language of Northern Europe.
Anthology ID:
2023.humeval-1.1
Volume:
Proceedings of the 3rd Workshop on Human Evaluation of NLP Systems
Month:
September
Year:
2023
Address:
Varna, Bulgaria
Editors:
Anya Belz, Maja Popović, Ehud Reiter, Craig Thomson, João Sedoc
Venues:
HumEval | WS
Publisher:
INCOMA Ltd., Shoumen, Bulgaria
Pages:
1–10
URL:
https://aclanthology.org/2023.humeval-1.1
Cite (ACL):
Linda Wiechetek, Flammie Pirinen, and Per Kummervold. 2023. A Manual Evaluation Method of Neural MT for Indigenous Languages. In Proceedings of the 3rd Workshop on Human Evaluation of NLP Systems, pages 1–10, Varna, Bulgaria. INCOMA Ltd., Shoumen, Bulgaria.
Cite (Informal):
A Manual Evaluation Method of Neural MT for Indigenous Languages (Wiechetek et al., HumEval-WS 2023)
PDF:
https://aclanthology.org/2023.humeval-1.1.pdf