Is AI the new ”Human evaluator”?

Aneta Sapeta


Abstract
AI has been present in the localization industry for many years, and despite the hype around it, it is still trying to find its place there. Some try to use it as a replacement for the NMT models currently on the market, while others use it as an aid in evaluating NMT output, reducing the human input needed to assess MT quality. In our experience, we still depend on human evaluation for assessment, but how good an evaluator can AI be? Our tests show that evaluating MT quality with AI can be a challenging task, even though we have seen significant progress in recent years: the system must understand the meaning of both the source and the target, judge quality by assessing errors that are more or less visible, and remain unbiased in its assessment. In this presentation, we share our insights on the reliability of AI for MT evaluation and on whether humans can be excluded from the evaluation loop.
Anthology ID:
2024.amta-presentations.4
Volume:
Proceedings of the 16th Conference of the Association for Machine Translation in the Americas (Volume 2: Presentations)
Month:
September
Year:
2024
Address:
Chicago, USA
Editors:
Marianna Martindale, Janice Campbell, Konstantin Savenkov, Shivali Goel
Venue:
AMTA
Publisher:
Association for Machine Translation in the Americas
Pages:
30–44
URL:
https://aclanthology.org/2024.amta-presentations.4
Cite (ACL):
Aneta Sapeta. 2024. Is AI the new ”Human evaluator”?. In Proceedings of the 16th Conference of the Association for Machine Translation in the Americas (Volume 2: Presentations), pages 30–44, Chicago, USA. Association for Machine Translation in the Americas.
Cite (Informal):
Is AI the new ”Human evaluator”? (Sapeta, AMTA 2024)
PDF:
https://aclanthology.org/2024.amta-presentations.4.pdf