Data Analytics Meet Machine Translation

Allen Che, Martin Xiao


Abstract
Machine translation becomes a critical piece of localization industry. With all kinds of different data, how to monitor the machine translation quality in your localized content? How to build the quality analytics framework? This paper describes a process starting from collecting the daily operation data then cleaning the data and building the analytics framework to get the insight into the data. Finally we’re going to share how to build the data collecting matrix, and the script to clean up the data, then run the analytics with an automation script. In the last, we would share the different visualized reports, such as Box Polit, Standard Deviation, Mean, MT touchpoint and golden ratio reports.
Anthology ID:
2022.amta-upg.11
Volume:
Proceedings of the 15th Biennial Conference of the Association for Machine Translation in the Americas (Volume 2: Users and Providers Track and Government Track)
Month:
September
Year:
2022
Address:
Orlando, USA
Editors:
Janice Campbell, Stephen Larocca, Jay Marciano, Konstantin Savenkov, Alex Yanishevsky
Venue:
AMTA
SIG:
Publisher:
Association for Machine Translation in the Americas
Note:
Pages:
144–158
Language:
URL:
https://aclanthology.org/2022.amta-upg.11
DOI:
Bibkey:
Cite (ACL):
Allen Che and Martin Xiao. 2022. Data Analytics Meet Machine Translation. In Proceedings of the 15th Biennial Conference of the Association for Machine Translation in the Americas (Volume 2: Users and Providers Track and Government Track), pages 144–158, Orlando, USA. Association for Machine Translation in the Americas.
Cite (Informal):
Data Analytics Meet Machine Translation (Che & Xiao, AMTA 2022)
Copy Citation:
Presentation:
 2022.amta-upg.11.Presentation.pdf