Challenge Test Sets for MT Evaluation

Maja Popović, Sheila Castilho


Abstract
Most of the test sets used for the evaluation of MT systems reflect the frequency distribution of different phenomena found in naturally occurring data (”standard” or ”natural” test sets). However, to better understand particular strengths and weaknesses of MT systems, especially those based on neural networks, it is necessary to apply more focused evaluation procedures. Therefore, another type of test sets (”challenge” test sets, also called ”test suites”) is being increasingly employed in order to highlight points of difficulty which are relevant to model development, training, or using of the given system. This tutorial will be useful for anyone (researchers, developers, users, translators) interested in detailed evaluation and getting a better understanding of machine translation (MT) systems and models. The attendees will learn about the motivation and linguistic background of challenge test sets and a range of testing possibilities applied to the state-of-the-art MT systems, as well as a number of practical aspects and challenges.
Anthology ID:
W19-7602
Volume:
Proceedings of Machine Translation Summit XVII: Tutorial Abstracts
Month:
August
Year:
2019
Address:
Dublin, Ireland
Editor:
Laura Rossi
Venue:
MTSummit
SIG:
Publisher:
European Association for Machine Translation
Note:
Pages:
Language:
URL:
https://aclanthology.org/W19-7602
DOI:
Bibkey:
Cite (ACL):
Maja Popović and Sheila Castilho. 2019. Challenge Test Sets for MT Evaluation. In Proceedings of Machine Translation Summit XVII: Tutorial Abstracts, Dublin, Ireland. European Association for Machine Translation.
Cite (Informal):
Challenge Test Sets for MT Evaluation (Popović & Castilho, MTSummit 2019)
Copy Citation:
Presentation:
 W19-7602.Presentation.pdf