DiHuTra: a Parallel Corpus to Analyse Differences between Human Translations

Ekaterina Lapshinova-Koltunski, Maja Popović, Maarit Koponen


Abstract
This project aimed to design a corpus of parallel human translations (HTs) of the same source texts by professionals and students. The resulting corpus consists of English news and reviews source texts, their translations into Russian and Croatian, and translations of the reviews into Finnish. The corpus will be valuable for both studying variation in translation and evaluating machine translation (MT) systems.
Anthology ID:
2022.eamt-1.58
Volume:
Proceedings of the 23rd Annual Conference of the European Association for Machine Translation
Month:
June
Year:
2022
Address:
Ghent, Belgium
Editors:
Helena Moniz, Lieve Macken, Andrew Rufener, Loïc Barrault, Marta R. Costa-jussà, Christophe Declercq, Maarit Koponen, Ellie Kemp, Spyridon Pilos, Mikel L. Forcada, Carolina Scarton, Joachim Van den Bogaert, Joke Daems, Arda Tezcan, Bram Vanroy, Margot Fonteyne
Venue:
EAMT
SIG:
Publisher:
European Association for Machine Translation
Note:
Pages:
337–338
Language:
URL:
https://aclanthology.org/2022.eamt-1.58
DOI:
Bibkey:
Cite (ACL):
Ekaterina Lapshinova-Koltunski, Maja Popović, and Maarit Koponen. 2022. DiHuTra: a Parallel Corpus to Analyse Differences between Human Translations. In Proceedings of the 23rd Annual Conference of the European Association for Machine Translation, pages 337–338, Ghent, Belgium. European Association for Machine Translation.
Cite (Informal):
DiHuTra: a Parallel Corpus to Analyse Differences between Human Translations (Lapshinova-Koltunski et al., EAMT 2022)
Copy Citation:
PDF:
https://aclanthology.org/2022.eamt-1.58.pdf