Interpreting Strategies Annotation in the WAW Corpus

Irina Temnikova, Ahmed Abdelali, Samy Hedaya, Stephan Vogel, Aishah Al Daher


Abstract
With the aim to teach our automatic speech-to-text translation system human interpreting strategies, our first step is to identify which interpreting strategies are most often used in the language pair of our interest (English-Arabic). In this article we run an automatic analysis of a corpus of parallel speeches and their human interpretations, and provide the results of manually annotating the human interpreting strategies in a sample of the corpus. We give a glimpse of the corpus, whose value surpasses the fact that it contains a high number of scientific speeches with their interpretations from English into Arabic, as it also provides rich information about the interpreters. We also discuss the difficulties, which we encountered on our way, as well as our solutions to them: our methodology for manual re-segmentation and alignment of parallel segments, the choice of annotation tool, and the annotation procedure. Our annotation findings explain the previously extracted specific statistical features of the interpreted corpus (compared with a translation one) as well as the quality of interpretation provided by different interpreters.
Anthology ID:
W17-7905
Volume:
Proceedings of the Workshop Human-Informed Translation and Interpreting Technology
Month:
September
Year:
2017
Address:
Varna, Bulgaria
Editors:
Irina Temnikova, Constantin Orasan, Gloria Corpas Pastor, Stephan Vogel
Venue:
RANLP
SIG:
Publisher:
Association for Computational Linguistics, Shoumen, Bulgaria
Note:
Pages:
36–43
Language:
URL:
https://doi.org/10.26615/978-954-452-042-7_005
DOI:
10.26615/978-954-452-042-7_005
Bibkey:
Cite (ACL):
Irina Temnikova, Ahmed Abdelali, Samy Hedaya, Stephan Vogel, and Aishah Al Daher. 2017. Interpreting Strategies Annotation in the WAW Corpus. In Proceedings of the Workshop Human-Informed Translation and Interpreting Technology, pages 36–43, Varna, Bulgaria. Association for Computational Linguistics, Shoumen, Bulgaria.
Cite (Informal):
Interpreting Strategies Annotation in the WAW Corpus (Temnikova et al., RANLP 2017)
Copy Citation:
PDF:
https://doi.org/10.26615/978-954-452-042-7_005