The MIRACL Arabic-English statistical machine translation system for IWSLT 2010

Ines Turki Khemakhem, Salma Jamoussi, Abdelmajid Ben Hamadou


Abstract
This paper describes the MIRACL statistical machine translation system and the improvements developed during the IWSLT 2010 evaluation campaign. We participated in the Arabic-to-English BTEC tasks using a phrase-based statistical machine translation approach. We first discuss some challenges in translating from Arabic to English and explore various techniques to improve performance on such a task. Next, we present our solution for disambiguating the output of an Arabic morphological analyzer. The analyzer we use produces all possible morphological structures for each word, only one of which is correct. In this work, we exploit the Arabic-English alignment to choose the correct segmented form and the correct morpho-syntactic features among those produced by our morphological analyzer.
Anthology ID:
2010.iwslt-evaluation.15
Volume:
Proceedings of the 7th International Workshop on Spoken Language Translation: Evaluation Campaign
Month:
December 2-3
Year:
2010
Address:
Paris, France
Venue:
IWSLT
SIG:
SIGSLT
Pages:
119–125
URL:
https://aclanthology.org/2010.iwslt-evaluation.15
Cite (ACL):
Ines Turki Khemakhem, Salma Jamoussi, and Abdelmajid Ben Hamadou. 2010. The MIRACL Arabic-English statistical machine translation system for IWSLT 2010. In Proceedings of the 7th International Workshop on Spoken Language Translation: Evaluation Campaign, pages 119–125, Paris, France.
Cite (Informal):
The MIRACL Arabic-English statistical machine translation system for IWSLT 2010 (Turki Khemakhem et al., IWSLT 2010)
PDF:
https://aclanthology.org/2010.iwslt-evaluation.15.pdf