Statistical Machine Translation of European Parliamentary Speeches

David Vilar, Evgeny Matusov, Sasa Hasan, Richard Zens, Hermann Ney


Abstract
In this paper we present the ongoing work at RWTH Aachen University on building a speech-to-speech translation system within the TC-Star project. The corpus we work on consists of parliamentary speeches held in the European Plenary Sessions. To our knowledge, this is the first project that focuses on speech-to-speech translation applied to a real-life task. We describe the statistical approach used in the development of our system and analyze its performance under three conditions: syntactically correct input, exact transcriptions of speech, and the (noisy) output of an automatic speech recognition system. Experimental results show that our system is able to perform adequately in each of these conditions.
Anthology ID:
2005.mtsummit-papers.34
Volume:
Proceedings of Machine Translation Summit X: Papers
Month:
September 13-15
Year:
2005
Address:
Phuket, Thailand
Venue:
MTSummit
Pages:
259–266
URL:
https://aclanthology.org/2005.mtsummit-papers.34
Cite (ACL):
David Vilar, Evgeny Matusov, Sasa Hasan, Richard Zens, and Hermann Ney. 2005. Statistical Machine Translation of European Parliamentary Speeches. In Proceedings of Machine Translation Summit X: Papers, pages 259–266, Phuket, Thailand.
Cite (Informal):
Statistical Machine Translation of European Parliamentary Speeches (Vilar et al., MTSummit 2005)
PDF:
https://aclanthology.org/2005.mtsummit-papers.34.pdf