Improving bilingual sub-sentential alignment by sampling-based transpotting

Li Gong, Aurélien Max, François Yvon


Abstract
In this article, we present a sampling-based approach to improve bilingual sub-sentential alignment in parallel corpora. This approach can be used to align parallel sentences on an as needed basis, and is able to accurately align newly available sentences. We evaluate the resulting alignments on several Machine Translation tasks. Results show that for the tasks considered here, our approach performs on par with the state-of-the-art statistical alignment pipeline giza++/Moses, and obtains superior results in a number of configurations, notably when aligning additional parallel sentence pairs carefully selected to match the test input.
Anthology ID:
2013.iwslt-papers.7
Volume:
Proceedings of the 10th International Workshop on Spoken Language Translation: Papers
Month:
December 5-6
Year:
2013
Address:
Heidelberg, Germany
Editor:
Joy Ying Zhang
Venue:
IWSLT
SIG:
SIGSLT
Publisher:
Note:
Pages:
Language:
URL:
https://aclanthology.org/2013.iwslt-papers.7
DOI:
Bibkey:
Cite (ACL):
Li Gong, Aurélien Max, and François Yvon. 2013. Improving bilingual sub-sentential alignment by sampling-based transpotting. In Proceedings of the 10th International Workshop on Spoken Language Translation: Papers, Heidelberg, Germany.
Cite (Informal):
Improving bilingual sub-sentential alignment by sampling-based transpotting (Gong et al., IWSLT 2013)
Copy Citation:
PDF:
https://aclanthology.org/2013.iwslt-papers.7.pdf