Hendrik Rosendahl
2019
Learning Bilingual Sentence Embeddings via Autoencoding and Computing Similarities with a Multilayer Perceptron
Yunsu Kim
|
Hendrik Rosendahl
|
Nick Rossenbach
|
Jan Rosendahl
|
Shahram Khadivi
|
Hermann Ney
Proceedings of the 4th Workshop on Representation Learning for NLP (RepL4NLP-2019)
We propose a novel model architecture and training algorithm to learn bilingual sentence embeddings from a combination of parallel and monolingual data. Our method connects autoencoding and neural machine translation to force the source and target sentence embeddings to share the same space without the help of a pivot language or an additional transformation. We train a multilayer perceptron on top of the sentence embeddings to extract good bilingual sentence pairs from nonparallel or noisy parallel data. Our approach shows promising performance on sentence alignment recovery and the WMT 2018 parallel corpus filtering tasks with only a single model.
2016
CharacTer: Translation Edit Rate on Character Level
Weiyue Wang
|
Jan-Thorsten Peter
|
Hendrik Rosendahl
|
Hermann Ney
Proceedings of the First Conference on Machine Translation: Volume 2, Shared Task Papers
Search
Fix data
Co-authors
- Hermann Ney 2
- Shahram Khadivi 1
- Yunsu Kim 1
- Jan-Thorsten Peter 1
- Jan Rosendahl 1
- show all...