Title: Low-Resource Corpus Filtering Using Multilingual Sentence Embeddings
Authors: Vishrav Chaudhary, Yuqing Tang, Francisco Guzmán, Holger Schwenk, Philipp Koehn
Date: 2019-08
Type: text
Genre: conference publication
In: Proceedings of the Fourth Conference on Machine Translation (Volume 3: Shared Task Papers, Day 2)
Editors: Ondřej Bojar, Rajen Chatterjee, Christian Federmann, Mark Fishel, Yvette Graham, Barry Haddow, Matthias Huck, Antonio Jimeno Yepes, Philipp Koehn, André Martins, Christof Monz, Matteo Negri, Aurélie Névéol, Mariana Neves, Matt Post, Marco Turchi, Karin Verspoor
Publisher: Association for Computational Linguistics
Place: Florence, Italy
Pages: 261-266
Identifier: chaudhary-etal-2019-low
DOI: 10.18653/v1/W19-5435
URL: https://aclanthology.org/W19-5435/