Title: A Sentence Alignment Approach to Document Alignment and Multi-faceted Filtering for Curating Parallel Sentence Pairs from Web-crawled Data
Author: Steinthor Steingrimsson
Date: 2023-12
Type: text (conference publication)
Published in: Proceedings of the Eighth Conference on Machine Translation
Editors: Philipp Koehn, Barry Haddow, Tom Kocmi, Christof Monz
Publisher: Association for Computational Linguistics
Place: Singapore
Pages: 366-374
Citation key: steingrimsson-2023-sentence
DOI: 10.18653/v1/2023.wmt-1.38
URL: https://aclanthology.org/2023.wmt-1.38/