Nandan Thakur, Jianmo Ni, Gustavo Hernandez Abrego, John Wieting, Jimmy Lin, and Daniel Cer. 2024. Leveraging LLMs for Synthesizing Training Data Across Many Languages in Multilingual Dense Retrieval. In Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), edited by Kevin Duh, Helena Gomez, and Steven Bethard, pages 7699-7724, Mexico City, Mexico, June 2024. Association for Computational Linguistics. Conference publication. Citation key: thakur-etal-2024-leveraging. DOI: 10.18653/v1/2024.naacl-long.426. URL: https://aclanthology.org/2024.naacl-long.426/