The Vault: A Comprehensive Multilingual Dataset for Advancing Code Understanding and Generation Dung Nguyen Manh author Nam Le Hai author Anh T V Dau author Anh Minh Nguyen author Khanh Nghiem author Jin Guo author Nghi D Q Bui author 2023-12 text Proceedings of the 3rd Workshop for Natural Language Processing Open Source Software (NLP-OSS 2023) Liling Tan editor Dmitrijs Milajevs editor Geeticka Chauhan editor Jeremy Gwinnup editor Elijah Rippeth editor Association for Computational Linguistics Singapore conference publication manh-etal-2023-vault 10.18653/v1/2023.nlposs-1.25 https://aclanthology.org/2023.nlposs-1.25/ 2023-12 219 244