Towards Noise-Tolerant Speech-Referring Video Object Segmentation: Bridging Speech and Text Xiang Li author Jinglu Wang author Xiaohao Xu author Muqiao Yang author Fan Yang author Yizhou Zhao author Rita Singh author Bhiksha Raj author 2023-12 text Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing Houda Bouamor editor Juan Pino editor Kalika Bali editor Association for Computational Linguistics Singapore conference publication li-etal-2023-towards-noise 10.18653/v1/2023.emnlp-main.140 https://aclanthology.org/2023.emnlp-main.140/ 2023-12 2283 2296