Scaffolding Coordinates to Promote Vision-Language Coordination in Large Multi-Modal Models Xuanyu Lei author Zonghan Yang author Xinrui Chen author Peng Li author Yang Liu author 2025-01 text Proceedings of the 31st International Conference on Computational Linguistics Owen Rambow editor Leo Wanner editor Marianna Apidianaki editor Hend Al-Khalifa editor Barbara Di Eugenio editor Steven Schockaert editor Association for Computational Linguistics Abu Dhabi, UAE conference publication lei-etal-2025-scaffolding https://aclanthology.org/2025.coling-main.195/ 2025-01 2886 2903