What Matters in Training a GPT4-Style Language Model with Multimodal Inputs? Yan Zeng author Hanbo Zhang author Jiani Zheng author Jiangnan Xia author Guoqiang Wei author Yang Wei author Yuchen Zhang author Tao Kong author Ruihua Song author 2024-06 text Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers) Kevin Duh editor Helena Gomez editor Steven Bethard editor Association for Computational Linguistics Mexico City, Mexico conference publication zeng-etal-2024-matters 10.18653/v1/2024.naacl-long.440 https://aclanthology.org/2024.naacl-long.440/ 2024-06 7937 7964