Ruipu Luo
Also published as: 瑞璞 罗
2025
VoCoT: Unleashing Visually Grounded Multi-Step Reasoning in Large Multi-Modal Models
Zejun Li
|
Ruipu Luo
|
Jiwen Zhang
|
Minghui Qiu
|
Xuanjing Huang
|
Zhongyu Wei
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers)
2024
从多模态预训练到多模态大模型:架构、训练、评测、趋势概览(From Multi-Modal Pre-Training to Multi-Modal Large Language Models: An Overview of Architectures, Training,)
Zejun Li (李泽君)
|
Jiwen Zhang (张霁雯)
|
Ye Wang (王晔)
|
Mengfei Du (杜梦飞)
|
Qingwen Liu (刘晴雯)
|
Dianyi Wang (王殿仪)
|
Binhao Wu (吴斌浩)
|
Ruipu Luo (罗瑞璞)
|
Xuanjing Huang (黄萱菁)
|
Zhongyu Wei (魏忠钰)
Proceedings of the 23rd Chinese National Conference on Computational Linguistics (Volume 2: Frontier Forum)
DELAN: Dual-Level Alignment for Vision-and-Language Navigation by Cross-Modal Contrastive Learning
Mengfei Du
|
Binhao Wu
|
Jiwen Zhang
|
Zhihao Fan
|
Zejun Li
|
Ruipu Luo
|
Xuanjing Huang
|
Zhongyu Wei
Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)