Jiwen Zhang
Also published as: 霁雯 张
2025
VoCoT: Unleashing Visually Grounded Multi-Step Reasoning in Large Multi-Modal Models
Zejun Li
|
Ruipu Luo
|
Jiwen Zhang
|
Minghui Qiu
|
Xuanjing Huang
|
Zhongyu Wei
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers)
2024
从多模态预训练到多模态大模型:架构、训练、评测、趋势概览(From Multi-Modal Pre-Training to Multi-Modal Large Language Models: An Overview of Architectures, Training,)
Zejun Li (李泽君)
|
Jiwen Zhang (张霁雯)
|
Ye Wang (王晔)
|
Mengfei Du (杜梦飞)
|
Qingwen Liu (刘晴雯)
|
Dianyi Wang (王殿仪)
|
Binhao Wu (吴斌浩)
|
Ruipu Luo (罗瑞璞)
|
Xuanjing Huang (黄萱菁)
|
Zhongyu Wei (魏忠钰)
Proceedings of the 23rd Chinese National Conference on Computational Linguistics (Volume 2: Frontier Forum)
Android in the Zoo: Chain-of-Action-Thought for GUI Agents
Jiwen Zhang
|
Jihao Wu
|
Teng Yihua
|
Minghui Liao
|
Nuo Xu
|
Xiao Xiao
|
Zhongyu Wei
|
Duyu Tang
Findings of the Association for Computational Linguistics: EMNLP 2024
DELAN: Dual-Level Alignment for Vision-and-Language Navigation by Cross-Modal Contrastive Learning
Mengfei Du
|
Binhao Wu
|
Jiwen Zhang
|
Zhihao Fan
|
Zejun Li
|
Ruipu Luo
|
Xuanjing Huang
|
Zhongyu Wei
Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)
Co-authors
- Zhongyu Wei (魏忠钰) 4
- Xuan-Jing Huang (黄萱菁) 3
- Zejun Li (李泽君) 3
- Ruipu Luo (罗瑞璞) 3
- Mengfei Du (杜梦飞) 2
- show all...