Yuanhao Xiong, Yixin Nie, Haotian Liu, Boxin Wang, Jun Chen, Rong Jin, Cho-Jui Hsieh, Lorenzo Torresani, and Jie Lei. 2024. UNICORN: A Unified Causal Video-Oriented Language-Modeling Framework for Temporal Video-Language Tasks. In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, edited by Yaser Al-Onaizan, Mohit Bansal, and Yun-Nung Chen, pages 12983-12997, Miami, Florida, USA, November 2024. Association for Computational Linguistics. Conference publication. DOI: 10.18653/v1/2024.emnlp-main.722. URL: https://aclanthology.org/2024.emnlp-main.722/. Citation key: xiong-etal-2024-unicorn.