English-to-Japanese Multimodal Machine Translation Based on Image-Text Matching of Lecture Videos

English-to-Japanese Multimodal Machine Translation Based on Image-Text Matching of Lecture Videos Ayu Teramen author Takumi Ohtsuka author Risa Kondo author Tomoyuki Kajiwara author Takashi Ninomiya author 2024-08 text Proceedings of the 3rd Workshop on Advances in Language and Vision Research (ALVR) Jing Gu editor Tsu-Jui (Ray) Fu editor Drew Hudson editor Asli Celikyilmaz editor William Wang editor Association for Computational Linguistics Bangkok, Thailand conference publication teramen-etal-2024-english 10.18653/v1/2024.alvr-1.7 https://aclanthology.org/2024.alvr-1.7/ 2024-08 86 91