Kento Tanaka

2022

We focus on image description and a corresponding assessment system for language learners. To achieve automatic assessment of image description, we construct a novel dataset, the Language Learner Image Description (LLID) dataset, which consists of images, their descriptions, and assessment annotations. Then, we propose a novel task of automatic error correction for image description, and we develop a baseline model that encodes multimodal information from a learner sentence with an image and accurately decodes a corrected sentence. Our experimental results show that the developed model can revise errors that cannot be revised without an image.

Co-authors

Masatake Dantsuji 1
Hirotaka Kameko 1
Hiroaki Nanjo 1
Taichi Nishimura 1
Keisuke Shirai 1

Venues

LREC1

Fix author