Toward a Better Story End: Collecting Human Evaluation with Reasons

Yusuke Mori, Hiroaki Yamane, Yusuke Mukuta, Tatsuya Harada


Abstract
Creativity is an essential element of human nature used for many activities, such as telling a story. Based on human creativity, researchers have attempted to teach a computer to generate stories automatically or support this creative process. In this study, we undertake the task of story ending generation. This is a relatively new task, in which the last sentence of a given incomplete story is automatically generated. This is challenging because, in order to predict an appropriate ending, the generation method should comprehend the context of events. Despite the importance of this task, no clear evaluation metric has been established thus far; hence, it has remained an open problem. Therefore, we study the various elements involved in evaluating an automatic method for generating story endings. First, we introduce a baseline hierarchical sequence-to-sequence method for story ending generation. Then, we conduct a pairwise comparison against human-written endings, in which annotators choose the preferable ending. In addition to a quantitative evaluation, we conduct a qualitative evaluation by asking annotators to specify the reason for their choice. From the collected reasons, we discuss what elements the evaluation should focus on, to thereby propose effective metrics for the task.
Anthology ID:
W19-8646
Original:
W19-8646v1
Version 2:
W19-8646v2
Volume:
Proceedings of the 12th International Conference on Natural Language Generation
Month:
October–November
Year:
2019
Address:
Tokyo, Japan
Editors:
Kees van Deemter, Chenghua Lin, Hiroya Takamura
Venue:
INLG
SIG:
SIGGEN
Publisher:
Association for Computational Linguistics
Pages:
383–390
URL:
https://aclanthology.org/W19-8646
DOI:
10.18653/v1/W19-8646
Cite (ACL):
Yusuke Mori, Hiroaki Yamane, Yusuke Mukuta, and Tatsuya Harada. 2019. Toward a Better Story End: Collecting Human Evaluation with Reasons. In Proceedings of the 12th International Conference on Natural Language Generation, pages 383–390, Tokyo, Japan. Association for Computational Linguistics.
Cite (Informal):
Toward a Better Story End: Collecting Human Evaluation with Reasons (Mori et al., INLG 2019)
PDF:
https://aclanthology.org/W19-8646.pdf
Supplementary attachment:
W19-8646.Supplementary_Attachment.zip