Chris Palaguachi
2024
Long-Form Analogy Evaluation Challenge
Bhavya Bhavya
|
Chris Palaguachi
|
Yang Zhou
|
Suma Bhat
|
ChengXiang Zhai
Proceedings of the 17th International Natural Language Generation Conference: Generation Challenges
Given the practical applications of analogies, recent work has studied analogy generation to explain concepts. However, not all generated analogies are of high quality and it is unclear how to measure the quality of this new kind of generated text. To address this challenge, we propose a shared task on automatically evaluating the quality of generated analogies based on seven comprehensive criteria. For this, we will set up a leader board based on our dataset annotated with manual ratings along the seven criteria, and provide a baseline solution leveraging GPT-4. We hope that this task would advance the progress in development of new evaluation metrics and methods for analogy generation in natural language, particularly for education.