Elsie Li Chen Ong

2024

According to the internationally recognized PIRLS (Progress in International Reading Literacy Study) assessment standards, reading comprehension questions should require not only information retrieval, but also higher-order processes such as inferencing, interpreting and evaluation. However, these kinds of questions are often not available in large quantities for training question generation models. This paper investigates whether pre-trained Large Language Models (LLMs) can produce higher-order questions. Human assessment on a Chinese dataset shows that few-shot LLM prompting generates more usable and higher-order questions than two competitive neural baselines.

Co-authors

Samuel Kai Wah Chu 1
Yu Yan Lam 1
John Sie Yuen Lee 1
Yin Poon 1
Wing Lam Suen 1

Venues

SIGHAN1
WS1

Fix author