Hierarchical Catalogue Generation for Literature Review: A Benchmark

Kun Zhu (朱坤); Xiaocheng Feng; Xiachong Feng; Yingsheng Wu; Bing Qin (秦兵)

doi:10.18653/v1/2023.findings-emnlp.453

Hierarchical Catalogue Generation for Literature Review: A Benchmark

Kun Zhu, Xiaocheng Feng, Xiachong Feng, Yingsheng Wu, Bing Qin

Abstract

Scientific literature review generation aims to extract and organize important information from an abundant collection of reference papers and produces corresponding reviews while lacking a clear and logical hierarchy. We observe that a high-quality catalogue-guided generation process can effectively alleviate this problem. Therefore, we present an atomic and challenging task named Hierarchical Catalogue Generation for Literature Review as the first step for review generation, which aims to produce a hierarchical catalogue of a review paper given various references. We construct a novel English Hierarchical Catalogues of Literature Reviews Dataset with 7.6k literature review catalogues and 389k reference papers. To accurately assess the model performance, we design two evaluation metrics for informativeness and similarity to ground truth from semantics and structure. Our extensive analyses verify the high quality of our dataset and the effectiveness of our evaluation metrics. We further benchmark diverse experiments on state-of-the-art summarization models like BART and large language models like ChatGPT to evaluate their capabilities. We further discuss potential directions for this task to motivate future research.

Anthology ID:: 2023.findings-emnlp.453
Volume:: Findings of the Association for Computational Linguistics: EMNLP 2023
Month:: December
Year:: 2023
Address:: Singapore
Editors:: Houda Bouamor, Juan Pino, Kalika Bali
Venue:: Findings
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 6790–6804
Language:
URL:: https://aclanthology.org/2023.findings-emnlp.453/
DOI:: 10.18653/v1/2023.findings-emnlp.453
Bibkey:
Cite (ACL):: Kun Zhu, Xiaocheng Feng, Xiachong Feng, Yingsheng Wu, and Bing Qin. 2023. Hierarchical Catalogue Generation for Literature Review: A Benchmark. In Findings of the Association for Computational Linguistics: EMNLP 2023, pages 6790–6804, Singapore. Association for Computational Linguistics.
Cite (Informal):: Hierarchical Catalogue Generation for Literature Review: A Benchmark (Zhu et al., Findings 2023)
Copy Citation:
PDF:: https://aclanthology.org/2023.findings-emnlp.453.pdf

PDF Cite Search Fix data