Large-Scale and Multi-Perspective Opinion Summarization with Diverse Review Subsets

Han Jiang, Rui Wang, Zhihua Wei, Yu Li, Xinpeng Wang


Abstract
Opinion summarization is expected to digest larger review sets and provide summaries from different perspectives. However, most existing solutions are deficient in epitomizing extensive reviews and offering opinion summaries from various angles due to the lack of designs for information selection. To this end, we propose SubSumm, a supervised summarization framework for large-scale multi-perspective opinion summarization. SubSumm consists of a review sampling strategy set and a two-stage training scheme. The sampling strategies take sentiment orientation and contrastive information value into consideration, with which the review subsets from different perspectives and quality levels can be selected. Subsequently, the summarizer is encouraged to learn from the sub-optimal and optimal subsets successively in order to capitalize on the massive input. Experimental results on AmaSum and Rotten Tomatoes datasets demonstrate that SubSumm is adept at generating pros, cons, and verdict summaries from hundreds of input reviews. Furthermore, our in-depth analysis verifies that the advanced selection of review subsets and the two-stage training scheme are vital to boosting the summarization performance.
Anthology ID:
2023.findings-emnlp.375
Volume:
Findings of the Association for Computational Linguistics: EMNLP 2023
Month:
December
Year:
2023
Address:
Singapore
Editors:
Houda Bouamor, Juan Pino, Kalika Bali
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
5641–5656
Language:
URL:
https://aclanthology.org/2023.findings-emnlp.375
DOI:
10.18653/v1/2023.findings-emnlp.375
Bibkey:
Cite (ACL):
Han Jiang, Rui Wang, Zhihua Wei, Yu Li, and Xinpeng Wang. 2023. Large-Scale and Multi-Perspective Opinion Summarization with Diverse Review Subsets. In Findings of the Association for Computational Linguistics: EMNLP 2023, pages 5641–5656, Singapore. Association for Computational Linguistics.
Cite (Informal):
Large-Scale and Multi-Perspective Opinion Summarization with Diverse Review Subsets (Jiang et al., Findings 2023)
Copy Citation:
PDF:
https://aclanthology.org/2023.findings-emnlp.375.pdf
Video:
 https://aclanthology.org/2023.findings-emnlp.375.mp4