On the Helpfulness of Document Context to Sentence Simplification

Renliang Sun, Zhe Lin, Xiaojun Wan


Abstract
Most research on text simplification is currently limited to the sentence level. In this paper, we are the first to investigate how helpful document context is to sentence simplification, and we apply it to a sequence-to-sequence model. We first construct a sentence simplification dataset in which the context for each original sentence is drawn from the Wikipedia corpus. The new dataset contains approximately 116K sentence pairs with context. We then propose a new model that makes full use of the context information. Our model uses neural networks to learn the different effects of the preceding and following sentences on the current sentence and applies them to the improved Transformer model. Evaluated on the newly constructed dataset, our model achieves a SARI score of 36.52, outperforming the best-performing baseline by 2.46 points (7.22%), which indicates that context indeed helps improve sentence simplification. In the ablation experiment, we show that using either the preceding sentences or the following sentences as context can significantly improve simplification.
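The relative improvement quoted in the abstract can be checked with a little arithmetic. The sketch below assumes that the 7.22% figure is the absolute gain divided by the best baseline's SARI score. The abstract does not state the baseline score directly, so it is derived here from the two reported numbers. The variable names are illustrative only.

```python
# Figures quoted in the abstract.
model_sari = 36.52      # SARI of the proposed model
absolute_gain = 2.46    # improvement over the best baseline, in SARI points

# Implied SARI of the best baseline (derived, not stated in the abstract).
baseline_sari = round(model_sari - absolute_gain, 2)

# Relative gain measured against the baseline, in percent (assumed definition).
relative_gain_pct = round(100 * absolute_gain / baseline_sari, 2)

print(baseline_sari, relative_gain_pct)
```

Under that assumption the implied baseline is 34.06 and the relative gain comes out to 7.22%, which matches the reported figure.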
Anthology ID:
2020.coling-main.121
Volume:
Proceedings of the 28th International Conference on Computational Linguistics
Month:
December
Year:
2020
Address:
Barcelona, Spain (Online)
Editors:
Donia Scott, Nuria Bel, Chengqing Zong
Venue:
COLING
Publisher:
International Committee on Computational Linguistics
Pages:
1411–1423
URL:
https://aclanthology.org/2020.coling-main.121
DOI:
10.18653/v1/2020.coling-main.121
Cite (ACL):
Renliang Sun, Zhe Lin, and Xiaojun Wan. 2020. On the Helpfulness of Document Context to Sentence Simplification. In Proceedings of the 28th International Conference on Computational Linguistics, pages 1411–1423, Barcelona, Spain (Online). International Committee on Computational Linguistics.
Cite (Informal):
On the Helpfulness of Document Context to Sentence Simplification (Sun et al., COLING 2020)
PDF:
https://aclanthology.org/2020.coling-main.121.pdf
Code:
rlsnlp/document-context-to-sentence-simplification
Data:
Newsela