Integrating Transformer and Paraphrase Rules for Sentence Simplification

Sanqiang Zhao, Rui Meng, Daqing He, Andi Saptono, Bambang Parmanto


Abstract
Sentence simplification aims to reduce the complexity of a sentence while retaining its original meaning. Current models for sentence simplification adopted ideas from machine translation studies and implicitly learned simplification mapping rules from normal-simple sentence pairs. In this paper, we explore a novel model based on a multi-layer and multi-head attention architecture and we propose two innovative approaches to integrate the Simple PPDB (A Paraphrase Database for Simplification), an external paraphrase knowledge base for simplification that covers a wide range of real-world simplification rules. The experiments show that the integration provides two major benefits: (1) the integrated model outperforms multiple state-of-the-art baseline models for sentence simplification in the literature (2) through analysis of the rule utilization, the model seeks to select more accurate simplification rules. The code and models used in the paper are available at https://github.com/Sanqiang/text_simplification.
Anthology ID:
D18-1355
Volume:
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
Month:
October-November
Year:
2018
Address:
Brussels, Belgium
Venue:
EMNLP
SIG:
SIGDAT
Publisher:
Association for Computational Linguistics
Note:
Pages:
3164–3173
Language:
URL:
https://aclanthology.org/D18-1355
DOI:
10.18653/v1/D18-1355
Bibkey:
Cite (ACL):
Sanqiang Zhao, Rui Meng, Daqing He, Andi Saptono, and Bambang Parmanto. 2018. Integrating Transformer and Paraphrase Rules for Sentence Simplification. In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, pages 3164–3173, Brussels, Belgium. Association for Computational Linguistics.
Cite (Informal):
Integrating Transformer and Paraphrase Rules for Sentence Simplification (Zhao et al., EMNLP 2018)
Copy Citation:
PDF:
https://aclanthology.org/D18-1355.pdf
Video:
 https://vimeo.com/305927122
Code
 Sanqiang/text_simplification
Data
ASSETNewselaTurkCorpus