Problems in Current Text Simplification Research: New Data Can Help

Wei Xu, Chris Callison-Burch, Courtney Napoles


Abstract
Simple Wikipedia has dominated simplification research in the past 5 years. In this opinion paper, we argue that focusing on Wikipedia limits simplification research. We back up our arguments with corpus analysis and by highlighting statements that other researchers have made in the simplification literature. We introduce a new simplification dataset that is a significant improvement over Simple Wikipedia, and present a novel quantitative-comparative approach to study the quality of simplification data resources.
Anthology ID:
Q15-1021
Volume:
Transactions of the Association for Computational Linguistics, Volume 3
Month:
Year:
2015
Address:
Cambridge, MA
Editors:
Michael Collins, Lillian Lee
Venue:
TACL
SIG:
Publisher:
MIT Press
Note:
Pages:
283–297
Language:
URL:
https://aclanthology.org/Q15-1021
DOI:
10.1162/tacl_a_00139
Bibkey:
Cite (ACL):
Wei Xu, Chris Callison-Burch, and Courtney Napoles. 2015. Problems in Current Text Simplification Research: New Data Can Help. Transactions of the Association for Computational Linguistics, 3:283–297.
Cite (Informal):
Problems in Current Text Simplification Research: New Data Can Help (Xu et al., TACL 2015)
Copy Citation:
PDF:
https://aclanthology.org/Q15-1021.pdf
Video:
 https://vimeo.com/150290363
Data
Newsela