Reproducing a Manual Evaluation of the Simplicity of Text Simplification System Outputs

Maja Popović; Sheila Castilho; Rudali Huidrom; Anya Belz

Abstract

In this paper we describe our reproduction study of the human evaluation of text simplicity reported by Nisioi et al. (2017). The work was carried out as part of the ReproGen Shared Task 2022 on Reproducibility of Evaluations in NLG. Our aim was to repeat the evaluation of simplicity for nine automatic text simplification systems with a different set of evaluators. We describe our experimental design together with the known aspects of the original experimental design and present the results from both studies. Pearson correlation between the original and reproduction scores is moderate to high (0.776). Inter-annotator agreement in the reproduction study is lower (0.40) than in the original study (0.66). We discuss challenges arising from the unavailability of certain aspects of the original set-up, and make several suggestions as to how reproduction of similar evaluations can be made easier in future.

Anthology ID: 2022.inlg-genchal.12
Volume: Proceedings of the 15th International Conference on Natural Language Generation: Generation Challenges
Month: July
Year: 2022
Address: Waterville, Maine, USA and virtual meeting
Editors: Samira Shaikh, Thiago Ferreira, Amanda Stent
Venue: INLG
SIG: SIGGEN
Publisher: Association for Computational Linguistics
Pages: 80–85
URL: https://aclanthology.org/2022.inlg-genchal.12/
Cite (ACL): Maja Popović, Sheila Castilho, Rudali Huidrom, and Anya Belz. 2022. Reproducing a Manual Evaluation of the Simplicity of Text Simplification System Outputs. In Proceedings of the 15th International Conference on Natural Language Generation: Generation Challenges, pages 80–85, Waterville, Maine, USA and virtual meeting. Association for Computational Linguistics.
Cite (Informal): Reproducing a Manual Evaluation of the Simplicity of Text Simplification System Outputs (Popović et al., INLG 2022)
PDF: https://aclanthology.org/2022.inlg-genchal.12.pdf