APA-RST: A Text Simplification Corpus with RST Annotations

Freya Hewett


Abstract
We present a corpus of parallel German-language simplified newspaper articles. The articles have been aligned at the sentence level and annotated according to the Rhetorical Structure Theory (RST) framework. These RST-annotated texts could shed light on structural aspects of text complexity and on how simplification operates at the text level.
Anthology ID:
2023.codi-1.23
Volume:
Proceedings of the 4th Workshop on Computational Approaches to Discourse (CODI 2023)
Month:
July
Year:
2023
Address:
Toronto, Canada
Editors:
Michael Strube, Chloe Braud, Christian Hardmeier, Junyi Jessy Li, Sharid Loaiciga, Amir Zeldes
Venue:
CODI
Publisher:
Association for Computational Linguistics
Pages:
173–179
URL:
https://aclanthology.org/2023.codi-1.23
DOI:
10.18653/v1/2023.codi-1.23
Cite (ACL):
Freya Hewett. 2023. APA-RST: A Text Simplification Corpus with RST Annotations. In Proceedings of the 4th Workshop on Computational Approaches to Discourse (CODI 2023), pages 173–179, Toronto, Canada. Association for Computational Linguistics.
Cite (Informal):
APA-RST: A Text Simplification Corpus with RST Annotations (Hewett, CODI 2023)
PDF:
https://aclanthology.org/2023.codi-1.23.pdf