BibTeX
@inproceedings{liu-zeldes-2023-cant,
    title = "Why Can't Discourse Parsing Generalize? A Thorough Investigation of the Impact of Data Diversity",
    author = "Liu, Yang Janet and
      Zeldes, Amir",
    editor = "Vlachos, Andreas and
      Augenstein, Isabelle",
    booktitle = "Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics",
    month = may,
    year = "2023",
    address = "Dubrovnik, Croatia",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2023.eacl-main.227/",
    doi = "10.18653/v1/2023.eacl-main.227",
    pages = "3112--3130",
    abstract = "Recent advances in discourse parsing performance create the impression that, as in other NLP tasks, performance for high-resource languages such as English is finally becoming reliable. In this paper we demonstrate that this is not the case, and thoroughly investigate the impact of data diversity on RST parsing stability. We show that state-of-the-art architectures trained on the standard English newswire benchmark do not generalize well, even within the news domain. Using the two largest RST corpora of English with text from multiple genres, we quantify the impact of genre diversity in training data for achieving generalization to text types unseen during training. Our results show that a heterogeneous training regime is critical for stable and generalizable models, across parser architectures. We also provide error analyses of model outputs and out-of-domain performance. To our knowledge, this study is the first to fully evaluate cross-corpus RST parsing generalizability on complete trees, examine between-genre degradation within an RST corpus, and investigate the impact of genre diversity in training data composition."
}
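For scripted reuse, the fields of an entry like this can be pulled out without a full BibTeX parser. The sketch below is a deliberately simplistic, standard-library-only Python illustration, not a general BibTeX reader: it assumes quoted field values with no nested quotes, and it ignores unquoted values such as month = may.

import re

# An abbreviated copy of the entry above, kept short for readability.
bibtex = '''@inproceedings{liu-zeldes-2023-cant,
    title = "Why Can't Discourse Parsing Generalize? A Thorough Investigation of the Impact of Data Diversity",
    year = "2023",
    pages = "3112--3130",
}'''

# Match field = "value" pairs; the values here contain no nested quotes,
# so a non-greedy quoted match is enough for this sketch.
fields = dict(re.findall(r'(\w+)\s*=\s*"(.*?)"', bibtex, re.DOTALL))
print(fields["title"])   # -> Why Can't Discourse Parsing Generalize? ...
print(fields["pages"])   # -> 3112--3130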
MODS XML
<?xml version="1.0" encoding="UTF-8"?>
<modsCollection xmlns="http://www.loc.gov/mods/v3">
<mods ID="liu-zeldes-2023-cant">
    <titleInfo>
        <title>Why Can’t Discourse Parsing Generalize? A Thorough Investigation of the Impact of Data Diversity</title>
    </titleInfo>
    <name type="personal">
        <namePart type="given">Yang</namePart>
        <namePart type="given">Janet</namePart>
        <namePart type="family">Liu</namePart>
        <role>
            <roleTerm authority="marcrelator" type="text">author</roleTerm>
        </role>
    </name>
    <name type="personal">
        <namePart type="given">Amir</namePart>
        <namePart type="family">Zeldes</namePart>
        <role>
            <roleTerm authority="marcrelator" type="text">author</roleTerm>
        </role>
    </name>
    <originInfo>
        <dateIssued>2023-05</dateIssued>
    </originInfo>
    <typeOfResource>text</typeOfResource>
    <relatedItem type="host">
        <titleInfo>
            <title>Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics</title>
        </titleInfo>
        <name type="personal">
            <namePart type="given">Andreas</namePart>
            <namePart type="family">Vlachos</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Isabelle</namePart>
            <namePart type="family">Augenstein</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <originInfo>
            <publisher>Association for Computational Linguistics</publisher>
            <place>
                <placeTerm type="text">Dubrovnik, Croatia</placeTerm>
            </place>
        </originInfo>
        <genre authority="marcgt">conference publication</genre>
    </relatedItem>
    <abstract>Recent advances in discourse parsing performance create the impression that, as in other NLP tasks, performance for high-resource languages such as English is finally becoming reliable. In this paper we demonstrate that this is not the case, and thoroughly investigate the impact of data diversity on RST parsing stability. We show that state-of-the-art architectures trained on the standard English newswire benchmark do not generalize well, even within the news domain. Using the two largest RST corpora of English with text from multiple genres, we quantify the impact of genre diversity in training data for achieving generalization to text types unseen during training. Our results show that a heterogeneous training regime is critical for stable and generalizable models, across parser architectures. We also provide error analyses of model outputs and out-of-domain performance. To our knowledge, this study is the first to fully evaluate cross-corpus RST parsing generalizability on complete trees, examine between-genre degradation within an RST corpus, and investigate the impact of genre diversity in training data composition.</abstract>
    <identifier type="citekey">liu-zeldes-2023-cant</identifier>
    <identifier type="doi">10.18653/v1/2023.eacl-main.227</identifier>
    <location>
        <url>https://aclanthology.org/2023.eacl-main.227/</url>
    </location>
    <part>
        <date>2023-05</date>
        <extent unit="page">
            <start>3112</start>
            <end>3130</end>
        </extent>
    </part>
</mods>
</modsCollection>
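The MODS record is plain namespaced XML, so it can be consumed with Python's standard xml.etree.ElementTree. The sketch below is a minimal illustration over a trimmed copy of the record, not a general MODS reader; it extracts the title and the author names.

import xml.etree.ElementTree as ET

# A trimmed but well-formed copy of the MODS record above.
mods_xml = """<modsCollection xmlns="http://www.loc.gov/mods/v3">
  <mods ID="liu-zeldes-2023-cant">
    <titleInfo><title>Why Can't Discourse Parsing Generalize? A Thorough Investigation of the Impact of Data Diversity</title></titleInfo>
    <name type="personal">
      <namePart type="given">Yang</namePart>
      <namePart type="given">Janet</namePart>
      <namePart type="family">Liu</namePart>
      <role><roleTerm authority="marcrelator" type="text">author</roleTerm></role>
    </name>
    <name type="personal">
      <namePart type="given">Amir</namePart>
      <namePart type="family">Zeldes</namePart>
      <role><roleTerm authority="marcrelator" type="text">author</roleTerm></role>
    </name>
  </mods>
</modsCollection>"""

NS = "{http://www.loc.gov/mods/v3}"  # every element lives in this namespace
root = ET.fromstring(mods_xml)       # root is <modsCollection>
mods = root.find(f"{NS}mods")

title = mods.find(f"{NS}titleInfo/{NS}title").text

# Collect "Given [Given2] Family" strings for each personal name whose
# role is "author" (the same pattern would find editors in relatedItem).
authors = []
for name in mods.findall(f"{NS}name[@type='personal']"):
    role = name.find(f"{NS}role/{NS}roleTerm")
    if role is not None and role.text == "author":
        parts = [p.text for p in name.findall(f"{NS}namePart")]
        authors.append(" ".join(parts))

print(title)
print(authors)  # -> ['Yang Janet Liu', 'Amir Zeldes']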
Endnote
%0 Conference Proceedings
%T Why Can’t Discourse Parsing Generalize? A Thorough Investigation of the Impact of Data Diversity
%A Liu, Yang Janet
%A Zeldes, Amir
%Y Vlachos, Andreas
%Y Augenstein, Isabelle
%S Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics
%D 2023
%8 May
%I Association for Computational Linguistics
%C Dubrovnik, Croatia
%F liu-zeldes-2023-cant
%X Recent advances in discourse parsing performance create the impression that, as in other NLP tasks, performance for high-resource languages such as English is finally becoming reliable. In this paper we demonstrate that this is not the case, and thoroughly investigate the impact of data diversity on RST parsing stability. We show that state-of-the-art architectures trained on the standard English newswire benchmark do not generalize well, even within the news domain. Using the two largest RST corpora of English with text from multiple genres, we quantify the impact of genre diversity in training data for achieving generalization to text types unseen during training. Our results show that a heterogeneous training regime is critical for stable and generalizable models, across parser architectures. We also provide error analyses of model outputs and out-of-domain performance. To our knowledge, this study is the first to fully evaluate cross-corpus RST parsing generalizability on complete trees, examine between-genre degradation within an RST corpus, and investigate the impact of genre diversity in training data composition.
%R 10.18653/v1/2023.eacl-main.227
%U https://aclanthology.org/2023.eacl-main.227/
%U https://doi.org/10.18653/v1/2023.eacl-main.227
%P 3112-3130
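The Endnote record above uses the refer-style tagged format: each line is a % tag followed by a single value, and tags such as %A (author), %Y (editor), and %U (URL) may repeat. A minimal standard-library Python sketch that collects every tag into a list, shown over an abbreviated copy of the record:

# Abbreviated copy of the tagged record above.
record = """%0 Conference Proceedings
%T Why Can't Discourse Parsing Generalize? A Thorough Investigation of the Impact of Data Diversity
%A Liu, Yang Janet
%A Zeldes, Amir
%D 2023
%P 3112-3130"""

# Split each line at the first space; repeatable tags accumulate in lists.
fields = {}
for line in record.splitlines():
    tag, _, value = line.partition(" ")
    fields.setdefault(tag, []).append(value)

print(fields["%T"][0])  # title
print(fields["%A"])     # -> ['Liu, Yang Janet', 'Zeldes, Amir']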
Markdown (Informal)
[Why Can’t Discourse Parsing Generalize? A Thorough Investigation of the Impact of Data Diversity](https://aclanthology.org/2023.eacl-main.227/) (Liu & Zeldes, EACL 2023)
ACL
Yang Janet Liu and Amir Zeldes. 2023. Why Can’t Discourse Parsing Generalize? A Thorough Investigation of the Impact of Data Diversity. In Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, pages 3112–3130, Dubrovnik, Croatia. Association for Computational Linguistics.