Schema-Guided Natural Language Generation

Yuheng Du; Shereen Oraby; Vittorio Perera; Minmin Shen; Anjali Narayan-Chen; Tagyoung Chung; Anushree Venkatesh; Dilek Hakkani-Tur

doi:10.18653/v1/2020.inlg-1.35

Abstract

Neural network-based approaches to data-to-text natural language generation (NLG) have gained popularity in recent years, with the goal of generating a natural language prompt that accurately realizes an input meaning representation (MR). To facilitate the training of neural network models, researchers created large datasets of paired utterances and their meaning representations. However, the creation of such datasets is an arduous task, and they mostly consist of simple meaning representations composed of slot and value tokens to be realized. These representations do not include any contextual information that an NLG system can use when trying to generalize, such as domain information and descriptions of slots and values. In this paper, we present the novel task of Schema-Guided Natural Language Generation (SG-NLG). Here, the goal is still to generate a natural language prompt, but in SG-NLG, the input MRs are paired with rich schemata providing contextual information. To generate a dataset for SG-NLG, we repurpose an existing dataset built for another task, dialog state tracking, which includes a large and rich schema spanning multiple different attributes, including information about the domain, user intent, and slot descriptions. We train different state-of-the-art models for neural natural language generation on this dataset and show that, in many cases, including rich schema information allows our models to produce higher-quality outputs in terms of both semantics and diversity. We also conduct experiments comparing model performance on seen versus unseen domains, and present a human evaluation demonstrating high ratings for overall output quality.
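
To make the contrast in the abstract concrete, the following is a minimal sketch of how a flat slot/value MR might differ from one paired with schema information (domain, intent, slot descriptions). It is not the authors' input format: the function names, the dictionary layout, the `act` field, the separators, and the example values are all assumptions made for illustration only.

```python
# Illustrative sketch only; the field names and string layout are hypothetical,
# not the format used in the paper or in the SG-NLG dataset.

def flat_mr(act, slots):
    """Schema-free input: an act label followed by bare slot=value tokens."""
    return " ".join([act] + [f"{name}={value}" for name, value in slots])


def schema_guided_mr(act, slots, schema):
    """Same MR, paired with domain, intent, and slot descriptions from a schema."""
    fields = [
        f"domain: {schema['domain']}",
        f"intent: {schema['intent']}",
        f"act: {act}",
    ]
    for name, value in slots:
        # Fall back to the slot name when the schema has no description for it.
        description = schema["slot_descriptions"].get(name, name)
        fields.append(f"{name} ({description}) = {value}")
    return " | ".join(fields)


# Hypothetical schema and MR, invented for this example.
schema = {
    "domain": "Restaurants",
    "intent": "ReserveRestaurant",
    "slot_descriptions": {"city": "city where the restaurant is located"},
}
slots = [("city", "Dublin")]

print(flat_mr("INFORM", slots))
print(schema_guided_mr("INFORM", slots, schema))
```

The flat form gives a model only the tokens `city` and `Dublin`, whereas the schema-guided form also carries the domain, the intent, and a natural language description of the slot, which is the kind of contextual information the abstract argues can help a model generalize.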

Anthology ID: 2020.inlg-1.35
Volume: Proceedings of the 13th International Conference on Natural Language Generation
Month: December
Year: 2020
Address: Dublin, Ireland
Editors: Brian Davis, Yvette Graham, John Kelleher, Yaji Sripada
Venue: INLG
SIG: SIGGEN
Publisher: Association for Computational Linguistics
Pages: 283–295
URL: https://aclanthology.org/2020.inlg-1.35
DOI: 10.18653/v1/2020.inlg-1.35
Cite (ACL): Yuheng Du, Shereen Oraby, Vittorio Perera, Minmin Shen, Anjali Narayan-Chen, Tagyoung Chung, Anushree Venkatesh, and Dilek Hakkani-Tur. 2020. Schema-Guided Natural Language Generation. In Proceedings of the 13th International Conference on Natural Language Generation, pages 283–295, Dublin, Ireland. Association for Computational Linguistics.
Cite (Informal): Schema-Guided Natural Language Generation (Du et al., INLG 2020)
PDF: https://aclanthology.org/2020.inlg-1.35.pdf
Code: alexa/schema-guided-nlg
Data: SG-NLG
