Exploring Schema Generalizability of Text-to-SQL

Jieyu Li; Lu Chen; Ruisheng Cao; Su Zhu; Hongshen Xu; Zhi Chen; Hanchong Zhang; Kai Yu

doi:10.18653/v1/2023.findings-acl.87

Exploring Schema Generalizability of Text-to-SQL

Jieyu Li, Lu Chen, Ruisheng Cao, Su Zhu, Hongshen Xu, Zhi Chen, Hanchong Zhang, Kai Yu

Abstract

Exploring the generalizability of a text-to-SQL parser is essential for a system to automatically adapt the real-world databases. Previous investigation works mostly focus on lexical diversity, including the influence of the synonym and perturbations in both natural language questions and databases. However, the structural variability of database schema (DS), as a widely seen real-world scenario, is yet underexplored. Specifically, confronted with the same input question, the target SQL may be represented in different ways when the DS comes to a different structure. In this work, we provide in-depth discussions about the schema generalizability challenge of text-to-SQL tasks. We observe that current datasets are too templated to study schema generalization. To collect suitable test data, we propose a framework to generate novel text-to-SQL data via automatic and synchronous (DS, SQL) pair altering. When evaluating state-of-the-art text-to-SQL models on the synthetic samples, performance is significantly degraded, which demonstrates the limitation of current research regarding schema generalization.

Anthology ID:: 2023.findings-acl.87
Volume:: Findings of the Association for Computational Linguistics: ACL 2023
Month:: July
Year:: 2023
Address:: Toronto, Canada
Editors:: Anna Rogers, Jordan Boyd-Graber, Naoaki Okazaki
Venue:: Findings
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 1344–1360
Language:
URL:: https://aclanthology.org/2023.findings-acl.87
DOI:: 10.18653/v1/2023.findings-acl.87
Bibkey:
Cite (ACL):: Jieyu Li, Lu Chen, Ruisheng Cao, Su Zhu, Hongshen Xu, Zhi Chen, Hanchong Zhang, and Kai Yu. 2023. Exploring Schema Generalizability of Text-to-SQL. In Findings of the Association for Computational Linguistics: ACL 2023, pages 1344–1360, Toronto, Canada. Association for Computational Linguistics.
Cite (Informal):: Exploring Schema Generalizability of Text-to-SQL (Li et al., Findings 2023)
Copy Citation:
PDF:: https://aclanthology.org/2023.findings-acl.87.pdf

PDF Cite Search