Jie Shi
2025
Gen-SQL: Efficient Text-to-SQL By Bridging Natural Language Question And Database Schema With Pseudo-Schema
Jie Shi
|
Bo Xu
|
Jiaqing Liang
|
Yanghua Xiao
|
Jia Chen
|
Chenhao Xie
|
Peng Wang
|
Wei Wang
Proceedings of the 31st International Conference on Computational Linguistics
With the prevalence of Large Language Models (LLMs), recent studies have shifted paradigms and leveraged LLMs to tackle the challenging task of Text-to-SQL. Because of the complexity of real world databases, previous works adopt the retrieve-then-generate framework to retrieve relevant database schema and then to generate the SQL query. However, efficient embedding-based retriever suffers from lower retrieval accuracy, and more accurate LLM-based retriever is far more expensive to use, which hinders their applicability for broader applications. To overcome this issue, this paper proposes Gen-SQL, a novel generate-ground-regenerate framework, where we exploit prior knowledge from the LLM to enhance embedding-based retriever and reduce cost. Experiments on several datasets are conducted to demonstrate the effectiveness and scalability of our proposed method. We release our code and data at https://github.com/jieshi10/gensql.
Search
Fix data
Co-authors
- Jia Chen 1
- Jiaqing Liang 1
- Peng Wang 1
- Wei Wang 1
- Yanghua Xiao 1
- show all...