Hangu Yeo
2022
Addressing Limitations of Encoder-Decoder Based Approach to Text-to-SQL
Octavian Popescu
|
Irene Manotas
|
Ngoc Phuoc An Vo
|
Hangu Yeo
|
Elahe Khorashani
|
Vadim Sheinin
Proceedings of the 29th International Conference on Computational Linguistics
Most attempts on Text-to-SQL task using encoder-decoder approach show a big problem of dramatic decline in performance for new databases. For the popular Spider dataset, despite models achieving 70% accuracy on its development or test sets, the same models show a huge decline below 20% accuracy for unseen databases. The root causes for this problem are complex and they cannot be easily fixed by adding more manually created training. In this paper we address the problem and propose a solution that is a hybrid system using automated training-data augmentation technique. Our system consists of a rule-based and a deep learning components that interact to understand crucial information in a given query and produce correct SQL as a result. It achieves double-digit percentage improvement for databases that are not part of the Spider corpus.
2018
QUEST: A Natural Language Interface to Relational Databases
Vadim Sheinin
|
Elahe Khorashani
|
Hangu Yeo
|
Kun Xu
|
Ngoc Phuoc An Vo
|
Octavian Popescu
Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018)
Search
Co-authors
- Vadim Sheinin 2
- Elahe Khorashani 2
- Ngoc Phuoc An Vo 2
- Octavian Popescu 2
- Kun Xu 1
- show all...