Boosting Text-to-SQL through Multi-grained Error Identification

Bo Xu, Shufei Li, Hongyu Jing, Ming Du, Hui Song, Hongya Wang, Yanghua Xiao


Abstract
Text-to-SQL is a technology that converts natural language questions into executable SQL queries, allowing users to query and manage relational databases more easily. In recent years, large language models have significantly advanced the development of text-to-SQL. However, existing methods often overlook validation of the generated results during the SQL generation process. Current error identification methods are mainly divided into self-correction approaches based on large models and feedback methods based on SQL execution, both of which have limitations. We categorize SQL errors into three main types: system errors, skeleton errors, and value errors, and propose a multi-grained error identification method. Experimental results demonstrate that this method can be integrated as a plugin into various methods, providing effective error identification and correction capabilities.
Anthology ID:
2025.coling-main.289
Volume:
Proceedings of the 31st International Conference on Computational Linguistics
Month:
January
Year:
2025
Address:
Abu Dhabi, UAE
Editors:
Owen Rambow, Leo Wanner, Marianna Apidianaki, Hend Al-Khalifa, Barbara Di Eugenio, Steven Schockaert
Venue:
COLING
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
4282–4292
Language:
URL:
https://aclanthology.org/2025.coling-main.289/
DOI:
Bibkey:
Cite (ACL):
Bo Xu, Shufei Li, Hongyu Jing, Ming Du, Hui Song, Hongya Wang, and Yanghua Xiao. 2025. Boosting Text-to-SQL through Multi-grained Error Identification. In Proceedings of the 31st International Conference on Computational Linguistics, pages 4282–4292, Abu Dhabi, UAE. Association for Computational Linguistics.
Cite (Informal):
Boosting Text-to-SQL through Multi-grained Error Identification (Xu et al., COLING 2025)
Copy Citation:
PDF:
https://aclanthology.org/2025.coling-main.289.pdf