Termite Italian Text-to-SQL: A CALAMITA Challenge

Federico Ranaldi, Elena Sofia Ruzzetti, Dario Onorati, Fabio Massimo Zanzotto, Leonardo Ranaldi


Abstract
We introduce Termite, a definitely unseen resource for evaluating Text-to-SQL in Italian. Specifically, we transfer evaluation pipelines beyond English, proposing novel, definitely unseen resources that avoid data-contamination phenomena while assessing the ability of models to perform Text-to-SQL tasks when natural language queries are written in Italian. We establish an evaluation grid based on execution accuracy.
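To illustrate what an execution-accuracy metric typically measures, the following is a minimal sketch, not the authors' evaluation code. It assumes a SQLite database, a toy Italian-named table (`studenti`), and an order-insensitive comparison of result rows; the abstract does not specify the paper's exact comparison protocol, and the helper names `run_query` and `execution_accuracy` are hypothetical.

```python
import sqlite3
from collections import Counter


def run_query(conn, sql):
    """Return the result rows as a multiset, or None if the query fails."""
    try:
        rows = conn.execute(sql).fetchall()
    except sqlite3.Error:
        return None
    return Counter(rows)


def execution_accuracy(conn, pairs):
    """Fraction of (predicted, gold) SQL pairs whose executed results match.

    Assumption: rows are compared as multisets, so row order is ignored.
    """
    if not pairs:
        return 0.0
    hits = 0
    for predicted, gold in pairs:
        gold_rows = run_query(conn, gold)
        pred_rows = run_query(conn, predicted)
        if gold_rows is not None and pred_rows == gold_rows:
            hits += 1
    return hits / len(pairs)


# Toy in-memory database (hypothetical schema, for illustration only).
conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE studenti (id INTEGER PRIMARY KEY, nome TEXT, eta INTEGER);
    INSERT INTO studenti VALUES (1, 'Anna', 21), (2, 'Luca', 25), (3, 'Sara', 19);
    """
)

# Each pair is (predicted SQL, gold SQL) for one natural-language question.
pairs = [
    # Different SQL text, same result: counted as correct.
    ("SELECT nome FROM studenti WHERE eta >= 21",
     "SELECT nome FROM studenti WHERE eta > 20"),
    # Executes, but returns an extra row: counted as wrong.
    ("SELECT nome FROM studenti",
     "SELECT nome FROM studenti WHERE eta > 20"),
    # Syntax error in the prediction: counted as wrong.
    ("SELEC nome FROM studenti",
     "SELECT COUNT(*) FROM studenti"),
]

score = execution_accuracy(conn, pairs)
print(f"Execution accuracy: {score:.2f}")
```

Comparing executed results rather than SQL strings credits predictions that are worded differently from the gold query but return the same rows, which is the usual motivation for this kind of metric.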
Anthology ID:
2024.clicit-1.130
Volume:
Proceedings of the 10th Italian Conference on Computational Linguistics (CLiC-it 2024)
Month:
December
Year:
2024
Address:
Pisa, Italy
Editors:
Felice Dell'Orletta, Alessandro Lenci, Simonetta Montemagni, Rachele Sprugnoli
Venue:
CLiC-it
Publisher:
CEUR Workshop Proceedings
Pages:
1176–1183
URL:
https://aclanthology.org/2024.clicit-1.130/
Cite (ACL):
Federico Ranaldi, Elena Sofia Ruzzetti, Dario Onorati, Fabio Massimo Zanzotto, and Leonardo Ranaldi. 2024. Termite Italian Text-to-SQL: A CALAMITA Challenge. In Proceedings of the 10th Italian Conference on Computational Linguistics (CLiC-it 2024), pages 1176–1183, Pisa, Italy. CEUR Workshop Proceedings.
Cite (Informal):
Termite Italian Text-to-SQL: A CALAMITA Challenge (Ranaldi et al., CLiC-it 2024)
PDF:
https://aclanthology.org/2024.clicit-1.130.pdf