Text-to-SQL: a methodical review of challenges and models

Yükleniyor...
Küçük Resim

Tarih

2024-05-20

Dergi Başlığı

Dergi ISSN

Cilt Başlığı

Yayıncı

TÜBİTAK

Erişim Hakkı

info:eu-repo/semantics/closedAccess

Araştırma projeleri

Organizasyon Birimleri

Dergi sayısı

Özet

This survey focuses on Text-to-SQL, automated translation of natural language queries into SQL queries. Initially, we describe the problem and its main challenges. Then, by following the PRISMA systematic review methodology, we survey the existing Text-to-SQL review papers in the literature. We apply the same method to extract proposed Text-to-SQL models and classify them with respect to used evaluation metrics and benchmarks. We highlight the accuracies achieved by various models on Text-to-SQL datasets and discuss execution-guided evaluation strategies. We present insights into model training times and implementations of different models. We also explore the availability of Text-to-SQL datasets in non-English languages. Additionally, we focus on large language model (LLM) based approaches for the Text-to-SQL task, where we examine LLM-based studies in the literature and subsequently evaluate the LLMs on the cross-domain Spider dataset. Finally, we conclude with a discussion of future directions for Text-to-SQL research, identifying potential areas of improvement and advancements in this field.

Açıklama

Anahtar Kelimeler

Text-to-SQL, Large language model, Natural language processing, Deep learning, Computational linguistics, Large datasets, Natural language processing systems, Automated translation, Language model, Language processing, Natural language queries, Natural languages, SQL query

Kaynak

Turkish Journal of Electrical Engineering and Computer Sciences

WoS Q Değeri

Q3

Scopus Q Değeri

Q2

Cilt

32

Sayı

3

Künye

Kanburoğlu, A. B. & Tek, F. B. (2024). Text-to-SQL: a methodical review of challenges and models. Turkish Journal of Electrical Engineering and Computer Sciences, 32(3), 403-419. doi:10.55730/1300-0632.4077