The task of NL2SQL is to convert natural language questions into SQL queries that retrieve answers from a database. Existing methods that use Large Language Models (LLMs) to guide SQL generation struggle with large databases and complex multi-table queries, especially when it comes to filtering out redundant schema information and constructing prompts efficiently.

Figure 1: Overview of the RB-SQL framework.

The RB-SQL Framework

To address the above issues, the RB-SQL framework is proposed, which contains three modules:

Table-Retriever

Retrieves the tables most relevant to the question.

Column-Retriever

Further retrieves relevant columns from the retrieved tables.

SQL-Skeleton-Retriever

Searches for a small number of examples with similar SQL skeletons and introduces the SQL skeletons into the example organization to enhance the in-context learning process.

The RB-SQL framework uses the Dense Passage Retrieval (DPR) model to retrieve the relevant tables, columns, and examples needed to construct an effective prompt. In addition, the framework introduces SQL skeletons as an intermediate step in the example organization to guide the model toward correct SQL generation.
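As a rough illustration of how the retrieved pieces could be assembled into a prompt, a minimal sketch follows. The layout, function name, and field names are assumptions for illustration, not the paper's actual template:

```python
def build_prompt(question, tables, columns, examples):
    """Assemble an NL2SQL prompt from retrieved context.

    NOTE: illustrative layout only -- not the paper's exact template.
    `tables` is a list of retrieved table names, `columns` maps a table
    name to its retrieved columns, and each example carries a question,
    its SQL skeleton, and the gold SQL.
    """
    # Schema section: only the retrieved tables, restricted to the
    # retrieved columns, so redundant schema text never enters the prompt.
    schema = "\n".join(
        f"Table {t}({', '.join(columns.get(t, []))})" for t in tables
    )
    # Few-shot section: the SQL skeleton sits between the question and
    # the gold SQL, acting as an intermediate reasoning step.
    shots = "\n\n".join(
        f"Question: {ex['question']}\n"
        f"SQL skeleton: {ex['skeleton']}\n"
        f"SQL: {ex['sql']}"
        for ex in examples
    )
    return f"{schema}\n\n{shots}\n\nQuestion: {question}\nSQL:"
```

The prompt ends at `SQL:`, so the LLM's completion is the predicted query for the new question.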


Workflow of the Modules

Table-Retriever

  1. Uses BERT to separately encode the question and each table
  2. Computes question–table similarity scores via MaxSim-based late interaction
  3. Retrieves the tables most relevant to the question
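The scoring step above can be sketched in NumPy. This is a minimal sketch of MaxSim-style late interaction over precomputed token embeddings; the function names and top-k cutoff are illustrative, not from the paper:

```python
import numpy as np

def maxsim_score(q_emb: np.ndarray, t_emb: np.ndarray) -> float:
    """MaxSim late interaction between question and table token embeddings.

    q_emb: (num_question_tokens, dim), t_emb: (num_table_tokens, dim).
    Each question token is matched to its most similar table token by
    cosine similarity, and the per-token maxima are summed.
    """
    q = q_emb / np.linalg.norm(q_emb, axis=1, keepdims=True)  # unit vectors
    t = t_emb / np.linalg.norm(t_emb, axis=1, keepdims=True)
    sim = q @ t.T                        # pairwise cosine similarities
    return float(sim.max(axis=1).sum())  # best table token per question token

def retrieve_tables(q_emb, table_embs, k=4):
    """Rank candidate tables by MaxSim score and keep the top-k names."""
    scores = {name: maxsim_score(q_emb, emb) for name, emb in table_embs.items()}
    return sorted(scores, key=scores.get, reverse=True)[:k]
```

In practice `q_emb` and each table's embedding matrix would come from the BERT encoder; here they are plain arrays so the scoring logic stands alone.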

Column-Retriever

  1. Retrieves, from the selected tables, the columns most relevant to the question

SQL-Skeleton-Retriever

  1. Searches the training set for a small number of examples with similar SQL skeletons
  2. Introduces the retrieved SQL skeletons into the example organization to strengthen in-context learning
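One plausible way to derive such a skeleton is to mask literals and schema-specific identifiers while keeping SQL keywords and operators. This is a sketch under that assumption; the paper's exact masking rules may differ:

```python
import re

def sql_skeleton(sql: str) -> str:
    """Reduce a SQL query to a structural skeleton.

    NOTE: one plausible skeleton definition (mask values and schema names,
    keep keywords); the paper's exact rules are not reproduced here.
    """
    s = re.sub(r"'[^']*'", "_", sql)        # mask string literals
    s = re.sub(r"\b\d+(\.\d+)?\b", "_", s)  # mask numeric literals
    keywords = {
        "select", "from", "where", "group", "by", "order", "having",
        "join", "on", "and", "or", "not", "in", "as", "distinct",
        "count", "avg", "sum", "min", "max", "limit", "desc", "asc",
    }
    tokens = re.findall(r"\w+|\S", s)
    # keep keywords and punctuation; mask everything else (tables, columns)
    masked = [t if t.lower() in keywords or not t[0].isalpha() else "_"
              for t in tokens]
    return " ".join(masked)
```

Two questions whose queries share a skeleton (e.g. `SELECT _ FROM _ WHERE _ > _`) then become natural candidates for few-shot examples of each other.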

Experimental Results

Experiments on the public BIRD and Spider benchmarks show that RB-SQL outperforms several competitive baselines, including GPT-4, DIN-SQL, and DAIL-SQL.

| Model | EX | VES |
|---|---|---|
| ChatGPT + CoT | 36.64 | 42.30 |
| GPT-4 | 46.35 | 49.77 |
| DIN-SQL + GPT-4 | 50.72 | 58.79 |
| DAIL-SQL + GPT-4 | 54.76 | 56.08 |
| RB-SQL + GPT-4 | 58.07 | 59.72 |

Table 2: EX and VES on the dev set of the BIRD dataset.

| Model | EX (dev) | EX (test) |
|---|---|---|
| C3 + ChatGPT | 81.80 | 82.30 |
| DIN-SQL + GPT-4 | 82.80 | 85.30 |
| DAIL-SQL + GPT-4 | 84.40 | 86.60 |
| RB-SQL + GPT-4 | 84.91 | 85.68 |
| + Generated Evidence | 85.89 | 86.73 |

Table 3: EX on both dev and test sets of Spider.

Ablation studies were also conducted, demonstrating that every module in the RB-SQL framework contributes to the performance improvement.

| Method | EX | VES |
|---|---|---|
| (1) RB-SQL + GPT-4 | 58.07 | 59.72 |
| (2) GPT-4 | 46.35 (↓11.72) | 49.77 (↓9.95) |
| (3) + Table-Retriever & Column-Retriever | 54.06 (↓4.01) | 56.11 (↓3.61) |
| (4) + SQL skeleton (example organization) | 54.48 (↓3.59) | 56.38 (↓3.34) |
| (5) + SQL-Skeleton-Retriever (example selection) | 55.19 (↓2.88) | 56.81 (↓2.91) |
| (6) + Error correction | 58.07 (↓0.0) | 59.72 (↓0.0) |

Table 4: Results of ablation study on BIRD. "+" means adding a module on the basis of the previous row; arrows give the drop relative to the full model in row (1).

Conclusion

The RB-SQL framework provides an effective approach for handling large databases and complex multi-table queries in the NL2SQL task. By leveraging retrieval-based modules and introducing SQL skeletons, RB-SQL enhances the in-context learning process and guides correct SQL generation. Experimental results validate the superior performance of RB-SQL compared to existing baselines.
