FGCSQL: A Three-Stage Pipeline for Large Language Model-driven Chinese Text-to-SQL

Guanyu Jiang
Weibin Li
Chenglong Yu
Zixuan Zhu
Wei LI

Read the full article

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Recent advances in large language models have driven major breakthroughs in Text-to-SQL tasks. However, many challenges hinder the use of SQL parsers for cross-language tasks. In this article, we introduce FGCSQL, a novel three-stage pipeline framework to deal with three challenges: cross-language schema linking, SQL parsing potential of LLM, and error propagation in SQL parsers, in which the framework uniquely incorporates a Filtering Encoder to eliminate irrelevant database schema items, harnessing a pre-trained Generative Large Language Model fine-tuned on a carefully structured dataset for enhanced SQL parsing. Finally, a Correcting Decoder addresses error propagation, culminating in a robust system for semantic parsing tasks. Tested on the CSpider dataset, the FGCSQL showcases a substantial improvement in Exact-set-Match(EM) accuracy and EXecution accuracy(EX) metrics, validating the pipeline’s architecture’s effectiveness in mitigating the challenges typically confronted in Text-to-SQL conversion, especially in cross-lingual contexts. FGCSQL outstrips existing methods in execution precision, indicating the validity of our proposed method.

Version published to 10.20944/preprints202502.1059.v1
Feb 14, 2025

Optimizing AI Language Models: A Study of ChatGPT-4 vs. ChatGPT-4o

This article has 5 authors:
1. Md Nurul Absar Siddiky
2. Muhammad Enayetur Rahman
3. MD Fayaz Bin Hossen
4. Muhammad Rezaur Rahman
5. Md. Shahadat Jaman
This article has no evaluationsLatest version Feb 3, 2025
Hierarchical Neural Schema Construction for Enhanced Contextual Understanding in Large Language Models

This article has 5 authors:
1. Graeme Mousem
2. Robert Montague
3. Jonathan Blackwood
4. Frederick Langford
5. Matthew Abercrombie
This article has no evaluationsLatest version Jan 20, 2025
Metadata Conditioning Accelerates Language Model Pre-training

This article has 6 authors:
1. Tianyu Gao
2. Alexander Wettig
3. Luxi He
4. Yihe Dong
5. Sadhika Malladi
6. Danqi Chen
This article has no evaluationsLatest version Jan 15, 2025

Listed in

Abstract

Article activity feed

Related articles

Optimizing AI Language Models: A Study of ChatGPT-4 vs. ChatGPT-4o

Hierarchical Neural Schema Construction for Enhanced Contextual Understanding in Large Language Models

Metadata Conditioning Accelerates Language Model Pre-training