DUT | Mar 8, 2026 | arXiv:2603.07449

Dial: A Knowledge-Grounded Dialect-Specific NL2SQL System

Xiang Zhang, Hongming Xu, Le Zhou, Wei Zhou, Xuanhe Zhou, Guoliang Li, Yuyu Luo, Changdong Liu, Guorun Chen, Jiang Liao, Fan Wu

AI Summary

The paper introduces Dial, a knowledge-grounded NL2SQL framework designed to handle the diverse SQL dialects found in enterprise database systems. Dial uses a dialect-aware logical query planning module, a hierarchical intent-aware knowledge base (HINT-KB), and an execution-driven debugging loop to generate dialect-specific SQL queries from natural language. Experiments on a new benchmark, DS-NL2SQL, demonstrate that Dial improves translation accuracy by 10.25% and dialect feature coverage by 15.77% compared to existing methods.

Key Contribution

Dial is a knowledge-grounded NL2SQL framework that improves translation accuracy by 10.25% and dialect feature coverage by 15.77% over state-of-the-art baselines on DS-NL2SQL, a benchmark covering six major database systems.

Abstract

Enterprises commonly deploy heterogeneous database systems, each with its own SQL dialect that differs in syntax rules, built-in functions, and execution constraints. However, most existing NL2SQL methods assume a single dialect (e.g., SQLite) and struggle to produce queries that are both semantically correct and executable on target engines. Prompt-based approaches tightly couple intent reasoning with dialect syntax, rule-based translators often degrade native operators into generic constructs, and multi-dialect fine-tuning suffers from cross-dialect interference. In this paper, we present Dial, a knowledge-grounded framework for dialect-specific NL2SQL. Dial introduces: (1) a Dialect-Aware Logical Query Planning module that converts natural language into a dialect-aware logical query plan via operator-level intent decomposition and divergence-aware specification; (2) HINT-KB, a hierarchical intent-aware knowledge base that organizes dialect knowledge into (i) a canonical syntax reference, (ii) a declarative function repository, and (iii) a procedural constraint repository; and (3) an execution-driven debugging and semantic verification loop that separates syntactic recovery from logic auditing to prevent semantic drift. We construct DS-NL2SQL, a benchmark covering six major database systems with 2,218 dialect-specific test cases. Experimental results show that Dial consistently improves translation accuracy by 10.25% and dialect feature coverage by 15.77% over state-of-the-art baselines. The code is at https://github.com/weAIDB/Dial.
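To make the idea of separating intent from dialect syntax concrete, the following is a minimal sketch and not the authors' implementation. It assumes a hypothetical `LogicalPlan` record, two toy lookup tables (`STRING_AGG`, `ROW_LIMIT`) standing in for a dialect knowledge base, and a hypothetical `first_executable` helper that covers only the syntactic-recovery half of an execution-driven loop. The four dialects and their entries are commonly documented engine behavior chosen for illustration; the abstract does not say which six systems DS-NL2SQL covers or what HINT-KB actually stores.

```python
import sqlite3
from dataclasses import dataclass


@dataclass
class LogicalPlan:
    """Hypothetical dialect-neutral intent: concatenate agg_col per group_col."""
    table: str
    group_col: str
    agg_col: str
    limit: int


# Toy stand-in for a dialect knowledge base (assumption, not HINT-KB itself):
# one logical operator maps to a different native construct per dialect.
STRING_AGG = {
    "sqlite": "GROUP_CONCAT({col}, ',')",
    "mysql": "GROUP_CONCAT({col} SEPARATOR ',')",
    "postgresql": "STRING_AGG({col}, ',')",
    "oracle": "LISTAGG({col}, ',') WITHIN GROUP (ORDER BY {col})",
}
ROW_LIMIT = {
    "sqlite": "LIMIT {n}",
    "mysql": "LIMIT {n}",
    "postgresql": "LIMIT {n}",
    "oracle": "FETCH FIRST {n} ROWS ONLY",
}


def render(plan: LogicalPlan, dialect: str) -> str:
    """Render the same logical plan with the target dialect's native operators."""
    agg = STRING_AGG[dialect].format(col=plan.agg_col)
    limit = ROW_LIMIT[dialect].format(n=plan.limit)
    return (
        f"SELECT {plan.group_col}, {agg} AS items FROM {plan.table} "
        f"GROUP BY {plan.group_col} ORDER BY {plan.group_col} {limit}"
    )


def first_executable(conn: sqlite3.Connection, candidates: list[str]):
    """Return (sql, rows) for the first candidate the engine accepts.

    Engine errors trigger a fallback to the next candidate; checking that the
    accepted query still matches the user's intent would be a separate step.
    """
    for sql in candidates:
        try:
            return sql, conn.execute(sql).fetchall()
        except sqlite3.Error:
            continue
    raise ValueError("no candidate executed on the target engine")


# Usage: the Oracle and MySQL renderings are rejected by SQLite, so the loop
# falls through to the SQLite rendering.
plan = LogicalPlan(table="emp", group_col="dept", agg_col="name", limit=2)
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE emp (dept TEXT, name TEXT)")
conn.executemany("INSERT INTO emp VALUES (?, ?)", [("a", "x"), ("a", "y"), ("b", "z")])
chosen_sql, rows = first_executable(
    conn, [render(plan, d) for d in ("oracle", "mysql", "sqlite")]
)
```

The design point the sketch tries to show is that the plan never mentions `LIMIT` or `LISTAGG`; those choices are deferred to a lookup keyed by dialect, so a native operator is selected rather than degraded into a generic construct.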

Code Generation & Program Synthesis | Natural Language Processing

Citation Metrics

Citations: 0

Influential citations: 0

References: 0

Year: 2026

Venue: N/A
