Search papers, labs, and topics across Lattice.
The paper introduces Dial, a knowledge-grounded NL2SQL framework designed to handle the diverse SQL dialects found in enterprise database systems. Dial uses a dialect-aware logical query planning module, a hierarchical intent-aware knowledge base (HINT-KB), and an execution-driven debugging loop to generate dialect-specific SQL queries from natural language. Experiments on a new benchmark, DS-NL2SQL, demonstrate that Dial improves translation accuracy by 10.25% and dialect feature coverage by 15.77% compared to existing methods.
Stop struggling with SQL dialects: Dial offers a knowledge-grounded approach that boosts NL2SQL accuracy by 10% and feature coverage by 15% across diverse database systems.
Enterprises commonly deploy heterogeneous database systems, each of which owns a distinct SQL dialect with different syntax rules, built-in functions, and execution constraints. However, most existing NL2SQL methods assume a single dialect (e.g., SQLite) and struggle to produce queries that are both semantically correct and executable on target engines. Prompt-based approaches tightly couple intent reasoning with dialect syntax, rule-based translators often degrade native operators into generic constructs, and multi-dialect fine-tuning suffers from cross-dialect interference. In this paper, we present Dial, a knowledge-grounded framework for dialect-specific NL2SQL. Dial introduces: (1) a Dialect-Aware Logical Query Planning module that converts natural language into a dialect-aware logical query plan via operator-level intent decomposition and divergence-aware specification; (2) HINT-KB, a hierarchical intent-aware knowledge base that organizes dialect knowledge into (i) a canonical syntax reference, (ii) a declarative function repository, and (iii) a procedural constraint repository; and (3) an execution-driven debugging and semantic verification loop that separates syntactic recovery from logic auditing to prevent semantic drift. We construct DS-NL2SQL, a benchmark covering six major database systems with 2,218 dialect-specific test cases. Experimental results show that Dial consistently improves translation accuracy by 10.25% and dialect feature coverage by 15.77% over state-of-the-art baselines. The code is at https://github.com/weAIDB/Dial.