Tsinghua AI, Hunan National Technology Innovation Center, NUDT | Feb 23, 2026 | arXiv:2602.19490

FuzzySQL: Uncovering Hidden Vulnerabilities in DBMS Special Features with LLM-Driven Fuzzing

Yongxin Chen, Zhiyuan Jiang, Chao Zhang, Haoran Xu, Shenglin Xu, Jianping Tang, Zheming Li, Peidai Xie, Yongjun Wang

AI Summary

FuzzySQL, an LLM-powered adaptive fuzzing framework, was developed to uncover vulnerabilities in under-explored DBMS special features by combining grammar-guided SQL generation with logic-shifting progressive mutation. This approach synthesizes diverse test cases and employs a hybrid error repair pipeline, unifying rule-based patching with LLM-driven semantic repair, to achieve deeper execution coverage. Evaluation across multiple DBMSs revealed 37 vulnerabilities, including 7 in special features, demonstrating the effectiveness of LLM-based fuzzing in finding hidden bugs.

Key Contribution

LLMs can uncover previously hidden vulnerabilities in database management systems by intelligently fuzzing obscure, system-level features that traditional fuzzers miss.

Abstract

Traditional database fuzzing techniques primarily focus on syntactic correctness and general SQL structures, leaving critical yet obscure DBMS features, such as system-level modes (e.g., GTID), programmatic constructs (e.g., PROCEDURE), and advanced process commands (e.g., KILL), largely underexplored. Although rarely triggered by typical inputs, these features can lead to severe crashes or security issues when executed under edge-case conditions. In this paper, we present FuzzySQL, a novel LLM-powered adaptive fuzzing framework designed to uncover subtle vulnerabilities in DBMS special features. FuzzySQL combines grammar-guided SQL generation with logic-shifting progressive mutation, a novel technique that explores alternative control paths by negating conditions and restructuring execution logic, synthesizing structurally and semantically diverse test cases. To further ensure deeper execution coverage of the back end, FuzzySQL employs a hybrid error repair pipeline that unifies rule-based patching with LLM-driven semantic repair, enabling automatic correction of syntactic and context-sensitive failures. We evaluate FuzzySQL across multiple DBMSs, including MySQL, MariaDB, SQLite, PostgreSQL, and ClickHouse, uncovering 37 vulnerabilities, 7 of which are tied to under-tested DBMS special features. As of this writing, 29 cases have been confirmed, with 9 assigned CVE identifiers, 14 already fixed by vendors, and additional vulnerabilities scheduled to be patched in upcoming releases. Our results highlight the limitations of conventional fuzzers in semantic feature coverage and demonstrate the potential of LLM-based fuzzing to discover deeply hidden bugs in complex database systems.
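The two ideas named in the abstract, negating a condition to reach an alternative control path and trying a rule-based patch before falling back to an LLM, can be illustrated with a toy sketch. This is not FuzzySQL's implementation: the function names `shift_logic`, `rule_based_repair`, and `run_with_repair` are hypothetical, the single repair rule is invented for illustration, and the LLM step is modeled as a caller-supplied callback rather than a real model call. SQLite is used only because Python ships with the `sqlite3` module.

```python
import re
import sqlite3
from typing import Callable, Optional

# Hypothetical table mapping each comparison operator to its logical negation.
_NEGATED = {"=": "<>", "<>": "=", "<": ">=", ">=": "<", ">": "<=", "<=": ">"}
# Two-character operators come first so "<=" is not matched as "<".
_OP_PATTERN = re.compile(r"<>|<=|>=|=|<|>")


def shift_logic(sql: str) -> str:
    """Negate the first comparison after an uppercase WHERE keyword.

    Toy assumption: operators never appear inside string literals.
    """
    head, sep, tail = sql.partition("WHERE")
    if not sep:
        return sql  # no condition to negate
    mutated = _OP_PATTERN.sub(lambda m: _NEGATED[m.group(0)], tail, count=1)
    return head + sep + mutated


def rule_based_repair(sql: str, error: str) -> Optional[str]:
    """Illustrative rule: create a table that SQLite reports as missing."""
    match = re.match(r"no such table: (\w+)", error)
    if match:
        return f"CREATE TABLE {match.group(1)} (a INTEGER); {sql}"
    return None


def run_with_repair(
    conn: sqlite3.Connection,
    sql: str,
    llm_repair: Optional[Callable[[str, str], Optional[str]]] = None,
) -> Optional[str]:
    """Run sql; on failure try the rule, then the optional LLM callback.

    Returns the script that finally executed, or None if repair failed.
    """
    try:
        conn.executescript(sql)
        return sql
    except sqlite3.Error as exc:
        error = str(exc)
    fixed = rule_based_repair(sql, error)
    if fixed is None and llm_repair is not None:
        fixed = llm_repair(sql, error)  # stand-in for an LLM-driven repair
    if fixed is None:
        return None
    try:
        conn.executescript(fixed)
        return fixed
    except sqlite3.Error:
        return None
```

In this sketch a seed such as `SELECT * FROM t WHERE a > 1` is mutated to use `a <= 1`, and a statement that fails because table `t` does not exist is retried after the rule prepends a `CREATE TABLE`. Trying the cheap rule first and invoking the callback only when no rule applies is one plausible reading of "hybrid" and is an assumption here.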

Code Generation & Program Synthesis, Natural Language Processing, Red-Teaming & Adversarial Robustness

Citation Metrics

Citations: 0

Influential citations: 0

References: 55

Year: 2026

Venue: N/A
