This research introduces an AI-driven synthetic data generation (SDG) framework tailored to smart city cybersecurity, addressing the shortage of high-quality datasets needed to develop and validate security tools. Using generative models, the framework produces datasets that simulate device behaviors, network interactions, and cyber-attack scenarios. The synthetic datasets are evaluated for conformity to protocol standards, statistical similarity to the original data, and utility in common security tools. The framework and its evaluation metrics are expected to help researchers model threats and evaluate defensive strategies, and so better protect critical smart city infrastructure.
AI-generated synthetic datasets can fill the critical data void in smart city cybersecurity, enabling more effective threat modeling and defense evaluation.
Smart cities rely on interconnected cyber-physical systems that integrate sensors, IoT devices, cloud platforms, and AI-driven services and decision-making. While these systems enhance city services, they also introduce complex cybersecurity challenges because of their large attack surfaces, heterogeneous data flows, and evolving threat vectors. Developing and validating cybersecurity tools for smart cities requires high-quality datasets that accurately represent real operational conditions. However, real-world datasets are often incomplete, contain privacy-sensitive data, are difficult to access, or lack enough malicious activity to support tool development. This research addresses that gap by proposing an AI-based synthetic data generation (SDG) framework designed specifically for smart city cybersecurity research. The framework uses generative artificial intelligence models to produce high-fidelity synthetic cybersecurity datasets that replicate realistic device behaviors, network interactions, and cyber-attack scenarios. The synthetic datasets are evaluated on three criteria: conformity to protocol standards, statistical similarity to the original datasets, and utility in common security tools. The resulting framework and evaluation metrics are expected to advance smart city cybersecurity by enabling researchers to model threats more effectively and evaluate defensive techniques more comprehensively, and thereby better protect critical smart city infrastructure.
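The abstract lists statistical similarity to the original datasets as an evaluation criterion but does not name a metric. As an illustration only, and not the authors' method, the sketch below computes the two-sample Kolmogorov-Smirnov statistic, one common way to compare the distribution of a single numeric feature in real and synthetic data. The function name `ks_statistic` and the packet-size values are hypothetical and chosen for the example.

```python
from bisect import bisect_right


def ks_statistic(real, synthetic):
    """Two-sample Kolmogorov-Smirnov statistic.

    Returns the largest absolute gap between the empirical CDFs of the
    two samples: 0.0 means the empirical distributions coincide, 1.0
    means the samples do not overlap at all.
    """
    if not real or not synthetic:
        raise ValueError("both samples must be non-empty")
    a = sorted(real)
    b = sorted(synthetic)
    max_gap = 0.0
    # Empirical CDFs are step functions, so the gap only changes at
    # observed values; checking those points is sufficient.
    for x in set(a) | set(b):
        cdf_a = bisect_right(a, x) / len(a)
        cdf_b = bisect_right(b, x) / len(b)
        max_gap = max(max_gap, abs(cdf_a - cdf_b))
    return max_gap


if __name__ == "__main__":
    # Hypothetical packet sizes in bytes (illustrative values, not from the paper).
    real_sizes = [60, 64, 128, 512, 1500, 1500]
    synthetic_sizes = [60, 66, 130, 500, 1480, 1500]
    print("KS statistic:", ks_statistic(real_sizes, synthetic_sizes))
```

A lower value suggests the synthetic feature tracks the real one more closely. A per-feature check like this says nothing about protocol conformity or downstream utility, which the abstract treats as separate criteria.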