This paper introduces a system for generating synthetic training data for object detection and segmentation tasks in autonomous driving using Stable Diffusion, CVAT, SAM, and BLIP. The system generates photorealistic images, annotates them automatically, and produces descriptive image captions. The resulting synthetic dataset is used to train a YOLOv8 model, achieving performance comparable to or exceeding that of models trained on real-world data.
Synthetic data generated via Stable Diffusion and SAM can match or exceed the performance of real-world data for training YOLOv8 object detection models in autonomous driving scenarios.
Training models for object detection and segmentation in autonomous driving typically relies on real-world data, which is costly to annotate, difficult to acquire, and heterogeneous. We present a system based on deep learning and generative AI for producing high-quality synthetic data to overcome these challenges. Our system generates photorealistic synthetic images with Stable Diffusion models and annotates them using the CVAT annotation tool and the Segment Anything Model (SAM). After cleaning the annotations and converting them into segmentation masks, we split the data into training and validation subsets. We then train YOLOv8 on object detection and segmentation tasks with this dataset, which lets us evaluate the quality of the generated synthetic data. In addition, we use Salesforce's BLIP image-captioning model to produce rich, descriptive image captions. In our experiments, models trained on synthetic data perform comparably to, or even better than, models trained on real data. These results demonstrate the strong potential of synthetic datasets as a cost-effective and scalable way to train perception systems for autonomous vehicles, particularly for rare or challenging driving scenarios.
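To illustrate the annotation-conversion step described above, here is a minimal sketch of turning a binary segmentation mask (such as one produced by SAM) into a YOLO-format detection label line. The helper name `mask_to_yolo_bbox` is hypothetical, not from the paper; the label layout (`class x_center y_center width height`, normalized to [0, 1]) follows the standard YOLO convention.

```python
import numpy as np

def mask_to_yolo_bbox(mask: np.ndarray, class_id: int = 0) -> str:
    """Convert a binary mask of shape (H, W) into a YOLO detection label:
    'class x_center y_center width height', all coordinates normalized."""
    ys, xs = np.nonzero(mask)          # pixel coordinates of the object
    if xs.size == 0:
        raise ValueError("mask contains no foreground pixels")
    h, w = mask.shape
    x0, x1 = xs.min(), xs.max() + 1    # tight bounding box (exclusive max)
    y0, y1 = ys.min(), ys.max() + 1
    xc = (x0 + x1) / 2 / w             # normalized box center
    yc = (y0 + y1) / 2 / h
    bw = (x1 - x0) / w                 # normalized box size
    bh = (y1 - y0) / h
    return f"{class_id} {xc:.6f} {yc:.6f} {bw:.6f} {bh:.6f}"

# Toy example: a 4x4 foreground square inside a 10x10 image.
mask = np.zeros((10, 10), dtype=np.uint8)
mask[2:6, 3:7] = 1
print(mask_to_yolo_bbox(mask))  # → "0 0.500000 0.400000 0.400000 0.400000"
```

In the full pipeline, one such label line per object instance would be written to a `.txt` file alongside each synthetic image before the train/validation split and YOLOv8 training.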