Apr 13, 2026arXiv:2604.10953

Diffusion Reinforcement Learning Based Online 3D Bin Packing Spatial Strategy Optimization

Jie Han, Tongqing Li, Qingyang Xu, Yong Song, Bao Pang, Xianfeng Yuan

AI Summary

This paper tackles the online 3D bin packing problem by introducing a diffusion model-based actor network within a reinforcement learning framework. The diffusion model is used to generate packing strategies, improving exploration and sample efficiency compared to traditional DRL methods. Experiments demonstrate a significant increase in the average number of packed items, showcasing the potential for real-world application.

Key Contribution

Diffusion models can supercharge reinforcement learning for online 3D bin packing, achieving significantly better packing density than existing DRL approaches.

Abstract

The online 3D bin packing problem is important in logistics, warehousing and intelligent manufacturing, with solutions shifting to deep reinforcement learning (DRL) which faces challenges like low sample efficiency. This paper proposes a diffusion reinforcement learning-based algorithm, using a Markov decision chain for packing modeling, height map-based state representation and a diffusion model-based actor network. Experiments show it significantly improves the average number of packed items compared to state-of-the-art DRL methods, with excellent application potential in complex online scenarios.

Robotics & Embodied AI Training Efficiency & Optimization World Models & Planning

Citation Metrics

Citations0

Influential citations0

References30

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

Diffusion Reinforcement Learning Based Online 3D Bin Packing Spatial Strategy Optimization

Related Papers