Search papers, labs, and topics across Lattice.
This paper tackles the online 3D bin packing problem by introducing a diffusion model-based actor network within a reinforcement learning framework. The diffusion model is used to generate packing strategies, improving exploration and sample efficiency compared to traditional DRL methods. Experiments demonstrate a significant increase in the average number of packed items, showcasing the potential for real-world application.
Diffusion models can supercharge reinforcement learning for online 3D bin packing, achieving significantly better packing density than existing DRL approaches.
The online 3D bin packing problem is important in logistics, warehousing and intelligent manufacturing, with solutions shifting to deep reinforcement learning (DRL) which faces challenges like low sample efficiency. This paper proposes a diffusion reinforcement learning-based algorithm, using a Markov decision chain for packing modeling, height map-based state representation and a diffusion model-based actor network. Experiments show it significantly improves the average number of packed items compared to state-of-the-art DRL methods, with excellent application potential in complex online scenarios.