Search papers, labs, and topics across Lattice.
This paper introduces SDQN-RMFS, a neuromorphic reinforcement learning framework designed for efficient pathfinding in robotic mobile fulfillment systems (RMFS) under real-time constraints. By utilizing a collision-allowing training strategy and converting a trained artificial neural network (ANN) into a spiking neural network (SNN), the framework achieves significant energy savings and reduced latency while maintaining decision quality. Experimental results show an impressive 11,281脳 reduction in energy consumption and nearly 50% latency improvement compared to traditional GPU implementations, highlighting the potential of neuromorphic computing in practical applications.
Achieving over 11,000脳 energy savings in robotic pathfinding without sacrificing decision quality could revolutionize the efficiency of mobile fulfillment systems.
Dynamic environmental changes, confined workspaces, and stringent real-time constraints make pathfinding in Robotic Mobile Fulfillment Systems (RMFS) a challenging problem for conventional search- and rule-based methods, which typically suffer from high computational complexity and long decision latency. While reinforcement learning (RL) has emerged as a powerful alternative, deploying learned policies with extreme energy efficiency on resource-constrained hardware remains an open challenge. We present SDQN-RMFS, an end-to-end framework that achieves high-fidelity deployment of an RL-trained policy from a full-precision artificial neural network (ANN) through to a neuromorphic chip. By computing only when triggered by sparse events, this framework unlocks ultra-low-power RMFS pathfinding. Our full-stack pipeline operates as follows: an ANN policy is first efficiently trained via a collision-allowing strategy to densify informative trajectories, and then converted into a spiking neural network (SNN) via a hard-label knowledge distillation approach. This effectively addresses the output distribution mismatch, preserving policy capability across the ANN-to-SNN pipeline while substantially reducing inference latency. Hardware experiments demonstrate up to 11,281$\times$ energy savings and a nearly two-fold reduction in latency compared to a high-performance GPU baseline, while maintaining decision quality on par with the original trained policy. These results establish physical neuromorphic inference as a practical and energy-sustainable pathway for large-scale RMFS operations.