Jun 18, 2026arXiv:2606.20031

A Neuromorphic Reinforcement Learning Framework for Efficient Pathfinding in Robotic Mobile Fulfillment Systems

Junzhe Xu, Zecui Zeng, Lusong Li, Yuetong Fang, Renjing Xu

AI Summary

This paper introduces SDQN-RMFS, a neuromorphic reinforcement learning framework designed for efficient pathfinding in robotic mobile fulfillment systems (RMFS) under real-time constraints. By utilizing a collision-allowing training strategy and converting a trained artificial neural network (ANN) into a spiking neural network (SNN), the framework achieves significant energy savings and reduced latency while maintaining decision quality. Experimental results show an impressive 11,281× reduction in energy consumption and nearly 50% latency improvement compared to traditional GPU implementations, highlighting the potential of neuromorphic computing in practical applications.

Key Contribution

Achieving over 11,000× energy savings in robotic pathfinding without sacrificing decision quality could revolutionize the efficiency of mobile fulfillment systems.

Abstract

Dynamic environmental changes, confined workspaces, and stringent real-time constraints make pathfinding in Robotic Mobile Fulfillment Systems (RMFS) a challenging problem for conventional search- and rule-based methods, which typically suffer from high computational complexity and long decision latency. While reinforcement learning (RL) has emerged as a powerful alternative, deploying learned policies with extreme energy efficiency on resource-constrained hardware remains an open challenge. We present SDQN-RMFS, an end-to-end framework that achieves high-fidelity deployment of an RL-trained policy from a full-precision artificial neural network (ANN) through to a neuromorphic chip. By computing only when triggered by sparse events, this framework unlocks ultra-low-power RMFS pathfinding. Our full-stack pipeline operates as follows: an ANN policy is first efficiently trained via a collision-allowing strategy to densify informative trajectories, and then converted into a spiking neural network (SNN) via a hard-label knowledge distillation approach. This effectively addresses the output distribution mismatch, preserving policy capability across the ANN-to-SNN pipeline while substantially reducing inference latency. Hardware experiments demonstrate up to 11,281$\times$ energy savings and a nearly two-fold reduction in latency compared to a high-performance GPU baseline, while maintaining decision quality on par with the original trained policy. These results establish physical neuromorphic inference as a practical and energy-sustainable pathway for large-scale RMFS operations.

Robotics & Embodied AI

Citation Metrics

Citations0

Influential citations0

References0

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

A Neuromorphic Reinforcement Learning Framework for Efficient Pathfinding in Robotic Mobile Fulfillment Systems

Related Papers