Search papers, labs, and topics across Lattice.
This paper introduces a deep reinforcement learning (DRL) framework for optimizing beam selection in multi-panel mmWave radio access networks employing MU-MIMO. The DRL agent learns an adaptive beam management strategy by modeling the environment as a Markov decision process (MDP) and incorporating spatial domain characteristics like beam cross-correlation, RSRP, and beam usage statistics. Results demonstrate a throughput increase of up to 16% and latency reduction by factors of 3-7x compared to legacy beam management techniques.
DRL-based beam management in mmWave MU-MIMO networks can boost throughput by 16% and slash latency by up to 7x compared to traditional methods.
Millimeter-wave (mmWave) communication systems, particularly those leveraging multi-user multiple-input and multiple-output (MU-MIMO) with hybrid beamforming, face challenges in optimizing user throughput and minimizing latency due to the high complexity of dynamic beam selection and management. This paper introduces a deep reinforcement learning (DRL) approach for enhancing user throughput in multi-panel mmWave radio access networks in a practical network setup. Our DRL-based formulation utilizes an adaptive beam management strategy that models the interaction between the communication agent and its environment as a Markov decision process (MDP), optimizing beam selection based on real-time observations. The proposed framework exploits spatial domain (SD) characteristics by incorporating the cross-correlation between the beams in different antenna panels, the measured reference signal received power (RSRP), and the beam usage statistics to dynamically adjust beamforming decisions. As a result, the spectral efficiency is improved and end-to-end latency is reduced. The numerical results demonstrate an increase in throughput of up to 16% and a reduction in latency by factors 3-7x compared to baseline (legacy beam management).