AaltoMar 3, 2026arXiv:2603.02745

Enhancing User Throughput in Multi-panel mmWave Radio Access Networks for Beam-based MU-MIMO Using a DRL Method

Ramin Hashemi, Vismika Ranasinghe, Teemu Veijalainen, Petteri Kela, Risto Wichman

AI Summary

This paper introduces a deep reinforcement learning (DRL) framework for optimizing beam selection in multi-panel mmWave radio access networks employing MU-MIMO. The DRL agent learns an adaptive beam management strategy by modeling the environment as a Markov decision process (MDP) and incorporating spatial domain characteristics like beam cross-correlation, RSRP, and beam usage statistics. Results demonstrate a throughput increase of up to 16% and latency reduction by factors of 3-7x compared to legacy beam management techniques.

Key Contribution

DRL-based beam management in mmWave MU-MIMO networks can boost throughput by 16% and slash latency by up to 7x compared to traditional methods.

Abstract

Millimeter-wave (mmWave) communication systems, particularly those leveraging multi-user multiple-input and multiple-output (MU-MIMO) with hybrid beamforming, face challenges in optimizing user throughput and minimizing latency due to the high complexity of dynamic beam selection and management. This paper introduces a deep reinforcement learning (DRL) approach for enhancing user throughput in multi-panel mmWave radio access networks in a practical network setup. Our DRL-based formulation utilizes an adaptive beam management strategy that models the interaction between the communication agent and its environment as a Markov decision process (MDP), optimizing beam selection based on real-time observations. The proposed framework exploits spatial domain (SD) characteristics by incorporating the cross-correlation between the beams in different antenna panels, the measured reference signal received power (RSRP), and the beam usage statistics to dynamically adjust beamforming decisions. As a result, the spectral efficiency is improved and end-to-end latency is reduced. The numerical results demonstrate an increase in throughput of up to 16% and a reduction in latency by factors 3-7x compared to baseline (legacy beam management).

Architecture Design (Transformers, SSMs, MoE)Distributed Systems & Hardware Training Efficiency & Optimization

Citation Metrics

Citations0

Influential citations0

References0

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

Enhancing User Throughput in Multi-panel mmWave Radio Access Networks for Beam-based MU-MIMO Using a DRL Method

Related Papers