This paper introduces STDSH-MARL, a multi-agent deep reinforcement learning framework for human-centric traffic signal control in corridor networks, explicitly modeling multimodal travelers and public transportation. The framework employs a dual-stage hypergraph attention mechanism to capture spatio-temporal dependencies across traffic signals, using spatial and temporal hyperedges to represent complex interactions. Results on a corridor network demonstrate that STDSH-MARL outperforms state-of-the-art baselines in improving multimodal performance and prioritizing public transportation, with temporal hyperedges being the most influential component.
By modeling spatio-temporal dependencies with a dual-stage hypergraph attention mechanism, STDSH-MARL consistently improves multimodal traffic performance across five traffic scenarios, with clear benefits for public transportation priority.
Human-centric traffic signal control in corridor networks must increasingly account for multimodal travelers, particularly high-occupancy public transportation, rather than focusing solely on vehicle-centric performance. This paper proposes STDSH-MARL (Spatio-Temporal Dual-Stage Hypergraph based Multi-Agent Reinforcement Learning), a scalable multi-agent deep reinforcement learning framework that follows a centralized training and decentralized execution paradigm. The proposed method captures spatio-temporal dependencies through a novel dual-stage hypergraph attention mechanism that models interactions across both spatial and temporal hyperedges. In addition, a hybrid discrete action space is introduced to jointly determine the next signal phase configuration and its corresponding green duration, enabling more adaptive signal timing decisions. Experiments conducted on a corridor network under five traffic scenarios demonstrate that STDSH-MARL consistently improves multimodal performance and provides clear benefits for public transportation priority. Compared with state-of-the-art baseline methods, the proposed approach achieves superior overall performance. Further ablation studies confirm the contribution of each component of STDSH-MARL, with temporal hyperedges identified as the most influential factor driving the observed performance gains.
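The hybrid discrete action space mentioned above can be pictured as a single flat action index that decodes into a (phase, green duration) pair. The following is a minimal sketch under stated assumptions, not the paper's implementation: the phase names, the candidate durations, and the helpers `decode_action` and `encode_action` are all hypothetical, and the authors may instead use separate output heads for phase and duration.

```python
from itertools import product

# Hypothetical values: the source does not list the phases or the durations.
PHASES = ("NS_through", "NS_left", "EW_through", "EW_left")
GREEN_DURATIONS_S = (10, 20, 30)

# Enumerate every (phase index, duration) pair once; the list index is the action id.
ACTIONS = list(product(range(len(PHASES)), GREEN_DURATIONS_S))


def decode_action(action_id):
    """Map a flat discrete action id to (phase name, green time in seconds)."""
    phase_idx, green_s = ACTIONS[action_id]
    return PHASES[phase_idx], green_s


def encode_action(phase_name, green_s):
    """Inverse of decode_action: map (phase name, green seconds) to the action id."""
    return ACTIONS.index((PHASES.index(phase_name), green_s))


if __name__ == "__main__":
    # An agent's policy would output one id in range(len(ACTIONS)) per decision step.
    for action_id in (0, 5, 10):
        print(action_id, decode_action(action_id))
```

With four phases and three durations this yields 12 joint actions, so one discrete choice fixes both what to show next and for how long. The trade-off of flattening is that the action count grows as the product of the two sets, which is one reason a factored design might be preferred for larger duration grids.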