Ewha Womans University | TU Munich | UQ | Jun 8, 2026 | arXiv:2606.09539

Efficient Traffic Prediction at Scale: A Systematic Study of STGCN Architectural Depth

Soban Nasir Lone, Mohamed Abouelela, Taeyoung Yu, Jiwon Kim, Constantinos Antoniou

AI Summary

This study systematically evaluates how architectural depth affects Spatio-Temporal Graph Convolutional Networks (STGCNs) for traffic prediction. Across four traffic datasets, a 1-block variant achieves the best short-term (10-minute) accuracy on three, with only marginal degradation (at most 1.8% relative error) at longer horizons. The standard 2-block architecture incurs 61% higher CPU inference latency and 37% lower throughput than the 1-block variant, which raises concerns about its practicality in resource-constrained intelligent transportation systems. The 3-block variant more than doubles computational cost for less than 0.5% relative improvement, suggesting that the commonly used 2-block configuration may be over-parameterized for many applications.

Key Contribution

The 1-block STGCN variant achieves the best short-term prediction accuracy on three of four datasets while avoiding the 61% higher CPU inference latency of the standard 2-block model, challenging the assumption that deeper architectures are always better for traffic prediction.

Abstract

Spatio-temporal graph neural networks (STGNNs) have become the dominant approach for traffic prediction, yet their computational requirements pose challenges for practical deployment in intelligent transportation systems (ITS). While recent work has proposed efficient alternatives to STGNNs, a fundamental question remains unexplored: are these architectures themselves over-parameterised? We examine this question using the Spatio-Temporal Graph Convolutional Network (STGCN), one of the most widely adopted models in this domain. Through systematic experiments across four diverse traffic datasets, we compare 1-block, 2-block (standard), and 3-block STGCN variants. Our findings reveal that the single-block architecture achieves optimal performance for short-term prediction (10 mins) on three of four datasets, while incurring only marginal degradation ($\leq$1.8% relative error) at longer horizons. Crucially, the 2-block variant incurs 61% higher CPU inference latency and 37% lower throughput relative to 1-block -- substantial overhead for resource-constrained ITS deployment. The 3-block architecture offers no favourable tradeoff, more than doubling computational cost for $<$0.5% relative improvement. These results suggest that the default 2-block STGCN may be over-parameterised for many applications, with implications for both practitioners deploying traffic prediction systems and researchers benchmarking efficiency-focused methods.
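The latency and throughput comparison described in the abstract can be illustrated with a minimal timing sketch. This is not the authors' benchmarking protocol, which is not given here. The helper names `benchmark` and `relative_change`, the warm-up and repeat counts, and the definition of throughput as batch size divided by median latency are all assumptions made for illustration.

```python
import statistics
import time


def benchmark(fn, batch_size, n_warmup=5, n_runs=30):
    """Time a zero-argument callable (hypothetical helper, not from the paper).

    Returns (median latency in seconds, throughput in samples per second),
    where throughput is ASSUMED to be batch_size / median latency.
    """
    # Warm-up calls are discarded so one-off setup costs do not skew timings.
    for _ in range(n_warmup):
        fn()

    timings = []
    for _ in range(n_runs):
        start = time.perf_counter()
        fn()
        timings.append(time.perf_counter() - start)

    latency = statistics.median(timings)
    return latency, batch_size / latency


def relative_change(baseline, candidate):
    """Percentage change of `candidate` relative to `baseline`."""
    return 100.0 * (candidate - baseline) / baseline


if __name__ == "__main__":
    # Stand-in workloads; in practice these would wrap a forward pass of the
    # 1-block and 2-block models on the same input batch.
    def shallow():
        return sum(range(10_000))

    def deep():
        return sum(range(20_000))

    lat_1, thr_1 = benchmark(shallow, batch_size=32)
    lat_2, thr_2 = benchmark(deep, batch_size=32)
    print(f"latency change:    {relative_change(lat_1, lat_2):+.1f}%")
    print(f"throughput change: {relative_change(thr_1, thr_2):+.1f}%")
```

Under the assumed throughput definition, a 61% latency increase would correspond to roughly 38% lower throughput (1/1.61 ≈ 0.62), which is close to the 37% reported in the abstract. The small gap suggests throughput may have been measured separately rather than derived from latency.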

Architecture Design (Transformers, SSMs, MoE) | Training Efficiency & Optimization

Citation Metrics

Citations: 0

Influential citations: 0

References: 0

Year: 2026

Venue: N/A
