HFUTMay 25, 2026arXiv:2605.25943

STaT: Resolving Shape Distortion in Non-Stationary Time Series via Tri-Modal Synergy

Hui Cheng, Jinsheng Guo, Zhenhao Weng, Yan Qiao

AI Summary

The paper introduces STaT, a novel multi-modal architecture for time series forecasting that integrates symbolic, temporal, and textual modalities to address shape distortion issues in non-stationary environments. STaT converts continuous time series into discrete tokens to identify structural patterns, extracts sequential dependencies, and leverages domain semantics for macroscopic trend forecasting. Experiments on eight real-world datasets show that STaT improves magnitude indicators by up to 8.9% and reduces shape distortion by up to 8.5% compared to existing methods.

Key Contribution

Multi-modal time series models can finally capture both magnitude and shape, thanks to a novel symbolic representation that discretizes continuous data into tokens.

Abstract

Recent research in time series forecasting frequently investigates the integration of textual and visual modalities with numerical models to better navigate non-stationary environments. Despite delivering solid numerical results, existing multi-modal approaches usually encounter a dilemma: prioritizing the minimization of average errors can result in excessively smooth forecasts that overlook essential fluctuations. To resolve this limitation, we introduce STaT, an innovative multimodal architecture for Symbolic-Temporal-Textual Alignment, which seamlessly unites three synergistic modalities. Specifically, the symbolic modality converts continuous time series into discrete tokens, facilitating the accurate identification of structural patterns and turning points; the temporal modality extracts inherent sequential dependencies; and the textual modality leverages domain semantics to steer the macroscopic forecasting trends. Comprehensive evaluations on eight real-world benchmarks indicate that STaT delivers exceptional performance, enhancing conventional magnitude indicators by up to 8.9% while simultaneously decreasing shape distortion by up to 8.5%.

Architecture Design (Transformers, SSMs, MoE)Multimodal Models Natural Language Processing

Citation Metrics

Citations0

Influential citations0

References0

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

STaT: Resolving Shape Distortion in Non-Stationary Time Series via Tri-Modal Synergy

Related Papers