Search papers, labs, and topics across Lattice.
The paper introduces STaT, a novel multi-modal architecture for time series forecasting that integrates symbolic, temporal, and textual modalities to address shape distortion issues in non-stationary environments. STaT converts continuous time series into discrete tokens to identify structural patterns, extracts sequential dependencies, and leverages domain semantics for macroscopic trend forecasting. Experiments on eight real-world datasets show that STaT improves magnitude indicators by up to 8.9% and reduces shape distortion by up to 8.5% compared to existing methods.
Multi-modal time series models can finally capture both magnitude and shape, thanks to a novel symbolic representation that discretizes continuous data into tokens.
Recent research in time series forecasting frequently investigates the integration of textual and visual modalities with numerical models to better navigate non-stationary environments. Despite delivering solid numerical results, existing multi-modal approaches usually encounter a dilemma: prioritizing the minimization of average errors can result in excessively smooth forecasts that overlook essential fluctuations. To resolve this limitation, we introduce STaT, an innovative multimodal architecture for Symbolic-Temporal-Textual Alignment, which seamlessly unites three synergistic modalities. Specifically, the symbolic modality converts continuous time series into discrete tokens, facilitating the accurate identification of structural patterns and turning points; the temporal modality extracts inherent sequential dependencies; and the textual modality leverages domain semantics to steer the macroscopic forecasting trends. Comprehensive evaluations on eight real-world benchmarks indicate that STaT delivers exceptional performance, enhancing conventional magnitude indicators by up to 8.9% while simultaneously decreasing shape distortion by up to 8.5%.