CastFlow is a dynamic agentic framework for time series forecasting that decomposes the process into planning, action, forecasting, and reflection stages, supported by a memory module and a multi-view toolkit. It uses a role-specialized design: a frozen LLM handles general-purpose reasoning, while a fine-tuned LLM performs numerical forecasting starting from an ensemble forecast baseline. The fine-tuned LLM is optimized with a two-stage, workflow-oriented training approach that combines supervised fine-tuning and reinforcement learning with verifiable rewards. In experiments on diverse datasets, the authors report superior overall results against strong baselines.
CastFlow replaces single-pass LLM forecasting with an agentic workflow that iteratively refines an ensemble forecast baseline using memory, diagnostic tools, and reflection, and it reports superior overall results against strong baselines.
Recently, large language models (LLMs) have shown great promise in time series forecasting. However, most existing LLM-based forecasting methods still follow a static generative paradigm that directly maps historical observations to future values in a single pass. Under this paradigm, forecasting is constrained by limited temporal pattern extraction, single-round acquisition of contextual features, one-shot forecast generation, and a lack of support from ensemble forecasts. To address these limitations, we propose CastFlow, a dynamic agentic forecasting framework that enables multi-view temporal pattern extraction, multi-round contextual feature acquisition, iterative forecast refinement, and forecasting supported by ensemble forecasts. First, CastFlow organizes the forecasting process into planning, action, forecasting, and reflection, establishing an agentic workflow. Second, this workflow is supported by a memory module that retrieves prior experience and a multi-view toolkit that constructs diagnostic evidence and provides a reliable ensemble forecast baseline. Third, CastFlow adopts a role-specialized design that combines general-purpose reasoning with specialized numerical forecasting. Under this design, a frozen LLM preserves general-purpose reasoning, while a fine-tuned domain-specific LLM performs evidence-guided numerical forecasting that starts from the ensemble forecast baseline rather than from scratch. To optimize the fine-tuned domain-specific LLM, we further develop a two-stage workflow-oriented training procedure that combines supervised fine-tuning (SFT) and reinforcement learning with verifiable rewards (RLVR). To evaluate the effectiveness of CastFlow, we conduct extensive experiments on diverse datasets and show that it achieves superior overall results against strong baselines. We hope that this work can serve as a step toward more adaptive and accurate time series forecasting.
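The planning, action, forecasting, and reflection loop described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: every name here (`plan`, `act`, `forecast`, `reflect`, `run_workflow`, `Memory`) is hypothetical, the two LLM roles are replaced by simple deterministic stand-ins, and the ensemble members and the acceptance rule are assumptions chosen only to make the control flow concrete.

```python
from dataclasses import dataclass, field
from statistics import mean
from typing import Dict, List, Tuple

Series = List[float]

# --- Toolkit (assumed members; the paper's actual toolkit is not specified here) ---
def naive_last(history: Series, horizon: int) -> Series:
    return [history[-1]] * horizon

def moving_average(history: Series, horizon: int, window: int = 3) -> Series:
    return [mean(history[-window:])] * horizon

def linear_drift(history: Series, horizon: int) -> Series:
    slope = (history[-1] - history[0]) / (len(history) - 1)
    return [history[-1] + slope * (h + 1) for h in range(horizon)]

def ensemble_baseline(history: Series, horizon: int) -> Series:
    """Average the member forecasts step by step."""
    members = [f(history, horizon) for f in (naive_last, moving_average, linear_drift)]
    return [mean(step) for step in zip(*members)]

@dataclass
class Memory:
    """Hypothetical memory module: stores short notes from earlier rounds."""
    notes: List[str] = field(default_factory=list)

    def retrieve(self) -> List[str]:
        return list(self.notes)

    def store(self, note: str) -> None:
        self.notes.append(note)

def plan(memory: Memory) -> List[str]:
    # Stand-in for the frozen reasoning LLM: always request both diagnostics.
    _ = memory.retrieve()
    return ["trend", "volatility"]

def act(history: Series, requested: List[str]) -> Dict[str, float]:
    # Build diagnostic evidence for the requested views.
    evidence: Dict[str, float] = {}
    if "trend" in requested:
        evidence["trend"] = (history[-1] - history[0]) / (len(history) - 1)
    if "volatility" in requested:
        evidence["volatility"] = max(history) - min(history)
    return evidence

def forecast(baseline: Series, evidence: Dict[str, float], damping: float) -> Series:
    # Stand-in for the fine-tuned LLM: adjust the baseline, not forecast from scratch.
    trend = evidence.get("trend", 0.0)
    return [b + damping * trend * (h + 1) for h, b in enumerate(baseline)]

def reflect(candidate: Series, baseline: Series, evidence: Dict[str, float]) -> bool:
    # Assumed acceptance rule: stay within the observed range of the baseline.
    limit = evidence.get("volatility", 0.0)
    return max(abs(c - b) for c, b in zip(candidate, baseline)) <= limit

def run_workflow(history: Series, horizon: int, max_rounds: int = 3) -> Tuple[Series, List[str]]:
    memory = Memory()
    baseline = ensemble_baseline(history, horizon)
    candidate, damping = baseline, 1.0
    for round_idx in range(max_rounds):
        evidence = act(history, plan(memory))
        candidate = forecast(baseline, evidence, damping)
        if reflect(candidate, baseline, evidence):
            memory.store(f"round {round_idx}: accepted")
            break
        memory.store(f"round {round_idx}: rejected")
        damping *= 0.5  # refine more conservatively on the next round
    return candidate, memory.notes
```

Calling `run_workflow([1, 2, 3, 4, 5], horizon=2)` returns a two-step forecast together with the memory notes recorded during the loop. In a real system the stand-ins for `plan` and `forecast` would be calls to the frozen and fine-tuned models, respectively.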