Search papers, labs, and topics across Lattice.
This paper introduces a hybrid knowledge-data-driven approach for two-stage voltage control in active distribution networks (ADNs) by dynamically integrating a large language model (LLM) agent for day-ahead scheduling of OLTCs and SCs with a reinforcement learning (RL) agent for intra-day refinement of PV inverter reactive power. The LLM leverages coarse forecasts and grid codes to generate initial strategies, while the RL agent uses node-level measurements for precise voltage regulation. The proposed self-evolution mechanism for the LLM and pretrain-finetune pipeline for the RL agent significantly improves training efficiency and voltage control performance, as demonstrated through comprehensive experiments.
Forget purely data-driven voltage control – this LLM-RL collaboration uses grid knowledge to slash training time and boost performance in active distribution networks.
The growing integration of distributed photovoltaics (PVs) into active distribution networks (ADNs) has exacerbated operational challenges, making it imperative to coordinate diverse equipment to mitigate voltage violations and enhance power quality. Although existing data-driven approaches have demonstrated effectiveness in the voltage control problem, they often require extensive trial-and-error exploration and struggle to incorporate heterogeneous information, such as day-ahead forecasts and semantic-based grid codes. Considering the operational scenarios and requirements in real-world ADNs, in this paper, we propose a hybrid knowledge-data-driven approach that leverages dynamic collaboration between a large language model (LLM) agent and a reinforcement learning (RL) agent to achieve two-stage voltage control. In the day-ahead stage, the LLM agent receives coarse region-level forecasts and generates scheduling strategies for on-load tap changer (OLTC) and shunt capacitors (SCs) to regulate the overall voltage profile. Then in the intra-day stage, based on accurate node-level measurements, the RL agent refines terminal voltages by deriving reactive power generation strategies for PV inverters. On top of the LLM-RL collaboration framework, we further propose a self-evolution mechanism for the LLM agent and a pretrain-finetune pipeline for the RL agent, effectively enhancing and coordinating the policies for both agents. The proposed approach not only aligns more closely with practical operational characteristics but also effectively utilizes the inherent knowledge and reasoning capabilities of the LLM agent, significantly improving training efficiency and voltage control performance. Comprehensive comparisons and ablation studies demonstrate the effectiveness of the proposed method.