← 返回

基于序贯决策的异构智能体强化学习方法——面向配网与输网协同的负荷频率控制

Order-Based Heterogeneous Agents Reinforcement Learning Method With the Coordination of Distribution Network and Transmission Network for Load Frequency Control

作者 Shixuan Yu · Xiaodong Zheng · Tianzhuo Shi · Ruilin Chen · Shuangsi Xue · Hui Cao
期刊 IEEE Transactions on Power Systems
出版日期 2025年8月
卷/期 第 41 卷 第 1 期
技术分类 控制与算法
技术标签 强化学习 调峰调频 微电网 储能变流器PCS
相关度评分 ★★★★★ 5.0 / 5.0
关键词
语言:

中文摘要

针对高比例分布式能源接入下配网参与负荷频率控制(LFC)的需求,本文提出一种序贯驱动的异构智能体软演员-评论家算法(OHASAC),建模为部分可观测马尔可夫博弈,实现配网(含光伏与储能)与输网的协同调频。仿真验证其在多源协调、泛化性与可扩展性上的优势。

English Abstract

With the continuous penetration of large-scale distributed energy resources (DERs) into the distribution network (DN), the DN has acquired the capability to participate in load frequency control (LFC). This paper proposed an order-based heterogeneous-agent soft actor-critic method (OHASAC) to address the coordination challenges among heterogeneous controllable DERs. A neural network is applied to estimate the optimal update sequence of the heterogeneous agents. The optimal LFC problem is formulated as a partially observable Markov game (MG) considering the coordination of DN with sufficient controllable DERs and transmission network (TN). The modeling process considers the coordinated frequency regulation between battery energy storage systems (BESS) and photovoltaics (PV) under variable irradiance conditions. Simulations in the coordinated environment of the distribution network (DN) and transmission network (TN) demonstrate that the proposed method can effectively manage various distributed generation resources while achieving superior generalization and scalability.
S

SunView 深度解读

该研究高度契合阳光电源ST系列PCS、PowerTitan及iSolarCloud平台在构网型调频与光储协同控制中的技术演进需求。OHASAC算法可嵌入PCS实时控制层,提升PowerTitan在电网侧/用户侧储能场景下的动态调频响应精度与多设备协同效率;建议将序贯决策机制集成至iSolarCloud智能运维平台,支撑大规模分布式光伏+储能集群的AGC/AVC闭环优化,强化阳光电源在新型电力系统辅助服务市场的核心竞争力。