高级搜索

    基于深度强化学习的泵站动态约束调度方法

    Dynamic Constraint Scheduling Method for Pumping Stations Based on Deep Reinforcement Learning

    • 摘要: 单泵站优化运行是长距离引调水工程调度的基础,现有方法未全面考虑泵站运行中的复杂约束过程和不确定性因素.基于最大熵强化学习框架的软执行者-评论者算法(Soft-Actor-Critic,SAC),考虑不确定因素和约束条件,提出自适应折扣因子并改进成本函数,将泵站优化运行过程转换为一个改进的约束马尔可夫决策过程(Constrained Markov Decision Process,CMDP),以可靠性和经济性为目标,根据泵站环境状态对泵站各时段提水策略进行实时调度优化.基于南水北调东线邓楼泵站历史监测数据实例分析,通过与其他深度强化学习算法和经典优化算法对比,验证了该方法的合理性与优越性.研究结果表明:所提泵站动态约束调度方法表现出更高的收敛稳定性和有效性,能够在较少的叶片安放角调节次数和机组启停次数下,实现相对稳定且经济运行,为泵站管理决策提供了综合性策略.

       

      Abstract: The optimal operation of a single pumping station is crucial for managing long-distance water transfer projects.Current methodologies inadequately address the intricate constraints and uncertainties inherent in the operation and maintenance of such facilities.This study introduces an enhanced Soft Actor-Critic (SAC) algorithm featuring an adaptive discount factor and an improved cost function,grounded in the maximum entropy reinforcement learning framework.This refined approach redefines the optimal operation of the pumping station into a Constrained Markov Decision Process (CMDP) that prioritizes both reliability and economic efficiency.Real-time optimization of the pumping strategy is conducted in response to the environmental conditions of the station.Through analysis of historical data from the Denglou Pumping Station,part of the East Route of the South-to-North Water Diversion Project,this method’s effectiveness and superiority are validated against other advanced deep reinforcement learning algorithms and traditional optimization techniques.The results reveal that the proposed method achieves greater convergence stability and operational efficacy.It ensures more consistent and cost-effective performance with minimal adjustments to blade angles and reduced frequency of unit start-stop actions,thereby offering a robust decision-making framework for pumping station management.

       

    /

    返回文章
    返回