Optimal Control Strategy for Unit Operation Based on Reinforcement Learning
Download PDF

Keywords

Deep reinforcement learning
Unit operation optimization
Markov decision process
State space design

DOI

10.26689/jwa.v9i6.13409

Submitted : 2025-12-10
Accepted : 2025-12-25
Published : 2026-01-09

Abstract

Power system operation optimization faces dual challenges from energy structure transformation and extreme environmental conditions. Traditional unit control methods demonstrate limitations in addressing renewable energy volatility, load demand uncertainty, and sudden system disturbances. Deep reinforcement learning, through constructing a state-action-reward decision framework, effectively handles the time-varying, nonlinear, and uncertain characteristics of complex systems, providing new technical pathways for unit operation optimization. Studies show that applications of voltage regulation frameworks based on gated Markov decision processes and reinforcement learning in optimizing high-pressure feedwater heater operations, along with the integration of Hooke-Jeeves algorithm and deep deterministic strategy gradient methods in air handling unit control, all validate deep reinforcement learning’s unique advantages in solving multi-objective optimization problems for power generation units.

References

Zou Y, Ji Y, Li W, et al., 2024, Research on Deep Learning-Based Optimization Modeling Method for Combined Cycle Units. Power Equipment Management, 2024(18): 280–282.

Zhou N, Liang X, Yu X, et al., 2023, Research on Optimal Operation of Integrated Energy Systems Using DRL. Power Big Data, 26(6): 49–57.

Li J, Yao Y, Liu B, 2022, Optimal Operation of Integrated Energy Systems Based on Comprehensive Evaluation Indicators. Journal of Guangxi University (Natural Science Edition), 47(6): 1518–1531.

Liu G, Jin Y, Cao X, et al., 2022, Thermal Power Load Optimization Allocation for Gas Turbine Units Based on Deep Learning and Chaos Optimization. Journal of Thermal Power Generation, 51(2): 178–182.

Nie C, An L, Xu G, et al., 2021, Real-Time Optimization Strategy for Air-Cooled Island Operation in Coal-Fired Power Stations based on Big Data. Journal of Power Engineering, 41(9): 713–720.

Lü J, 2024, Machine Learning-Based Optimization of Energy Efficiency Parameters for Coal-Fired Power Units, thesis, Northeast Electric Power University.

Pan L, 2017, Research on Fault Diagnosis of Key Components in Wind Turbine Drive Systems Using Deep Learning Networks, thesis, Shanghai Dianji University.

Zhang L, Wu H, Li Z, et al., 2024, Design of an AI-based Maintenance System for Thermal Power Plant Units. Mold Manufacturing, 24(11): 207–209.

Zhang Y, Wang L, Liu Y, et al., 2024, A Multi-Turbine Operation Monitoring Method Based on Balanced Distribution Adaptive Transfer Learning. Renewable Energy, 42(8): 1068–1073.

Tang H, Yan Z, Fang D, et al., n.d., Deep Transfer Reinforcement Learning-Based Optimization Method for Flexible Resource Grid Dispatching. Control Engineering, 1–13.