Intelligent wind farm control via deep reinforcement learning and high-fidelity simulations

Abstract Wind farms’ power-generation efficiency is constrained by the high system complexity. A novel deep reinforcement learning (RL)-based wind farm control scheme is proposed to handle this challenge and achieve power generation optimization. A reward regularization (RR) module is designed to estimate wind turbines’ normalized power outputs under different yaw settings and uncertain wind conditions, which brings strong robustness and adaptability to the proposed control scheme. The RR module is then combined with the deep deterministic policy gradient algorithm to evaluate the optimal yaw settings for all the wind turbines within the farm. The proposed wind farm control scheme is data-driven and model-free, which addresses the limitations of current approaches, including reliance on accurate analytical/parametric models and lack of adaptability to uncertain wind conditions. In addition, a novel composite learning-based controller for each turbine is designed to achieve closed-loop yaw tracking, which can guarantee the exponential convergence of tracking errors in the presence of uncertainties of yaw actuators. The whole control system can be pre-trained offline and fine-tuned online, providing an easy-to-apply solution with enhanced generality and flexibility for wind farms. High-fidelity simulations with SOWFA (simulator for offshore wind farm applications) and Tensorflow show that the proposed scheme can significantly improve the wind farm’s power generation by exploiting a sparse data set without requiring any wake model.

[1]  Haoyong Yu,et al.  Composite learning robot control with guaranteed parameter convergence , 2018, Autom..

[2]  Jason R. Marden,et al.  Wind plant power optimization through yaw control using a parametric model for wake effects—a CFD simulation study , 2016 .

[3]  J. Michalakes,et al.  A numerical study of the effects of atmospheric and wake turbulence on wind turbine dynamics , 2012 .

[4]  Tuhfe Göçmen,et al.  Optimizing wind farm control through wake steering using surrogate models based on high-fidelity simulations , 2020 .

[5]  Ferdinando Chiacchio,et al.  A novel hybrid model for the estimation of energy conversion in a wind farm combining wake effects and stochastic dependability , 2020 .

[6]  Zhe Chen,et al.  Offshore wind farm repowering optimization , 2017 .

[7]  Jan-Willem van Wingerden,et al.  Adjoint-based model predictive control for optimal energy extraction in waked wind farms , 2019, Control Engineering Practice.

[8]  Tom Schaul,et al.  Dueling Network Architectures for Deep Reinforcement Learning , 2015, ICML.

[9]  Kincho H. Law,et al.  Bayesian Ascent: A Data-Driven Optimization Scheme for Real-Time Control With Application to Wind Farm Power Maximization , 2016, IEEE Transactions on Control Systems Technology.

[10]  Kincho H. Law,et al.  A data-driven, cooperative wind farm control to maximize the total power production , 2016 .

[11]  Zhao Yang Dong,et al.  Cooperative Wind Farm Control With Deep Reinforcement Learning and Knowledge-Assisted Learning , 2020, IEEE Transactions on Industrial Informatics.

[12]  Johan Meyers,et al.  A control-oriented dynamic wind farm model: WFSim , 2017 .

[13]  Ervin Bossanyi,et al.  Wind Energy Handbook , 2001 .

[14]  Senu Sirnivas,et al.  Load response of a floating wind turbine to turbulent atmospheric flow , 2019, Applied Energy.

[15]  Yin Yang,et al.  Policy Iteration Q-Learning for Data-Based Two-Player Zero-Sum Game of Linear Discrete-Time Systems , 2020, IEEE Transactions on Cybernetics.

[16]  Girish Chowdhary,et al.  Concurrent Learning for Parameter Estimation Using Dynamic State-Derivative Estimators , 2015, IEEE Transactions on Automatic Control.

[17]  J. F. Ainslie,et al.  CALCULATING THE FLOWFIELD IN THE WAKE OF WIND TURBINES , 1988 .

[18]  J. Chow,et al.  Windfarm Power Optimization Using Yaw Angle Control , 2017, IEEE Transactions on Sustainable Energy.

[19]  Shane Legg,et al.  Human-level control through deep reinforcement learning , 2015, Nature.

[20]  Tingwen Huang,et al.  Model-Free Optimal Tracking Control via Critic-Only Q-Learning , 2016, IEEE Transactions on Neural Networks and Learning Systems.

[21]  Stefano Leonardi,et al.  A large-eddy simulation of wind-plant aerodynamics , 2012 .

[22]  B. Sanderse Aerodynamics of wind turbine wakes , 2009 .

[23]  Jennifer Annoni,et al.  Development of FAST.Farm: A New Multiphysics Engineering Tool for Wind Farm Design and Analysis: Preprint , 2017 .

[24]  Xiaowei Zhao,et al.  Reinforcement Learning-Based Approximate Optimal Control for Attitude Reorientation Under State Constraints , 2020, IEEE Transactions on Control Systems Technology.

[25]  Girish Chowdhary,et al.  Exponential parameter and tracking error convergence guarantees for adaptive controllers without persistency of excitation , 2014, Int. J. Control.

[26]  Andrew Ning,et al.  Wind plant system engineering through optimization of layout and yaw control , 2016 .

[27]  Kathryn E. Johnson,et al.  Simulation comparison of wake mitigation control strategies for a two‐turbine case , 2015 .

[28]  J. Sørensen,et al.  Wind turbine wake aerodynamics , 2003 .

[29]  Jason R. Marden,et al.  A Model-Free Approach to Wind Farm Control Using Game Theoretic Methods , 2013, IEEE Transactions on Control Systems Technology.

[30]  Xiaowei Zhao,et al.  Deep Neural Learning Based Distributed Predictive Control for Offshore Wind Farm Using High-Fidelity LES Data , 2021, IEEE Transactions on Industrial Electronics.

[31]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[32]  Jason Jonkman,et al.  FAST User's Guide , 2005 .

[33]  N. Jensen A note on wind generator interaction , 1983 .

[34]  Haoyong Yu,et al.  Composite Learning From Adaptive Dynamic Surface Control , 2016, IEEE Transactions on Automatic Control.

[35]  Andrew Swift,et al.  Characterizing power performance and wake of a wind turbine under yaw and blade pitch , 2016 .

[36]  Muyiwa S. Adaramola,et al.  Experimental investigation of wake effects on wind turbine performance , 2011 .

[37]  Derong Liu,et al.  Adaptive $Q$ -Learning for Data-Based Optimal Output Regulation With Experience Replay , 2018, IEEE Transactions on Cybernetics.

[38]  Niranjan Ghaisas,et al.  Evaluation of layout and atmospheric stability effects in wind farms using large‐eddy simulation , 2017 .