Lazy reinforcement learning for real-time generation control of parallel cyber-physical-social energy systems

Abstract To learn human intelligence, the social system/human system is added to a cyber–physical energy system in this paper. To accelerate the configuration process of the parameters of the cyber–physical energy system, parallel systems based on artificial societies-computational experiments-parallel execution are added to the cyber–physical energy system, i.e., a parallel cyber–physical–social energy system is proposed in this paper. This paper proposes a real-time generation control framework to replace the conventional generation control framework with multiple time scales, which consist of long-term time scale, short-term time scale, and real-time scale. Since a lazy operator employed into reinforcement learning, a lazy reinforcement learning is proposed for the real-time generation control framework. To reduce the real simulation time, multiple virtual parallel cyber–physical–social energy systems and a real parallel cyber–physical–social energy system are built for the real-time generation control of large-scale multi-area interconnected power systems. Compared with a total of 146016 conventional generation control algorithms and a relaxed artificial neural network in the simulation of IEEE 10-generator 39-bus New-England power system, the proposed lazy reinforcement learning based real-time generation control controller can obtain the highest control performance. The active power between two areas and the systemic frequency deviation can be reduced by the lazy reinforcement learning, and the simulation results verify the effectiveness and feasibility of the proposed lazy reinforcement learning based real-time generation control controller for the parallel cyber–physical–social energy systems.

[1]  Jilai Yu,et al.  A Cyber-Physical Energy Management System for Optimal Sizing and Operation of Networked Nanogrids With Battery Swapping Stations , 2019, IEEE Transactions on Sustainable Energy.

[2]  M. E. El-Hawary,et al.  Combining loss and cost objectives in daily hydro-thermal economic scheduling , 1991 .

[3]  Dongpu Cao,et al.  Parallel driving in CPSS: a unified approach for transport automation and vehicle intelligence , 2017, IEEE/CAA Journal of Automatica Sinica.

[4]  Wei-Chiang Hong,et al.  Hybridization of seasonal chaotic cloud simulated annealing algorithm in a SVR-based load forecasting model , 2015, Neurocomputing.

[5]  Seyed Mohammad Mirjalili,et al.  Multi-Verse Optimizer: a nature-inspired algorithm for global optimization , 2015, Neural Computing and Applications.

[6]  Na Li,et al.  Connecting Automatic Generation Control and Economic Dispatch From an Optimization View , 2016, IEEE Trans. Control. Netw. Syst..

[7]  Bing Li,et al.  An Assistive Navigation Framework for the Visually Impaired , 2015, IEEE Transactions on Human-Machine Systems.

[8]  Xinghuo Yu,et al.  Smart Grids: A Cyber–Physical Systems Perspective , 2016, Proceedings of the IEEE.

[9]  Fang Liu,et al.  A Two-Layer Active Disturbance Rejection Controller Design for Load Frequency Control of Interconnected Power System , 2016, IEEE Transactions on Power Systems.

[10]  Tao Yu,et al.  Hierarchical correlated Q-learning for multi-layer optimal generation command dispatch , 2016 .

[11]  Jack B A Davis,et al.  The Birmingham parallel genetic algorithm and its application to the direct DFT global optimisation of Ir(N) (N = 10-20) clusters. , 2015, Nanoscale.

[12]  Lalit Chandra Saikia,et al.  Automatic generation control using two degree of freedom fractional order PID controller , 2014 .

[13]  Xiao Wang,et al.  ACP-based social computing and parallel intelligence: Societies 5.0 and beyond , 2016, CAAI Trans. Intell. Technol..

[14]  Jian Yang,et al.  Network softwarization and parallel networks: beyond software-defined networks , 2016, IEEE Network.

[15]  Hossam Faris,et al.  Training feedforward neural networks using multi-verse optimizer for binary classification problems , 2016, Applied Intelligence.

[16]  Binod Kumar Sahu,et al.  A novel hybrid LUS–TLBO optimized fuzzy-PID controller for load frequency control of multi-source power system , 2016 .

[17]  Lu Yang,et al.  Mining high-utility itemsets based on particle swarm optimization , 2016, Eng. Appl. Artif. Intell..

[18]  Andrew Lewis,et al.  Grey Wolf Optimizer , 2014, Adv. Eng. Softw..

[19]  Nanning Zheng,et al.  Parallel testing of vehicle intelligence via virtual-real interaction , 2019, Science Robotics.

[20]  Paulo Menezes,et al.  Information Model and Architecture Specification for Context Awareness Interaction Decision Support in Cyber-Physical Human–Machine Systems , 2017, IEEE Transactions on Human-Machine Systems.

[21]  Lei Xi,et al.  Wolf pack hunting strategy for automatic generation control of an islanding smart distribution network , 2016 .

[22]  Fei-yue Wang,et al.  Control 5.0: from Newton to Merton in popper's cyber-social-physical spaces , 2016, IEEE/CAA Journal of Automatica Sinica.

[23]  Haijiao Wang,et al.  Two-Time-Scale Coordination Control for a Battery Energy Storage System to Mitigate Wind Power Fluctuations , 2013, IEEE Transactions on Energy Conversion.

[24]  K. W. Chan,et al.  Multi-Agent Correlated Equilibrium Q(λ) Learning for Coordinated Smart Generation Control of Interconnected Power Grids , 2015, IEEE Transactions on Power Systems.

[25]  Antonio J. Conejo,et al.  Optimal management of the automatic generation control service in smart user grids including electric vehicles and distributed resources , 2014 .

[26]  Jun Liang,et al.  Analysis of multi-scale chaotic characteristics of wind power based on Hilbert–Huang transform and Hurst analysis , 2015 .

[27]  Francisco de Leon,et al.  Supplementary damping controller of grid connected dc micro-grids based on Q-learning , 2016, 2016 IEEE Power and Energy Society General Meeting (PESGM).

[28]  Tao Yu,et al.  Relaxed deep learning for real-time economic generation dispatch and control with unified time scale , 2018 .

[29]  Rabindra Kumar Sahu,et al.  A novel hybrid DEPS optimized fuzzy PI/PID controller for load frequency control of multi-area interconnected power systems , 2014 .

[30]  Tao Yu,et al.  Energy Internet Access Equipment Integrating Cyber-Physical Systems: Concepts, Key Technologies, System Development, and Application Prospects , 2019, IEEE Access.

[31]  R. Naresh,et al.  Automatic generation control using disrupted oppositional based gravitational search algorithm optimised sliding mode controller under deregulated environment , 2016 .

[32]  Fei-Yue Wang,et al.  The Emergence of Intelligent Enterprises: From CPS to CPSS , 2010, IEEE Intelligent Systems.

[33]  Vladimiro Miranda,et al.  Demand Dispatch and Probabilistic Wind Power Forecasting in Unit Commitment and Economic Dispatch: A Case Study of Illinois , 2013 .

[34]  Zhao Dong-bin Parallel Control Theory of Complex Systems and Applications , 2012 .

[35]  Shuangshuang Han,et al.  Parallel vehicles based on the ACP theory: Safe trips via self-driving , 2017, 2017 IEEE Intelligent Vehicles Symposium (IV).

[36]  Tao Yu,et al.  Grouped grey wolf optimizer for maximum power point tracking of doubly-fed induction generator based wind turbine , 2017 .

[37]  Yusheng XUE,et al.  Energy internet or comprehensive energy network? , 2015 .

[38]  Fei-Yue Wang Computational Social Systems in a New Period: A Fast Transition Into the Third Axial Age , 2017, IEEE Trans. Comput. Soc. Syst..

[39]  Aranya Chakrabortty,et al.  Time-Scale Modeling of Wind-Integrated Power Systems , 2016, IEEE Transactions on Power Systems.

[40]  Benjamin F. Hobbs,et al.  Real-Time Markets for Flexiramp: A Stochastic Unit Commitment-Based Analysis , 2016, IEEE Transactions on Power Systems.

[41]  Jianzhou Wang,et al.  Analysis and application of forecasting models in wind power integration: A review of multi-step-ahead wind speed forecasting models , 2016 .

[42]  Nanning Zheng,et al.  Parallel learning: a perspective and a framework , 2017, IEEE/CAA Journal of Automatica Sinica.

[43]  Tao Yu,et al.  Stochastic Optimal Relaxed Automatic Generation Control in Non-Markov Environment Based on Multi-Step $Q(\lambda)$ Learning , 2011, IEEE Transactions on Power Systems.

[44]  K. Youcef-Toumi,et al.  Real-time economic dispatch for the supply side of the energy-water nexus , 2014 .

[45]  Haibin Yu,et al.  Agent-based distributed and economic automatic generation control for droop-controlled AC microgrids , 2016 .

[46]  Katsumi Kawano,et al.  Evolution of Autonomous Decentralized Systems for Information and Control , 2008 .

[47]  Diane J. Cook,et al.  Guest EditorialSpecial Issue on Situation, Activity, and Goal Awareness in Cyber-Physical Human-Machine Systems , 2017, IEEE Trans. Hum. Mach. Syst..

[48]  Md. Apel Mahmud,et al.  Multi-Agent Approach for Enhancing Security of Protection Schemes in Cyber-Physical Energy Systems , 2017, IEEE Transactions on Industrial Informatics.

[49]  Seyed Mohammad Mirjalili How effective is the Grey Wolf optimizer in training multi-layer perceptrons , 2014, Applied Intelligence.

[50]  Li Li,et al.  Virtual generation tribe based robust collaborative consensus algorithm for dynamic generation command dispatch optimization of smart grid , 2016 .

[51]  Tao Yu,et al.  R(λ) imitation learning for automatic generation control of interconnected power grids , 2012, Autom..