Data-Driven Neuro-Optimal Temperature Control of Water–Gas Shift Reaction Using Stable Iterative Adaptive Dynamic Programming

In this paper, a novel data-driven stable iterative adaptive dynamic programming (ADP) algorithm is developed to solve optimal temperature control problems for water-gas shift (WGS) reaction systems. According to the system data, neural networks (NNs) are used to construct the dynamics of the WGS system and solve the reference control, respectively, where the mathematical model of the WGS system is unnecessary. Considering the reconstruction errors of NNs and the disturbances of the system and control input, a new stable iterative ADP algorithm is developed to obtain the optimal control law. The convergence property is developed to guarantee that the iterative performance index function converges to a finite neighborhood of the optimal performance index function. The stability property is developed to guarantee that each of the iterative control laws can make the tracking error uniformly ultimately bounded (UUB). NNs are developed to implement the stable iterative ADP algorithm. Finally, numerical results are given to illustrate the effectiveness of the developed method.

[1]  George Nikolakopoulos,et al.  Piecewise Affine Modeling and Constrained Optimal Control for a Pneumatic Artificial Muscle , 2014, IEEE Transactions on Industrial Electronics.

[2]  Niket S. Kaisare,et al.  Approximate Dynamic Programming based control for Water Gas Shift reactor , 2012 .

[3]  Derong Liu,et al.  Temperature Control in Water-Gas Shift Reaction with Adaptive Dynamic Programming , 2012, ISNN.

[4]  Bo Lincoln,et al.  Relaxing dynamic programming , 2006, IEEE Transactions on Automatic Control.

[5]  J. Ni,et al.  Parametric study of microreactor design for water gas shift reactor using an integrated reaction and heat exchange model , 2005 .

[6]  Derong Liu,et al.  Finite-Approximation-Error-Based Optimal Control Approach for Discrete-Time Nonlinear Systems , 2013, IEEE Transactions on Cybernetics.

[7]  J. Fernando A. da Silva,et al.  Fast-Predictive Optimal Control of NPC Multilevel Converters , 2013, IEEE Transactions on Industrial Electronics.

[8]  Shangtai Jin,et al.  Data-Driven Model-Free Adaptive Control for a Class of MIMO Nonlinear Discrete-Time Systems , 2011, IEEE Transactions on Neural Networks.

[9]  Qinglai Wei,et al.  A Novel Iterative $\theta $-Adaptive Dynamic Programming for Discrete-Time Nonlinear Systems , 2014, IEEE Transactions on Automation Science and Engineering.

[10]  Frank L. Lewis,et al.  Reinforcement Learning and Approximate Dynamic Programming for Feedback Control , 2012 .

[11]  Temperature control of the water gas shift reaction in microstructured reactors , 2007 .

[12]  Qinglai Wei,et al.  Dual iterative adaptive dynamic programming for a class of discrete-time nonlinear systems with time-delays , 2012, Neural Computing and Applications.

[13]  Sarangapani Jagannathan,et al.  Online Optimal Control of Affine Nonlinear Discrete-Time Systems With Unknown Internal Dynamics by Using Time-Based Policy Update , 2012, IEEE Transactions on Neural Networks and Learning Systems.

[14]  Akshay Kumar Rathore,et al.  Generalized Optimal Pulsewidth Modulation of Multilevel Inverters for Low-Switching-Frequency Control of Medium-Voltage High-Power Industrial AC Drives , 2013, IEEE Transactions on Industrial Electronics.

[15]  Steven X. Ding,et al.  Real-Time Implementation of Fault-Tolerant Control Systems With Performance Optimization , 2014, IEEE Transactions on Industrial Electronics.

[16]  Paul J. Webros A menu of designs for reinforcement learning over time , 1990 .

[17]  Eizo Miyashita,et al.  Optimal Feedback Control for Predicting Dynamic Stiffness During Arm Movement , 2014, IEEE Transactions on Industrial Electronics.

[18]  F. Lewis,et al.  Reinforcement Learning and Feedback Control: Using Natural Decision Methods to Design Optimal Adaptive Controllers , 2012, IEEE Control Systems.

[19]  Li Cheng,et al.  An Optimal PID Control Algorithm for Training Feedforward Neural Networks , 2013, IEEE Transactions on Industrial Electronics.

[20]  A. Basile,et al.  Performance Assessment of Water Gas Shift Membrane Reactors by a Two-dimensional Model , 2015 .

[21]  Jennie Si,et al.  Online learning control by association and reinforcement. , 2001, IEEE transactions on neural networks.

[22]  Jiaqi Liang,et al.  Intelligent Local Area Signals Based Damping of Power System Oscillations Using Virtual Generators and Approximate Dynamic Programming , 2013, IEEE Transactions on Smart Grid.

[23]  Frank L. Lewis,et al.  Lyapunov, Adaptive, and Optimal Design Techniques for Cooperative Systems on Directed Communication Graphs , 2012, IEEE Transactions on Industrial Electronics.

[24]  Ping Zhang,et al.  A comparison study of basic data-driven fault diagnosis and process monitoring methods on the benchmark Tennessee Eastman process , 2012 .

[25]  Luigi Fortuna,et al.  Reinforcement Learning and Adaptive Dynamic Programming for Feedback Control , 2009 .

[26]  Isabelle Queinnec,et al.  Optimal State-Feedback Control of Bilinear DC–DC Converters With Guaranteed Regions of Stability , 2012, IEEE Transactions on Industrial Electronics.

[27]  Thomas F. Edgar,et al.  Adaptive Control of a Laboratory Water-Gas Shift Reactor With Dynamic Inlet Conditions , 1989, 1989 American Control Conference.

[28]  Huaguang Zhang,et al.  An iterative adaptive dynamic programming method for solving a class of nonlinear zero-sum differential games , 2011, Autom..

[29]  J. M. Moe Design of water-gas shift reactors , 1962 .

[30]  Santiago Arnaltes,et al.  A New Variable-Frequency Optimal Direct Power Control Algorithm , 2013, IEEE Transactions on Industrial Electronics.

[31]  Xijia Lu,et al.  Water–gas shift modeling in coal gasification in an entrained-flow gasifier. Part 1: Development of methodology and model calibration , 2013 .

[32]  Huaguang Zhang,et al.  A Novel Infinite-Time Optimal Tracking Control Scheme for a Class of Discrete-Time Nonlinear Systems via the Greedy HDP Iteration Algorithm , 2008, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[33]  Ali Heydari,et al.  Finite-Horizon Control-Constrained Nonlinear Optimal Control Using Single Network Adaptive Critics , 2013, IEEE Transactions on Neural Networks and Learning Systems.

[34]  Derong Liu,et al.  Numerical adaptive learning control scheme for discrete-time non-linear systems , 2013 .

[35]  Derong Liu,et al.  An iterative ϵ-optimal control scheme for a class of discrete-time nonlinear systems with unfixed initial state , 2012, Neural Networks.

[36]  Yangmin Li,et al.  Optimal Design, Fabrication, and Control of an $XY$ Micropositioning Stage Driven by Electromagnetic Actuators , 2013, IEEE Transactions on Industrial Electronics.

[37]  Derong Liu,et al.  Policy Iteration Adaptive Dynamic Programming Algorithm for Discrete-Time Nonlinear Systems , 2014, IEEE Transactions on Neural Networks and Learning Systems.

[38]  Han Ho Choi,et al.  SDRE-Based Near Optimal Control System Design for PM Synchronous Motor , 2012, IEEE Transactions on Industrial Electronics.