Output constrained adaptive dynamic programming for continuous-time nonlinear systems

This study presents a novel adaptive dynamic programming (ADP) based strategy for the optimal control of a class of continuous-time unknown nonlinear systems with output-constraints. Our contribution includes a step forward beyond the usual optimal control result to show that the output of the plant is always within pre-defined bounds, which can finds numerous applications in renewable energy control and integration, power flow optimization, and inverter control. To obtain the new results, an error transformation function is introduced to generate an equivalent nonlinear system, whose asymptotic stability ensures both the asymptotic stability and the satisfaction of the output restriction of the original system. Besides, ADP algorithms are developed to solve the transformed nonlinear optimal problem with completely unknown system dynamics. Furthermore, asymptotic stability of the original and transformed nonlinear system is theoretically guaranteed. Finally, comparison results demonstrate the merits of the proposed policy.

[1]  Yu Jiang,et al.  Robust Adaptive Dynamic Programming and Feedback Stabilization of Nonlinear Systems , 2014, IEEE Transactions on Neural Networks and Learning Systems.

[2]  Qinmin Yang,et al.  Reinforcement Learning Controller Design for Affine Nonlinear Discrete-Time Systems using Online Approximators , 2012, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[3]  Donald E. Kirk,et al.  Optimal control theory : an introduction , 1970 .

[4]  George N. Saridis,et al.  An Approximation Theory of Optimal Control for Trainable Manipulators , 1979, IEEE Transactions on Systems, Man, and Cybernetics.

[5]  Boon Ping Yeo Quasilinearization and optimal control problems with a state inequality constraint , 2003 .

[6]  Guanghong Yang,et al.  Adaptive nearly optimal control for a class of continuous-time nonaffine nonlinear systems with inequality constraints. , 2017, ISA transactions.

[7]  B. Finlayson The method of weighted residuals and variational principles : with application in fluid mechanics, heat and mass transfer , 1972 .

[8]  P. Werbos,et al.  Beyond Regression : "New Tools for Prediction and Analysis in the Behavioral Sciences , 1974 .

[9]  Kurt Hornik,et al.  Universal approximation of an unknown mapping and its derivatives using multilayer feedforward networks , 1990, Neural Networks.

[10]  Derong Liu,et al.  Neural-Network-Based Online HJB Solution for Optimal Robust Guaranteed Cost Control of Continuous-Time Uncertain Nonlinear Systems , 2014, IEEE Transactions on Cybernetics.

[11]  Zhong-Ping Jiang,et al.  Robust adaptive dynamic programming for linear and nonlinear systems: An overview , 2013, Eur. J. Control.

[12]  Derong Liu,et al.  Optimal Control of Unknown Nonlinear Discrete-Time Systems Using the Iterative Globalized Dual Heuristic Programming Algorithm , 2013 .

[13]  Youxian Sun,et al.  Transient stability enhancement control of power systems with time-varying constraints , 2016 .

[14]  Frank L. Lewis,et al.  Adaptive optimal control for continuous-time linear systems based on policy iteration , 2009, Autom..

[15]  Derong Liu,et al.  Guaranteed cost neural tracking control for a class of uncertain nonlinear systems using adaptive dynamic programming , 2016, Neurocomputing.

[16]  D. Jacobson,et al.  A transformation technique for optimal control problems with a state variable inequality constraint , 1969 .

[17]  Youxian Sun,et al.  Guaranteed Performance Control of DFIG Variable-Speed Wind Turbines , 2016, IEEE Transactions on Control Systems Technology.

[18]  Frank L. Lewis,et al.  Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach , 2005, Autom..