Data-based optimal tracking of autonomous nonlinear switching systems

In this paper, a data-based scheme is proposed to solve the optimal tracking problem of autonomous nonlinear switching systems. The system state is forced to track the reference signal by minimizing the performance function. First, the problem is transformed to solve the corresponding Bellman optimality equation in terms of the Q-function ( also named as action value function ) . Then, an iterative algorithm based on adaptive dynamic programming ( ADP ) is developed to find the optimal solution which is totally based on sampled data. The linear-in-parameter ( LIP ) neural network is taken as the value function approximator. Considering the presence of approximation error at each iteration step, the generated approximated value function sequence is proved to be boundedness around the exact optimal solution under some verifiable assumptions. Moreover, the effect that the learning process will be terminated after a finite number of iterations is investigated in this paper. A sufficient condition for asymptotically stability of the tracking error is derived. Finally, the effectiveness of the algorithm is demonstrated with three simulation examples.

[1]  Derong Liu,et al.  Neural-network-based optimal tracking control scheme for a class of unknown discrete-time nonlinear systems using iterative ADP algorithm , 2014, Neurocomputing.

[2]  Liyan Zhang,et al.  Optimal Control of Switched System Based on Neural Network Optimization , 2008, ICIC.

[3]  Haibo He,et al.  Event-Driven Nonlinear Discounted Optimal Regulation Involving a Power System Application , 2017, IEEE Transactions on Industrial Electronics.

[4]  H. Axelsson,et al.  Algorithm for Switching-Time Optimization in Hybrid Dynamical Systems , 2005, Proceedings of the 2005 IEEE International Symposium on, Mediterrean Conference on Control and Automation Intelligent Control, 2005..

[5]  Ali Heydari,et al.  Optimal Codesign of Control Input and Triggering Instants for Networked Control Systems Using Adaptive Dynamic Programming , 2019, IEEE Transactions on Industrial Electronics.

[6]  Moharam Habibnejad Korayem,et al.  Maximum DLCC of Spatial Cable Robot for a Predefined Trajectory Within the Workspace Using Closed Loop Optimal Control Approach , 2011, J. Intell. Robotic Syst..

[7]  Haibo He,et al.  Adaptive Critic Nonlinear Robust Control: A Survey , 2017, IEEE Transactions on Cybernetics.

[8]  Ding Wang,et al.  Robust Policy Learning Control of Nonlinear Plants With Case Studies for a Power System Application , 2020, IEEE Transactions on Industrial Informatics.

[9]  Alberto Olivares,et al.  Framework for Aircraft Trajectory Planning Toward an Efficient Air Traffic Management , 2012 .

[10]  Ali Heydari,et al.  Optimal Switching and Control of Nonlinear Switching Systems Using Approximate Dynamic Programming , 2014, IEEE Transactions on Neural Networks and Learning Systems.

[11]  W. Rudin Principles of mathematical analysis , 1964 .

[12]  O. Stursberg,et al.  A numerical method for hybrid optimal control based on dynamic programming , 2011 .

[13]  Ali Heydari,et al.  Policy iteration for optimal switching with continuous-time dynamics , 2016, 2016 International Joint Conference on Neural Networks (IJCNN).

[14]  Ali Heydari,et al.  Optimal Switching of DC–DC Power Converters Using Approximate Dynamic Programming , 2018, IEEE Transactions on Neural Networks and Learning Systems.

[15]  Munther A. Dahleh,et al.  Suboptimal Control of Switched Systems With an Application to the DISC Engine , 2008, IEEE Transactions on Control Systems Technology.

[16]  Frank L. Lewis,et al.  Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach , 2005, Autom..

[17]  Xuping Xu,et al.  Optimal control of switched systems via non-linear optimization based on direct differentiations of value functions , 2002 .

[18]  Frank L. Lewis,et al.  Discrete-Time Nonlinear HJB Solution Using Approximate Dynamic Programming: Convergence Proof , 2008, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[19]  Haibo He,et al.  Adaptive Event-Triggered Control Based on Heuristic Dynamic Programming for Nonlinear Discrete-Time Systems , 2017, IEEE Transactions on Neural Networks and Learning Systems.

[20]  Ali Heydari,et al.  Sub‐optimal switching in anti‐lock brake systems using approximate dynamic programming , 2019, IET Control Theory & Applications.

[21]  Y. Wardi,et al.  Algorithm for optimal mode scheduling in switched systems , 2012, 2012 American Control Conference (ACC).

[22]  Frank L. Lewis,et al.  Online actor critic algorithm to solve the continuous-time infinite horizon optimal control problem , 2009, 2009 International Joint Conference on Neural Networks.

[23]  Tingwen Huang,et al.  Data-based approximate policy iteration for affine nonlinear continuous-time optimal control design , 2014, Autom..

[24]  Mohamed Benrejeb,et al.  Optimization of switching instants for optimal control of linear switched systems based on genetic algorithms , 2009, ICONS.

[25]  J. De Leon,et al.  Hybrid control of a multicellular converter , 2007 .

[26]  Derong Liu,et al.  Policy Iteration Adaptive Dynamic Programming Algorithm for Discrete-Time Nonlinear Systems , 2014, IEEE Transactions on Neural Networks and Learning Systems.

[27]  Junfei Qiao,et al.  Self-Learning Optimal Regulation for Discrete-Time Nonlinear Systems Under Event-Driven Formulation , 2020, IEEE Transactions on Automatic Control.

[28]  John N. Tsitsiklis,et al.  Neuro-Dynamic Programming , 1996, Encyclopedia of Machine Learning.

[29]  Ali Heydari,et al.  Optimal multi-therapeutic HIV treatment using a global optimal switching scheme , 2013, Appl. Math. Comput..

[30]  Derong Liu,et al.  Neural-Network-Based Optimal Control for a Class of Unknown Discrete-Time Nonlinear Systems Using Globalized Dual Heuristic Programming , 2012, IEEE Transactions on Automation Science and Engineering.

[31]  Daniel Liberzon,et al.  Switching in Systems and Control , 2003, Systems & Control: Foundations & Applications.

[32]  Ali Heydari,et al.  Optimal switching between autonomous subsystems , 2014, J. Frankl. Inst..

[33]  George G. Lendaris,et al.  Adaptive dynamic programming , 2002, IEEE Trans. Syst. Man Cybern. Part C.

[34]  JiangZhong-Ping,et al.  Value iteration and adaptive dynamic programming for data-driven adaptive optimal control design , 2016 .

[35]  Zhong-Ping Jiang,et al.  Global Adaptive Dynamic Programming for Continuous-Time Nonlinear Systems , 2013, IEEE Transactions on Automatic Control.

[36]  Ali Heydari,et al.  Feedback Solution to Optimal Switching Problems With Switching Cost , 2014, IEEE Transactions on Neural Networks and Learning Systems.

[37]  Ali Heydari,et al.  Optimal Triggering of Networked Control Systems , 2014, IEEE Transactions on Neural Networks and Learning Systems.

[38]  Magnus Egerstedt,et al.  Real-time optimal feedback control of switched autonomous systems , 2009, ADHS.

[39]  Paolo Valigi,et al.  Optimal mode-switching for hybrid systems with varying initial states , 2008 .

[40]  Moharam Habibnejad Korayem,et al.  Analytical design of optimal trajectory with dynamic load-carrying capacity for cable-suspended manipulator , 2012 .

[41]  Jennie Si,et al.  Online learning control by association and reinforcement. , 2001, IEEE transactions on neural networks.

[42]  Haibo He,et al.  Air-Breathing Hypersonic Vehicle Tracking Control Based on Adaptive Dynamic Programming , 2017, IEEE Transactions on Neural Networks and Learning Systems.

[43]  F.L. Lewis,et al.  Reinforcement learning and adaptive dynamic programming for feedback control , 2009, IEEE Circuits and Systems Magazine.

[44]  Panos J. Antsaklis,et al.  Optimal control of switched systems based on parameterization of the switching instants , 2004, IEEE Transactions on Automatic Control.

[45]  Ali Heydari,et al.  Optimal scheduling for reference tracking or state regulation using reinforcement learning , 2015, J. Frankl. Inst..

[46]  Ali Heydari,et al.  Optimal switching with minimum dwell time constraint , 2017, J. Frankl. Inst..

[47]  Tingwen Huang,et al.  Model-Free Optimal Tracking Control via Critic-Only Q-Learning , 2016, IEEE Transactions on Neural Networks and Learning Systems.

[48]  A. Heydari,et al.  Optimal Orbit Transfer with ON-OFF Actuators Using a Closed Form Optimal Switching Scheme , 2013 .

[49]  Chen Cai,et al.  Adaptive traffic signal control using approximate dynamic programming , 2009 .

[50]  Moharam Habibnejad Korayem,et al.  Optimal motion planning of non-linear dynamic systems in the presence of obstacles and moving boundaries using SDRE: application on cable-suspended robot , 2014 .

[51]  Zhong-Ping Jiang,et al.  Value iteration and adaptive dynamic programming for data-driven adaptive optimal control design , 2016, Autom..

[52]  Changyin Sun,et al.  An Event-Triggered Approach for Load Frequency Control With Supplementary ADP , 2017, IEEE Transactions on Power Systems.

[53]  YangQuan Chen,et al.  Optimal switching control via direct search optimization , 2003 .

[54]  Frank L. Lewis,et al.  2009 Special Issue: Neural network approach to continuous-time direct adaptive optimal control for partially unknown nonlinear systems , 2009 .