Optimal tracking control of affine nonlinear discrete-time systems with unknown internal dynamics

In this paper, direct dynamic programming techniques are utilized to solve the Hamilton Jacobi-Bellman equation forward-in-time for the optimal tracking control of general affine nonlinear discrete-time systems using online approximators (OLA's). The proposed approach, referred as adaptive dynamic programming (ADP), is utilized to solve the infinite horizon optimal tracking control of affine nonlinear discrete-time systems in the presence of unknown internal dynamics and a known control coefficient matrix. The design is implemented using OLA's to realize the optimal feedback control signal and the associated cost function. The feedforward portion of the control input is derived and approximated using an additional OLA for steady state conditions. Novel tuning laws for the OLA's are derived, and all parameters are tuned online. Lyapunov techniques are used to show that all signals are uniformly ultimately bounded (UUB) and that the approximated control signal approaches the optimal control input with small bounded error. In the ideal case when there are no approximation errors, the approximated control converges to the optimal value asymptotically. Simulation results are included to show the effectiveness of the approach.

[1]  Keck Voon Ling,et al.  Inverse optimal adaptive control for attitude tracking of spacecraft , 2005, IEEE Trans. Autom. Control..

[2]  Frank L. Lewis,et al.  Discrete-Time Nonlinear HJB Solution Using Approximate Dynamic Programming: Convergence Proof , 2008, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[3]  T. Basar,et al.  H/sup /spl infin//-optimal tracking control techniques for nonlinear underactuated systems , 2000, Proceedings of the 39th IEEE Conference on Decision and Control (Cat. No.00CH37187).

[4]  Warren E. Dixon,et al.  Asymptotic Tracking for Uncertain Dynamic Systems Via a Multilayer Neural Network Feedforward and RISE Feedback Control Structure , 2008, IEEE Transactions on Automatic Control.

[5]  Dongbing Gu,et al.  Receding horizon tracking control of wheeled mobile robots , 2006, IEEE Transactions on Control Systems Technology.

[6]  Jeff S. Shamma,et al.  Existence of SDRE stabilizing feedback , 2003, IEEE Trans. Autom. Control..

[7]  Huaguang Zhang,et al.  A Novel Infinite-Time Optimal Tracking Control Scheme for a Class of Discrete-Time Nonlinear Systems via the Greedy HDP Iteration Algorithm , 2008, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[8]  Jagannathan Sarangapani,et al.  Neural Network Control of Nonlinear Discrete-Time Systems , 2018 .