Wavelet Reduced Order Observer Based Adaptive Tracking Control for a Class of Uncertain Multiple Time Delay Nonlinear Systems Subjected to Actuator Saturation Using Actor Critic Architecture

Abstract This Paper investigates the mean to design the reduced order observer and observer based controller for a class of uncertain delayed nonlinear system subjected to actuator saturation using Actor Critic architecture. A new design approach of wavelet based adaptive reduced order observer is proposed. The task of the proposed wavelet adaptive reduced order observer is to identify the unknown system dynamics and to reconstruct the states of the system. Wavelet neural network (WNN) is implemented to approximate the uncertainties present in the system as well as to identify and compensate the nonlinearities introduced in the system due to actuator saturation. Reinforcement learning is applied through Actor-Critic architecture where a separate structure is for both perception (critic) and action (actor). Reinforcement learning is used via two Wavelet Neural networks (WNN), critic WNN and action WNN, which are combined to form an adaptive WNN controller. The critic WNN approximates the “strategic” utility function which is then minimized by the action WNN. Using the feedback control, based on reconstructed states, the behavior of closed loop system is investigated. By Lyapunov-Krasovskii approach, the closed-loop tracking error is proved to be uniformly ultimate bounded. A numerical example is provided to verify the effectiveness of theoretical development.

[1]  Heidar Ali Talebi,et al.  A stable neural network-based observer with application to flexible-joint manipulators , 2006, IEEE Transactions on Neural Networks.

[2]  Luca Zaccarian,et al.  Nonlinear antiwindup applied to Euler-Lagrange systems , 2004, IEEE Transactions on Robotics and Automation.

[3]  Stefan Schaal,et al.  Policy Gradient Methods for Robotics , 2006, 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[4]  Jing Zhou,et al.  Adaptive Neural Network Control of Uncertain Nonlinear Systems in the Presence of Input Saturation , 2006, 2006 9th International Conference on Control, Automation, Robotics and Vision.

[5]  Ji-Feng Zhang,et al.  Reduced-order observer-based control design for nonlinear stochastic systems , 2004, Syst. Control. Lett..

[6]  Giorgio Bartolini,et al.  Reduced-order observer for sliding mode control of nonlinear non-affine systems , 2008, 2008 47th IEEE Conference on Decision and Control.

[7]  Manish Sharma,et al.  Wavelet Adaptive Observer Based Control for a Class of Uncertain Time Delay Nonlinear Systems with Input Constraints , 2009, 2009 International Conference on Advances in Recent Technologies in Communication and Computing.

[8]  Luis G. Crespo,et al.  Optimal performance, robustness and reliability based designs of systems with structured uncertainty , 2003, Proceedings of the 2003 American Control Conference, 2003..

[9]  F. Zhu,et al.  The Design of Reduced-order Observer for Systems with Monotone Nonlinearities , 2007 .

[10]  Jürgen Adamy,et al.  Observer Based Controller Design for Linear Systems with Input Constraints , 2008 .

[11]  Vladimir L. Kharitonov,et al.  Lyapunov-Krasovskii approach to the robust stability analysis of time-delay systems , 2003, Autom..

[12]  Jean-Pierre Richard,et al.  Time-delay systems: an overview of some recent advances and open problems , 2003, Autom..

[13]  Chao Hsing Hsu,et al.  Robust optimal stabilizing observer-based control design of decentralized stochastic singularly-perturbed computer controlled systems with multiple time-varying delays , 2009 .

[14]  Qingling Zhang,et al.  Delay-dependent Robust Stabilization for Uncertain Singular Systems with Multiple Input Delays , 2009 .

[15]  A. Astolfi,et al.  Reduced-order observer design for nonlinear systems , 2007, 2007 European Control Conference (ECC).

[16]  Pan-Chyr Yang,et al.  Adaptive critic anti-slip control of wheeled autonomous robot , 2007 .

[17]  Emilia Fridman,et al.  An improved stabilization method for linear time-delay systems , 2002, IEEE Trans. Autom. Control..