Robust adaptive dynamic programming based online tracking control algorithm for real wheeled mobile robot with omni-directional vision system

This paper proposes a new method to design an online robust adaptive dynamic programming algorithm (RADPA) for a wheeled mobile robot which is equipped with an omni-directional vision system. To integrate kinematic and dynamic controllers into the unique controller, we transform the strict feedback system dynamics into tracking error dynamics. Then, we propose a control scheme which uses only one neural network rather than three proposed in the actor-critic-based control schemes for the two-player zero-sum game problem. A neural network weight update law is designed for approximating the solution of the Hamilton–Jacobi–Isaacs equation without knowing knowledge of internal system dynamics. To implement the scheme, we propose the online RADPA, in which control and disturbance laws are updated simultaneously in an iterative loop. The convergence and stability of the online RADPA are proven by Lyapunov techniques. Simulations and experiments on a wheeled mobile robot testbed are carried out to verify the effectiveness of the proposed algorithm.

[1]  Huai-Ning Wu,et al.  Neural Network Based Online Simultaneous Policy Update Algorithm for Solving the HJI Equation in Nonlinear $H_{\infty}$ Control , 2012, IEEE Transactions on Neural Networks and Learning Systems.

[2]  Wei-Song Lin,et al.  Adaptive critic motion control design of autonomous wheeled mobile robot by dual heuristic programming , 2008, Autom..

[3]  Simon G. Fabri,et al.  Dual Adaptive Dynamic Control of Mobile Robots Using Neural Networks , 2009, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[4]  S. Jagannathan,et al.  Neural Network-Based Optimal Control of Mobile Robot Formations With Reduced Information Exchange , 2013, IEEE Transactions on Control Systems Technology.

[5]  Frank L. Lewis,et al.  Control of a nonholonomic mobile robot using neural networks , 1998, IEEE Trans. Neural Networks.

[6]  F. Lewis,et al.  Online adaptive algorithm for optimal control with integral reinforcement learning , 2014 .

[7]  Dongkyoung Chwa,et al.  Tracking Control of Differential-Drive Wheeled Mobile Robots Using a Backstepping-Like Feedback Linearization , 2010, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[8]  Kurt Hornik,et al.  Universal approximation of an unknown mapping and its derivatives using multilayer feedforward networks , 1990, Neural Networks.

[9]  Chih-Yung Chen,et al.  Intelligent omni-directional vision-based mobile robot fuzzy systems design and implementation , 2010, Expert Syst. Appl..

[10]  Sarangapani Jagannathan,et al.  Adaptive neural network‐based optimal control of nonlinear continuous‐time systems in strict‐feedback form , 2014 .

[11]  S. Jagannathan,et al.  Optimal control of affine nonlinear continuous-time systems using an online Hamilton-Jacobi-Isaacs formulation , 2010, 49th IEEE Conference on Decision and Control (CDC).

[12]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[13]  Frank L. Lewis,et al.  Adaptive dynamic programming for online solution of a zero-sum differential game , 2011 .

[14]  Fang Yang,et al.  Adaptive stabilization for nonholonomic mobile robots with uncertain dynamics and unknown visual parameters , 2015 .

[15]  Frank L. Lewis,et al.  Neurodynamic Programming and Zero-Sum Games for Constrained Control Systems , 2008, IEEE Transactions on Neural Networks.

[16]  A. Schaft L/sub 2/-gain analysis of nonlinear systems and nonlinear state-feedback H/sub infinity / control , 1992 .

[17]  B. Finlayson The method of weighted residuals and variational principles : with application in fluid mechanics, heat and mass transfer , 1972 .

[18]  Andrea Bonarini,et al.  An omnidirectional vision sensor for fast tracking for mobile robots , 1999, IMTC/99. Proceedings of the 16th IEEE Instrumentation and Measurement Technology Conference (Cat. No.99CH36309).

[19]  Petros A. Ioannou,et al.  Adaptive control tutorial , 2006, Advances in design and control.

[20]  Xiaoming Hu,et al.  Connectivity maintenance and distributed tracking for double-integrator agents with bounded potential functions , 2015 .

[21]  Marco H. Terra,et al.  Experimental results on the nonlinear H∞ control via quasi-LPV representation and game theory for wheeled mobile robots , 2007, 2007 IEEE International Conference on Control Applications.

[22]  Frank L. Lewis,et al.  Neural Network Control Of Robot Manipulators And Non-Linear Systems , 1998 .

[23]  Frank L. Lewis,et al.  Online solution of nonlinear two-player zero-sum games using synchronous policy iteration , 2010, 49th IEEE Conference on Decision and Control (CDC).

[24]  Ahmad B. Rad,et al.  Indirect adaptive tracking control of a nonholonomic mobile robot via neural networks , 2012, Neurocomputing.

[25]  Marco H. Terra,et al.  Experimental results on the nonlinear ∞ control via quasi-LPV representation and game theory for wheeled mobile robots , 2009, Robotica.

[26]  Nguyen Tan Luy,et al.  Reinforecement learning-based optimal tracking control for wheeled mobile robot , 2012, 2012 IEEE International Conference on Cyber Technology in Automation, Control, and Intelligent Systems (CYBER).

[27]  Bor-Sen Chen,et al.  A nonlinear adaptive H∞ tracking control design in robotic systems via neural networks , 1996, IEEE Trans. Control. Syst. Technol..

[28]  Frank L. Lewis,et al.  Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach , 2005, Autom..

[29]  Nguyen Tan Luy,et al.  Reinforcement learning-based intelligent tracking control for wheeled mobile robot , 2014 .

[30]  Huaguang Zhang,et al.  An iterative adaptive dynamic programming method for solving a class of nonlinear zero-sum differential games , 2011, Autom..

[31]  Norihiko Adachi,et al.  Adaptive tracking control of a nonholonomic mobile robot , 2000, IEEE Trans. Robotics Autom..

[32]  Paul J. Werbos,et al.  Approximate dynamic programming for real-time control and neural modeling , 1992 .

[33]  Zenon Hendzel,et al.  Discrete neural dynamic programming in wheeled mobile robot control , 2011 .

[34]  Alireza Mohammad Shahri,et al.  Adaptive feedback linearizing control of nonholonomic wheeled mobile robots in presence of parametric and nonparametric uncertainties , 2011 .

[35]  Frank L. Lewis,et al.  Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem , 2010, Autom..

[36]  Derong Liu,et al.  Integral Reinforcement Learning for Linear Continuous-Time Zero-Sum Games With Completely Unknown Dynamics , 2014, IEEE Transactions on Automation Science and Engineering.

[37]  Bor-Sen Chen,et al.  A Nonlinear Adaptive Tracking Control Design in Robotic Systems via Neural Networks , 1998 .

[38]  Warren B. Powell,et al.  Approximate Dynamic Programming - Solving the Curses of Dimensionality , 2007 .

[39]  Frank L. Lewis,et al.  Adaptive optimal control for continuous-time linear systems based on policy iteration , 2009, Autom..

[40]  Frank L. Lewis,et al.  Online learning algorithm for zero-sum games with integral reinforcement learning , 2011 .