Adaptive neural network tracking control-based reinforcement learning for wheeled mobile robots with skidding and slipping

Abstract To track the desired trajectories of the wheeled mobile robot (WMR) with time-varying forward direction, a reinforcement learning-based adaptive neural tracking algorithm is proposed for the nonlinear discrete-time (DT) dynamic system of the WMR with skidding and slipping. And, the typical model is transformed into an affine nonlinear DT system, the constraint of the coupling robot input torque is extended to pseudo dead zone (PDZ) control input. Three neural networks (NNs) are introduced as action NNs to approximate the unknown modeling item, the skidding and the slipping item and the PDZ item, whereas another NN is employed as critic NN to approximate the strategy utility function. Then, the critic and action NN adaptive laws are designed through the standard gradient-based adaptation method. The uniform ultimate boundedness (UUB) of all signals in the affine nonlinear DT WMR system can be ensured, while the tracking error converging to a small compact set by zero. Numerical simulations are conduced to validate the proposed method.

[1]  Qinmin Yang,et al.  Reinforcement Learning Controller Design for Affine Nonlinear Discrete-Time Systems using Online Approximators , 2012, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[2]  Zhijun Li,et al.  Path-Following Control of Wheeled Planetary Exploration Robots Moving on Deformable Rough Terrain , 2014, TheScientificWorldJournal.

[3]  Yunong Zhang,et al.  Robust adaptive motion/force control for wheeled inverted pendulums , 2010, Autom..

[4]  Shaocheng Tong,et al.  Fuzzy Approximation-Based Adaptive Backstepping Optimal Control for a Class of Nonlinear Discrete-Time Systems With Dead-Zone , 2016, IEEE Transactions on Fuzzy Systems.

[5]  Shaocheng Tong,et al.  Optimal Control-Based Adaptive NN Design for a Class of Nonlinear Discrete-Time Block-Triangular Systems , 2016, IEEE Transactions on Cybernetics.

[6]  Shuang Zhang,et al.  Control Design for Nonlinear Flexible Wings of a Robotic Aircraft , 2017, IEEE Transactions on Control Systems Technology.

[7]  Tong Heng Lee,et al.  Design and Implementation of Integral Sliding-Mode Control on an Underactuated Two-Wheeled Mobile Robot , 2014, IEEE Transactions on Industrial Electronics.

[8]  Shaocheng Tong,et al.  Reinforcement Learning Design-Based Adaptive Tracking Control With Less Learning Parameters for Nonlinear Discrete-Time MIMO Systems , 2015, IEEE Transactions on Neural Networks and Learning Systems.

[9]  Jan Peters,et al.  Reinforcement learning in robotics: A survey , 2013, Int. J. Robotics Res..

[10]  Steven Dubowsky,et al.  Online terrain parameter estimation for wheeled mobile robots with application to planetary rovers , 2004, IEEE Transactions on Robotics.

[11]  Wei He,et al.  Adaptive Neural Network Control of an Uncertain Robot With Full-State Constraints , 2016, IEEE Transactions on Cybernetics.

[12]  Dong-Juan Li,et al.  Adaptive Neural Tracking Control for Nonlinear Time-Delay Systems With Full State Constraints , 2017, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[13]  Warren E. Dixon,et al.  Homography-based visual servo tracking control of a wheeled mobile robot , 2006, IEEE Transactions on Robotics.

[14]  Arvin Agah,et al.  Mobile Robots for Polar Remote Sensing , 2009 .

[15]  Dongkyoung Chwa,et al.  Sliding-mode tracking control of nonholonomic wheeled mobile robots in polar coordinates , 2004, IEEE Transactions on Control Systems Technology.

[16]  Mou Chen,et al.  Disturbance Attenuation Tracking Control for Wheeled Mobile Robots With Skidding and Slipping , 2017, IEEE Transactions on Industrial Electronics.

[17]  Jun-Ho Oh,et al.  Tracking control of a two-wheeled mobile robot using inputoutput linearization , 1999 .

[18]  Jin Zhang,et al.  Adaptive neural network control for a class of MIMO nonlinear systems with disturbances in discrete-time , 2004, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[19]  Frank L. Lewis,et al.  Control of a nonholonomic mobile robot using neural networks , 1998, IEEE Trans. Neural Networks.

[20]  Nguyen Tan Luy,et al.  Reinforcement learning-based intelligent tracking control for wheeled mobile robot , 2014 .

[21]  Danwei Wang,et al.  GPS-Based Tracking Control for a Car-Like Wheeled Mobile Robot With Skidding and Slipping , 2008, IEEE/ASME Transactions on Mechatronics.

[22]  C. L. Philip Chen,et al.  Adaptive Fuzzy Asymptotic Control of MIMO Systems With Unknown Input Coefficients Via a Robust Nussbaum Gain-Based Approach , 2017, IEEE Transactions on Fuzzy Systems.

[23]  Shaocheng Tong,et al.  Adaptive Fuzzy Output Feedback Control for Switched Nonstrict-Feedback Nonlinear Systems With Input Nonlinearities , 2016, IEEE Transactions on Fuzzy Systems.

[24]  Liang Ding,et al.  Experimental study and analysis of the wheels’ steering mechanics for planetary exploration wheeled mobile robots moving on deformable terrain , 2013, Int. J. Robotics Res..

[25]  Chun-Yi Su,et al.  Neural Control of Bimanual Robots With Guaranteed Global Stability and Motion Precision , 2017, IEEE Transactions on Industrial Informatics.

[26]  Danwei Wang,et al.  Modeling and Analysis of Skidding and Slipping in Wheeled Mobile Robots: Control Design Perspective , 2008, IEEE Transactions on Robotics.

[27]  Georges Bastin,et al.  Structural properties and classification of kinematic and dynamic models of wheeled mobile robots , 1996, IEEE Trans. Robotics Autom..

[28]  Mignon Park,et al.  Generalized Extended State Observer Approach to Robust Tracking Control for Wheeled Mobile Robot with Skidding and Slipping , 2013 .

[29]  C. L. Philip Chen,et al.  Adaptive Position/Attitude Tracking Control of Aerial Robot With Unknown Inertial Matrix Based on a New Robust Neural Identifier , 2016, IEEE Transactions on Neural Networks and Learning Systems.

[30]  Xingjian Wang,et al.  Teleoperation Control Based on Combination of Wave Variable and Neural Networks , 2017, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[31]  Changyin Sun,et al.  Adaptive Neural Network Control of a Flapping Wing Micro Aerial Vehicle With Disturbance Observer , 2017, IEEE Transactions on Cybernetics.

[32]  F.L. Lewis,et al.  Reinforcement learning and adaptive dynamic programming for feedback control , 2009, IEEE Circuits and Systems Magazine.

[33]  Chunming Liu,et al.  A hierarchical reinforcement learning approach for optimal path tracking of wheeled mobile robots , 2012, Neural Computing and Applications.

[34]  S. J. Yoo,et al.  Adaptive tracking control for a class of wheeled mobile robots with unknown skidding and slipping , 2010 .

[35]  Ruifeng Li,et al.  Interface Design of a Physical Human–Robot Interaction System for Human Impedance Adaptive Skill Transfer , 2018, IEEE Transactions on Automation Science and Engineering.

[36]  Shaocheng Tong,et al.  Barrier Lyapunov Functions-based adaptive control for a class of nonlinear pure-feedback systems with full state constraints , 2016, Autom..

[37]  Michael A. Goodrich,et al.  Human-Robot Interaction: A Survey , 2008, Found. Trends Hum. Comput. Interact..

[38]  Peng Shi,et al.  Nonlinear Control for Tracking and Obstacle Avoidance of a Wheeled Mobile Robot With Nonholonomic Constraint , 2016, IEEE Transactions on Control Systems Technology.

[39]  Ashitava Ghosal,et al.  Modeling of slip for wheeled mobile robots , 1995, IEEE Trans. Robotics Autom..

[40]  Danwei Wang,et al.  Modeling Skidding and Slipping in Wheeled Mobile Robots: Control Design Perspective , 2006, 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[41]  Eun-Ju Hwang,et al.  Robust Backstepping Control Based on a Lyapunov Redesign for Skid-Steered Wheeled Mobile Robots , 2013 .

[42]  Derong Liu,et al.  Integral Reinforcement Learning for Linear Continuous-Time Zero-Sum Games With Completely Unknown Dynamics , 2014, IEEE Transactions on Automation Science and Engineering.

[43]  Chenguang Yang,et al.  Neural-Adaptive Output Feedback Control of a Class of Transportation Vehicles Based on Wheeled Inverted Pendulum Models , 2012, IEEE Transactions on Control Systems Technology.