Reinforecement learning-based optimal tracking control for wheeled mobile robot

This paper proposes a new method to design a reinforcement learning-based integrated kinematic and dynamic tracking control scheme for a nonholonomic wheeled mobile robot. The scheme uses just only one neural network to design an online adaptive synchronous policy iteration algorithm implemented as an actor critic structure. Our tuning law for the single neural network not only learns online a tracking-HJB equation to approximate both the optimal cost and the optimal control law but also guarantees closed-loop stability in real-time. The convergence and stability of the overall system are proven by Lyapunov theory. The simulation results for wheeled mobile robot verify the effectiveness of the proposed controller.

[1]  Wei-Song Lin,et al.  Adaptive critic motion control design of autonomous wheeled mobile robot by dual heuristic programming , 2008, Autom..

[2]  Zenon Hendzel,et al.  Discrete neural dynamic programming in wheeled mobile robot control , 2011 .

[3]  Alireza Mohammad Shahri,et al.  Adaptive feedback linearizing control of nonholonomic wheeled mobile robots in presence of parametric and nonparametric uncertainties , 2011 .

[4]  Frank L. Lewis,et al.  Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach , 2005, Autom..

[5]  Frank L. Lewis,et al.  Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem , 2010, Autom..

[6]  Frank L. Lewis,et al.  Control of a nonholonomic mobile robot using neural networks , 1998, IEEE Trans. Neural Networks.

[7]  Steven J. Bradtke,et al.  Reinforcement Learning Applied to Linear Quadratic Regulation , 1992, NIPS.

[8]  Nguyen Tan Luy,et al.  ROBUST ADAPTIVE CONTROL USING REINFORCEMENT LEARNING FOR NONLINEAR SYSTEM WITH INPUT CONSTRAINTS , 2009 .

[9]  Gao Yanfeng,et al.  Back-Stepping and Neural Network Control of a Mobile Robot for Curved Weld Seam Tracking , 2011 .

[10]  Y. Miyasato Adaptive H∞ control of nonholonomic mobile robot based on inverse optimality , 2008, 2008 American Control Conference.

[11]  Norihiko Adachi,et al.  Adaptive tracking control of a nonholonomic mobile robot , 2000, IEEE Trans. Robotics Autom..

[12]  Frank L. Lewis,et al.  Adaptive optimal control for continuous-time linear systems based on policy iteration , 2009, Autom..

[13]  Nguyen Tan Luy,et al.  Robust reinforcement learning-based tracking control for wheeled mobile robot , 2010, 2010 The 2nd International Conference on Computer and Automation Engineering (ICCAE).

[14]  Klaus-Dieter Kuhnert,et al.  Robust adaptive control of nonholonomic mobile robot with parameter and nonparameter uncertainties , 2005, IEEE Transactions on Robotics.

[15]  Kurt Hornik,et al.  Universal approximation of an unknown mapping and its derivatives using multilayer feedforward networks , 1990, Neural Networks.