Integration of an Inverse Optimal Neural Controller with Reinforced-SLAM for Path Panning and Mapping in Dynamic Environments

This work presents an artificial intelligence approach to solve the problem of finding a path and creating a map in unknown environments using Reinforcement Learning (RL) and Simultaneous Localization and Mapping (SLAM) for a differential mobile robot along with an optimal control system. We propose the integration of these approaches (two of the most widely used ones) for the implementation of robot navigation systems with an efficient method of control composed by a neural identifier and an inverse optimal control in order to obtain a robust and autonomous system of navigation in unknown and dynamic environments.

[1]  Andrew J. Davison,et al.  Mobile Robot Navigation Using Active Vision , 1998 .

[2]  Ricardo Valverde Gil Control de sistemas mediante redes neuronales: aprendizaje por refuerzo , 1999 .

[3]  Sebastian Thrun,et al.  FastSLAM: a factored solution to the simultaneous localization and mapping problem , 2002, AAAI/IAAI.

[4]  A. Alanis,et al.  Real-Time Inverse Optimal Neural Control for Image Based Visual Servoing with Nonholonomic Mobile Robots , 2015 .

[5]  Jong-Hwan Kim,et al.  Sliding mode control for trajectory tracking of nonholonomic wheeled mobile robots , 1999, IEEE Trans. Robotics Autom..

[6]  Zhong-Ping Jiang,et al.  Simultaneous tracking and stabilization of mobile robots: an adaptive approach , 2004, IEEE Transactions on Automatic Control.

[7]  Jin Bae Park,et al.  A Simple Adaptive Control Approach for Trajectory Tracking of Electrically Driven Nonholonomic Mobile Robots , 2010, IEEE Transactions on Control Systems Technology.

[8]  Alma Y. Alanis,et al.  Discrete super twisting control algorithm for the nonholonomic mobile robots tracking problem , 2011, 2011 8th International Conference on Electrical Engineering, Computing Science and Automatic Control.

[9]  John Holland,et al.  Designing Autonomous Mobile Robots: Inside the Mind of an Intelligent Machine , 2003 .

[10]  Hugh Durrant-Whyte,et al.  Simultaneous localization and mapping (SLAM): part II , 2006 .

[11]  Edgar N. Sánchez,et al.  Discrete-time sliding mode controllers for nonholonomic mobile robots trajectory tracking problem , 2011, 2011 8th International Conference on Electrical Engineering, Computing Science and Automatic Control.

[12]  Manolis A. Christodoulou,et al.  Adaptive Control with Recurrent High-order Neural Networks , 2000, Advances in Industrial Control.

[13]  Thomas G. Dietterich What is machine learning? , 2020, Archives of Disease in Childhood.

[14]  Nancy Arana-Daniel,et al.  Reinforced-SLAM for path planing and mapping in dynamic environments , 2011, 2011 8th International Conference on Electrical Engineering, Computing Science and Automatic Control.

[15]  Gamini Dissanayake,et al.  Time optimal robot motion control in simultaneous localization and map building (SLAM) problem , 2004, 2004 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) (IEEE Cat. No.04CH37566).

[16]  Chris Watkins,et al.  Learning from delayed rewards , 1989 .

[17]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[18]  Leslie Pack Kaelbling,et al.  Effective reinforcement learning for mobile robots , 2002, Proceedings 2002 IEEE International Conference on Robotics and Automation (Cat. No.02CH37292).

[19]  C. S. G. Lee,et al.  Robotics: Control, Sensing, Vision, and Intelligence , 1987 .

[20]  Zhong-Ping Jiang,et al.  A recursive technique for tracking control of nonholonomic systems in chained form , 1999, IEEE Trans. Autom. Control..

[21]  Mo M. Jamshidi,et al.  Sensor Fusion Based Autonomous Mobile Robot Navigation , 2007, 2007 IEEE International Conference on System of Systems Engineering.

[22]  Peter Dayan,et al.  Q-learning , 1992, Machine Learning.

[23]  Indra Narayan Kar,et al.  Design and implementation of an adaptive fuzzy logic-based controller for wheeled mobile robots , 2006, IEEE Transactions on Control Systems Technology.

[24]  Hugh F. Durrant-Whyte,et al.  Simultaneous localization and mapping: part I , 2006, IEEE Robotics & Automation Magazine.

[25]  Kumpati S. Narendra,et al.  Identification and control of dynamical systems using neural networks , 1990, IEEE Trans. Neural Networks.

[26]  Alberto Sanfeliu,et al.  Temporal landmark validation in CML , 2003, 2003 IEEE International Conference on Robotics and Automation (Cat. No.03CH37422).

[27]  Wolfram Burgard,et al.  Efficient estimation of accurate maximum likelihood maps in 3D , 2007, 2007 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[28]  A. Vicino,et al.  Mobile robot SLAM for line-based environment representation , 2005, Proceedings of the 44th IEEE Conference on Decision and Control.

[29]  Mo M. Jamshidi,et al.  Neural network based identification of robot dynamics used for neuro-fuzzy controller , 1997, Proceedings of International Conference on Robotics and Automation.

[30]  Frank L. Lewis,et al.  Control of a nonholonomic mobile robot using neural networks , 1998, IEEE Trans. Neural Networks.

[31]  Nobuyuki Kita,et al.  3D simultaneous localisation and map-building using active vision for a robot moving on undulating terrain , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.