Experience-Based Model Selection to Enable Long-Term, Safe Control for Repetitive Tasks Under Changing Conditions

Learning approaches have enabled significant performance improvements in robotic control allowing robots to execute motions that were previously impossible. The majority of the work to date, however, assumes that the parts to be learned are static or slowly changing, which limits their applicability in realistic scenarios with rapid changes in the conditions. This paper presents a method to extend an existing single-mode safe learning controller based on Gaussian Process Regression to learn an increasing number of non-linear models for the robot dynamics. We show that this approach enables a robot to re-use past experiences from a large number of previously visited operating conditions, and to safely adapt when a new and distinct operating condition is encountered. This allows the robot to achieve safety and high performance in a large number of operating conditions that do not have to be specified ahead of time. Our approach runs independently from the controller, imposing no additional computation time on the control loop regardless of the number of previous operating conditions considered. We demonstrate the effectiveness of our approach in experiment on a 900 kg ground robot with both physical and artificial changes to its dynamics. All of our experiments are conducted using vision for localization.

[1]  Johan Larsson,et al.  Industrial-scale autonomous wheeled-vehicle path following by combining iterative learning control with feedback linearization , 2017, 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[2]  S. Shankar Sastry,et al.  Provably safe and robust learning-based model predictive control , 2011, Autom..

[3]  Sergey Levine,et al.  Optimism-driven exploration for nonlinear systems , 2015, 2015 IEEE International Conference on Robotics and Automation (ICRA).

[4]  Jonathan P. How,et al.  Probabilistically safe motion planning to avoid dynamic obstacles with uncertain motion patterns , 2013, Auton. Robots.

[5]  Pieter Abbeel,et al.  Learning accurate kinematic control of cable-driven surgical robots using data cleaning and Gaussian Process Regression , 2014, 2014 IEEE International Conference on Automation Science and Engineering (CASE).

[6]  Michael I. Jordan,et al.  Nonparametric Bayesian Learning of Switching Linear Dynamical Systems , 2008, NIPS.

[7]  Christopher D. McKinnon,et al.  Learning multimodal models for robot dynamics online with a mixture of Gaussian process experts , 2017, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[8]  Jan Peters,et al.  Learning inverse dynamics models with contacts , 2015, 2015 IEEE International Conference on Robotics and Automation (ICRA).

[9]  Carl E. Rasmussen,et al.  Gaussian processes for machine learning , 2005, Adaptive computation and machine learning.

[10]  François Pomerleau,et al.  Expanding the Limits of Vision-based Localization for Long-term Route-following Autonomy , 2017, J. Field Robotics.

[11]  Myoungho Sunwoo,et al.  Interacting Multiple Model Filter-Based Sensor Fusion of GPS With In-Vehicle Sensors for Real-Time Vehicle Positioning , 2012, IEEE Transactions on Intelligent Transportation Systems.

[12]  Nolan Wagener,et al.  Information theoretic MPC for model-based reinforcement learning , 2017, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[13]  Jonathan P. How,et al.  Robust Trajectory Planning for Autonomous Parafoils under Wind Uncertainty , 2013 .

[14]  Sergey Levine,et al.  Model-based reinforcement learning with parametrized physical models and optimism-driven exploration , 2015, 2016 IEEE International Conference on Robotics and Automation (ICRA).

[15]  Jean-Baptiste Mouret,et al.  Bayesian Optimization with Automatic Prior Selection for Data-Efficient Direct Policy Search , 2017, 2018 IEEE International Conference on Robotics and Automation (ICRA).

[16]  Angela P. Schoellig,et al.  Conservative to confident: Treating uncertainty robustly within Learning-Based Control , 2015, 2015 IEEE International Conference on Robotics and Automation (ICRA).

[17]  Claire J. Tomlin,et al.  Learning-based model predictive control on a quadrotor: Onboard implementation and experimental results , 2012, 2012 IEEE International Conference on Robotics and Automation.

[18]  Angela P. Schoellig,et al.  Learning-based nonlinear model predictive control to improve vision-based mobile robot path-tracking in challenging outdoor environments , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).

[19]  Anelia Nedelcheva Angelova,et al.  Visual Prediction of Rover Slip: Learning Algorithms and Field Experiments , 2008 .

[20]  Andreas Krause,et al.  Safe controller optimization for quadrotors with Gaussian processes , 2015, 2016 IEEE International Conference on Robotics and Automation (ICRA).

[21]  Claire J. Tomlin,et al.  Reducing Conservativeness in Safety Guarantees by Learning Disturbances Online: Iterated Guaranteed Safe Online Learning , 2012, Robotics: Science and Systems.

[22]  Angela P. Schoellig,et al.  Robust Constrained Learning-based NMPC enabling reliable mobile robot path tracking , 2016, Int. J. Robotics Res..

[23]  Angela P. Schoellig,et al.  Deep neural networks for improved, impromptu trajectory tracking of quadrotors , 2016, 2017 IEEE International Conference on Robotics and Automation (ICRA).