Learning-based model predictive control on a quadrotor: Onboard implementation and experimental results

In this paper, we present details of the real-time implementation of learning-based model predictive control (LBMPC) onboard a quadrotor helicopter. LBMPC rigorously combines statistical learning with control engineering while providing guarantees on safety, robustness, and convergence. Experimental results show that LBMPC can learn physically based updates to an initial model and, as a result, improves transient response performance. We also demonstrate robustness to mis-learning. Finally, we show the use of LBMPC in an integrated robotic task demonstration: the quadrotor is used to catch a ball thrown with an a priori unknown trajectory.
