Information-Theoretic Model Predictive Control: Theory and Applications to Autonomous Driving

We present an information-theoretic approach to stochastic optimal control problems that can be used to derive general sampling-based optimization schemes. We use this approach to develop a sampling-based model predictive control algorithm. We apply the resulting information-theoretic model predictive control scheme to aggressive autonomous driving around a dirt test track and compare its performance with a model predictive control version of the cross-entropy method.
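To make the idea of a sampling-based model predictive control update concrete, the following is a minimal sketch of one generic importance-weighted update, in the spirit of path-integral and information-theoretic control. It is not the algorithm derived in the paper. The toy 1-D point-mass dynamics, the quadratic cost, the function names, and the parameters `sigma` (sampling standard deviation) and `lam` (temperature) are all assumptions chosen for illustration.

```python
import math
import random


def rollout_cost(state, controls, dt=0.1, target=1.0):
    """Simulate a hypothetical 1-D point mass and accumulate a quadratic cost."""
    pos, vel = state
    cost = 0.0
    for u in controls:
        vel += u * dt          # control is an acceleration
        pos += vel * dt
        cost += (pos - target) ** 2 + 0.1 * vel ** 2
    return cost


def sampling_mpc_update(state, nominal, num_samples=200, sigma=1.0, lam=1.0, rng=None):
    """Return an updated control sequence from exponentially weighted samples.

    Assumed scheme: perturb the nominal sequence with Gaussian noise, score
    each perturbed sequence by its rollout cost, and average the perturbations
    with weights proportional to exp(-cost / lam).
    """
    rng = rng or random.Random(0)
    noises, costs = [], []
    for _ in range(num_samples):
        eps = [rng.gauss(0.0, sigma) for _ in nominal]
        noises.append(eps)
        costs.append(rollout_cost(state, [u + e for u, e in zip(nominal, eps)]))

    # Subtract the minimum cost before exponentiating for numerical stability.
    best = min(costs)
    weights = [math.exp(-(c - best) / lam) for c in costs]
    total = sum(weights)

    # New nominal = old nominal + weighted average of the sampled perturbations.
    return [
        u + sum(w * eps[t] for w, eps in zip(weights, noises)) / total
        for t, u in enumerate(nominal)
    ]


if __name__ == "__main__":
    start = (0.0, 0.0)
    plan = [0.0] * 15
    new_plan = sampling_mpc_update(start, plan, rng=random.Random(0))
    print(rollout_cost(start, plan), rollout_cost(start, new_plan))
```

In a receding-horizon loop, one would typically apply only the first control of the updated sequence, shift the remainder forward by one step, and repeat the update from the newly observed state. A cross-entropy-style variant would instead refit the sampling distribution to the lowest-cost samples rather than averaging all samples with exponential weights.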
