论文信息 - Information-Theoretic Model Predictive Control: Theory and Applications to Autonomous Driving

Information-Theoretic Model Predictive Control: Theory and Applications to Autonomous Driving

We present an information-theoretic approach to stochastic optimal control problems that can be used to derive general sampling-based optimization schemes. This new mathematical method is used to develop a sampling-based model predictive control algorithm. We apply this information-theoretic model predictive control scheme to the task of aggressive autonomous driving around a dirt test track, and compare its performance with a model predictive control version of the cross-entropy method.

[1] M. Gerdts,et al. Generating locally optimal trajectories for an automatically driven car , 2009 .

[2] Marc Toussaint,et al. On Stochastic Optimal Control and Reinforcement Learning by Approximate Inference , 2012, Robotics: Science and Systems.

[3] Olivier Sigaud,et al. Path Integral Policy Improvement with Covariance Matrix Adaptation , 2012, ICML.

[4] Luke Fletcher,et al. A perception‐driven autonomous urban vehicle , 2008, J. Field Robotics.

[5] Sebastian Thrun,et al. Toward robotic cars , 2010, CACM.

[6] Sebastian Thrun,et al. Stanley: The robot that won the DARPA Grand Challenge , 2006, J. Field Robotics.

[7] Emilio Frazzoli,et al. A Survey of Motion Planning and Control Techniques for Self-Driving Urban Vehicles , 2016, IEEE Transactions on Intelligent Vehicles.

[8] A. Lambert,et al. Path Planning using a Dynamic Vehicle Model , 2006, 2006 2nd International Conference on Information & Communication Technologies.

[9] H. Kappen. Path integrals and symmetry breaking for optimal control theory , 2005, physics/0505066.

[10] Geoffrey E. Hinton,et al. Using Expectation-Maximization for Reinforcement Learning , 1997, Neural Computation.

[11] Manfred Morari,et al. Optimization‐based autonomous racing of 1:43 scale RC cars , 2015, ArXiv.

[12] Giuseppe Oriolo,et al. Feedback control of a nonholonomic car-like robot , 1998 .

[13] Frank Dellaert,et al. IMU Preintegration on Manifold for Efficient Visual-Inertial Maximum-a-Posteriori Estimation , 2015, Robotics: Science and Systems.

[14] Evangelos Theodorou,et al. Nonlinear Stochastic Control and Information Theoretic Dualities: Connections, Interdependencies and Thermodynamic Interpretations , 2015, Entropy.

[15] Peter King,et al. Odin: Team VictorTango's entry in the DARPA Urban Challenge , 2008, J. Field Robotics.

[16] Gabe Sibley,et al. Realtime Simulation-in-the-Loop Control for Agile Ground Vehicles , 2013, TAROS.

[17] Efstathios Velenis,et al. Modeling aggressive maneuvers on loose surfaces: The cases of Trail-Braking and Pendulum-Turn , 2007, 2007 European Control Conference (ECC).

[18] William Whittaker,et al. Autonomous driving in urban environments: Boss and the Urban Challenge , 2008, J. Field Robotics.

[19] Nolan Wagener,et al. Information theoretic MPC for model-based reinforcement learning , 2017, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[20] Zelda B. Zabinsky,et al. Stochastic Adaptive Search for Global Optimization , 2003 .

[21] Evangelos Theodorou,et al. Relative entropy and free energy dualities: Connections to Path Integral and KL control , 2012, 2012 IEEE 51st IEEE Conference on Decision and Control (CDC).

[22] Moritz Diehl,et al. ACADO toolkit—An open‐source framework for automatic control and dynamic optimization , 2011 .

[23] Frank Dellaert,et al. iSAM2: Incremental smoothing and mapping using the Bayes tree , 2012, Int. J. Robotics Res..

[24] Stefan Schaal,et al. Policy Gradient Methods for Robotics , 2006, 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[25] Pakharuddin Mohd. Samin,et al. Integration of Magic Formula Tire Model with Vehicle Handling Model , 2012 .

[26] M. Kac. On distributions of certain Wiener functionals , 1949 .

[27] Yuval Tassa,et al. An integrated system for real-time model predictive control of humanoid robots , 2013, 2013 13th IEEE-RAS International Conference on Humanoid Robots (Humanoids).

[28] Stefan Schaal,et al. Reinforcement learning by reward-weighted regression for operational space control , 2007, ICML '07.

[29] D. Dolgov. Practical Search Techniques in Path Planning for Autonomous Driving , 2008 .

[30] Christos Katrakazas,et al. Real-time motion planning methods for autonomous on-road driving: State-of-the-art and future research directions , 2015 .

[31] Thierry Fraichard,et al. Path planning with uncertainty for car-like robots , 1998, Proceedings. 1998 IEEE International Conference on Robotics and Automation (Cat. No.98CH36146).

[32] Rami Yusef Hindiyeh,et al. Dynamics and control of drifting in automobiles , 2013 .

[33] Evangelos A. Theodorou,et al. Model Predictive Path Integral Control: From Theory to Parallel Computation , 2017 .

[34] Timothy J. Robinson,et al. Sequential Monte Carlo Methods in Practice , 2003 .

[35] James M. Rehg,et al. Aggressive driving with model predictive path integral control , 2016, 2016 IEEE International Conference on Robotics and Automation (ICRA).

[36] Martial Hebert,et al. Improving Multi-Step Prediction of Learned Time Series Models , 2015, AAAI.

[37] Marin Kobilarov,et al. Cross-entropy motion planning , 2012, Int. J. Robotics Res..

[38] Jan Åslund,et al. Investigating Vehicle Model Detail for Close to Limit Maneuvers Aiming at Optimal Control , 2011 .

[39] Stefan Schaal,et al. A Generalized Path Integral Control Approach to Reinforcement Learning , 2010, J. Mach. Learn. Res..

[40] Mario Zanon,et al. Time-optimal race car driving using an online exact hessian based nonlinear MPC algorithm , 2016, 2016 European Control Conference (ECC).

[41] Martial Hebert,et al. Improved Learning of Dynamics Models for Control , 2016, ISER.

[42] Emilio Frazzoli,et al. Steady-state cornering equilibria and stabilisation for a vehicle during extreme operating conditions , 2010 .

[43] Daniel J. Fagnant,et al. Preparing a Nation for Autonomous Vehicles: Opportunities, Barriers and Policy Recommendations , 2015 .

[44] David Q. Mayne,et al. Model predictive control: Recent developments and future promise , 2014, Autom..

[45] James M. Rehg,et al. AutoRally: An Open Platform for Aggressive Autonomous Driving , 2018, IEEE Control Systems.

[46] A. Savitzky,et al. Smoothing and Differentiation of Data by Simplified Least Squares Procedures. , 1964 .

[47] Panagiotis Tsiotras,et al. Real-Time Near-Optimal Feedback Control of Aggressive Vehicle Maneuvers , 2014 .

[48] Jonathan P. How,et al. Real-Time Motion Planning With Applications to Autonomous Urban Driving , 2009, IEEE Transactions on Control Systems Technology.

[49] Alain Lambert,et al. Safe Path Planning in an Uncertain-Configuration Space using RRT , 2006, 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems.