论文信息 - Robust Guarantees for Perception-Based Control

Robust Guarantees for Perception-Based Control

Motivated by vision based control of autonomous vehicles, we consider the problem of controlling a known linear dynamical system for which partial state information, such as vehicle position, can only be extracted from high-dimensional data, such as an image. Our approach is to learn a perception map from high-dimensional data to partial-state observation and its corresponding error profile, and then design a robust controller. We show that under suitable smoothness assumptions on the perception map and generative model relating state to high-dimensional data, an affine error model is sufficiently rich to capture all possible error profiles, and can further be learned via a robust regression problem. We then show how to integrate the learned perception map and error model into a novel robust control synthesis procedure, and prove that the resulting perception and control loop has favorable generalization properties. Finally, we illustrate the usefulness of our approach on a synthetic example and on the self-driving car simulation platform CARLA.

[1] Sergey Levine,et al. End-to-End Training of Deep Visuomotor Policies , 2015, J. Mach. Learn. Res..

[2] Mayuresh V. Kothare,et al. Robust output feedback model predictive control using off-line linear matrix inequalities , 2001, Proceedings of the 2001 American Control Conference. (Cat. No.01CH37148).

[3] Nikolai Matni,et al. Finite-Data Performance Guarantees for the Output-Feedback Control of an Unknown System , 2018, 2018 IEEE Conference on Decision and Control (CDC).

[4] B. P. Zhang,et al. Estimation of the Lipschitz constant of a function , 1996, J. Glob. Optim..

[5] Michael R Chernick,et al. Bootstrap Methods: A Guide for Practitioners and Researchers , 2007 .

[6] Byron Boots,et al. Agile Autonomous Driving using End-to-End Deep Imitation Learning , 2017, Robotics: Science and Systems.

[7] Andrea Montanari,et al. Learning Networks of Stochastic Differential Equations , 2010, NIPS.

[8] Anders Rantzer,et al. Concentration Bounds for Single Parameter Adaptive Control , 2018, 2018 Annual American Control Conference (ACC).

[9] Benjamin Van Roy,et al. A Tutorial on Thompson Sampling , 2017, Found. Trends Mach. Learn..

[10] Munther A. Dahleh,et al. Finite-Time System Identification for Partially Observed LTI Systems of Unknown Order , 2019, ArXiv.

[11] Alessandro Lazaric,et al. Improved Regret Bounds for Thompson Sampling in Linear Quadratic Control Problems , 2018, ICML.

[12] Vijay Kumar,et al. Estimation, Control, and Planning for Aggressive Flight With a Small Quadrotor With a Single Camera and IMU , 2017, IEEE Robotics and Automation Letters.

[13] R. Rockafellar,et al. Optimization of conditional value-at risk , 2000 .

[14] Nikolai Matni,et al. Learning Sparse Dynamical Systems from a Single Sample Trajectory , 2019, 2019 IEEE 58th Conference on Decision and Control (CDC).

[15] Andreas Krause,et al. Safe Model-based Reinforcement Learning with Stability Guarantees , 2017, NIPS.

[16] Tengyu Ma,et al. Gradient Descent Learns Linear Dynamical Systems , 2016, J. Mach. Learn. Res..

[17] Benjamin Recht,et al. Certainty Equivalent Control of LQR is Efficient , 2019, ArXiv.

[18] D. Hinrichsen,et al. Real and Complex Stability Radii: A Survey , 1990 .

[19] Roland Siegwart,et al. A robust and modular multi-sensor fusion approach applied to MAV navigation , 2013, 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[20] Xin Zhang,et al. End to End Learning for Self-Driving Cars , 2016, ArXiv.

[21] Friedrich Fraundorfer,et al. Visual Odometry Part I: The First 30 Years and Fundamentals , 2022 .

[22] George J. Pappas,et al. Finite Sample Analysis of Stochastic System Identification , 2019, 2019 IEEE 58th Conference on Decision and Control (CDC).

[23] Alexander Rakhlin,et al. Near optimal finite time identification of arbitrary linear dynamical systems , 2018, ICML.

[24] Alexander Rakhlin,et al. How fast can linear dynamical systems be learned? , 2018, ArXiv.

[25] Shie Mannor,et al. Online Learning for Adversaries with Memory: Price of Past Mistakes , 2015, NIPS.

[26] Angela P. Schoellig,et al. Learning-based nonlinear model predictive control to improve vision-based mobile robot path-tracking in challenging outdoor environments , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).

[27] Kristiaan Pelckmans,et al. Worst-Case Prediction Performance Analysis of the Kalman Filter , 2016, IEEE Transactions on Automatic Control.

[28] Nevena Lazic,et al. Model-Free Linear Quadratic Control via Reduction to Expert Prediction , 2018, AISTATS.

[29] Aaron D. Ames,et al. Episodic Learning with Control Lyapunov Functions for Uncertain Robotic Systems* , 2019, 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[30] Yi Ouyang,et al. Learning-based Control of Unknown Linear Systems with Thompson Sampling , 2017, ArXiv.

[31] Vijay Kumar,et al. Aggressive Flight With Suspended Payloads Using Vision-Based Control , 2018, IEEE Robotics and Automation Letters.

[32] Samet Oymak,et al. Non-asymptotic Identification of LTI Systems from a Single Trajectory , 2018, 2019 American Control Conference (ACC).

[33] J. Pearson,et al. l^{1} -optimal feedback controllers for MIMO discrete-time systems , 1987 .

[34] Dimitrios G. Kottas,et al. Camera-IMU-based localization: Observability analysis and consistency improvement , 2014, Int. J. Robotics Res..

[35] Gaurav S. Sukhatme,et al. Visual-Inertial Sensor Fusion: Localization, Mapping and Sensor-to-Sensor Self-calibration , 2011, Int. J. Robotics Res..

[36] Jitendra Malik,et al. Combining Optimal Control and Learning for Visual Navigation in Novel Environments , 2019, CoRL.

[37] Tamal Mukherjee,et al. System-Level Synthesis , 2003 .

[38] Andrew Y. Ng,et al. Learning Feature Representations with K-Means , 2012, Neural Networks: Tricks of the Trade.

[39] Kim Peter Wabersich,et al. Linear Model Predictive Safety Certification for Learning-Based Control , 2018, 2018 IEEE Conference on Decision and Control (CDC).

[40] Csaba Szepesvári,et al. Regret Bounds for the Adaptive Control of Linear Quadratic Systems , 2011, COLT.

[41] Stefano Soatto,et al. Visual-inertial navigation, mapping and localization: A scalable real-time causal approach , 2011, Int. J. Robotics Res..

[42] Nikolai Matni,et al. Scalable system level synthesis for virtually localizable systems , 2017, 2017 IEEE 56th Annual Conference on Decision and Control (CDC).

[43] D. Freedman,et al. Some Asymptotic Theory for the Bootstrap , 1981 .

[44] Nikolai Matni,et al. Regret Bounds for Robust Adaptive Control of the Linear Quadratic Regulator , 2018, NeurIPS.

[45] Nikolai Matni,et al. Structured state space realizations for SLS distributed controllers , 2017, 2017 55th Annual Allerton Conference on Communication, Control, and Computing (Allerton).

[46] Sergey Levine,et al. (CAD)$^2$RL: Real Single-Image Flight without a Single Real Image , 2016, Robotics: Science and Systems.

[47] Ather Gattami,et al. H infinity Analysis Revisited , 2014, ArXiv.

[48] Dean Pomerleau,et al. ALVINN, an autonomous land vehicle in a neural network , 2015 .

[49] Michael I. Jordan,et al. Learning Without Mixing: Towards A Sharp Analysis of Linear System Identification , 2018, COLT.

[50] B. Dumitrescu. Positive Trigonometric Polynomials and Signal Processing Applications , 2007 .

[51] Gábor Orosz,et al. End-to-End Safe Reinforcement Learning through Barrier Functions for Safety-Critical Continuous Control Tasks , 2019, AAAI.

[52] James M. Rehg,et al. Information-Theoretic Model Predictive Control: Theory and Applications to Autonomous Driving , 2017, IEEE Transactions on Robotics.

[53] Michael Bosse,et al. Get Out of My Lab: Large-scale, Real-Time Visual-Inertial Localization , 2015, Robotics: Science and Systems.

[54] Byron Boots,et al. Deep Forward and Inverse Perceptual Models for Tracking and Prediction , 2017, 2018 IEEE International Conference on Robotics and Automation (ICRA).

[55] David Q. Mayne,et al. Robust output feedback model predictive control of constrained linear systems , 2006, Autom..

[56] Juan D. Tardós,et al. ORB-SLAM2: An Open-Source SLAM System for Monocular, Stereo, and RGB-D Cameras , 2016, IEEE Transactions on Robotics.

[57] Nikolai Matni,et al. A System-Level Approach to Controller Synthesis , 2016, IEEE Transactions on Automatic Control.

[58] Shie Mannor,et al. Online Learning for Time Series Prediction , 2013, COLT.

[59] Thomas Kailath,et al. H∞ bounds for least-squares estimators , 2001, IEEE Trans. Autom. Control..

[60] Sham M. Kakade,et al. Online Control with Adversarial Disturbances , 2019, ICML.

[61] Yaroslav D. Sergeyev,et al. Lipschitz Global Optimization , 2011 .

[62] Nikolai Matni,et al. On the Sample Complexity of the Linear Quadratic Regulator , 2017, Foundations of Computational Mathematics.

[63] Jürgen Schmidhuber,et al. A Machine Learning Approach to Visual Perception of Forest Trails for Mobile Robots , 2016, IEEE Robotics and Automation Letters.

[64] Alexey Dosovitskiy,et al. End-to-End Driving Via Conditional Imitation Learning , 2017, 2018 IEEE International Conference on Robotics and Automation (ICRA).

[65] Juraj Kabzan,et al. Cautious Model Predictive Control Using Gaussian Process Regression , 2017, IEEE Transactions on Control Systems Technology.

[66] Yi Lin,et al. Autonomous aerial navigation using monocular visual‐inertial fusion , 2018, J. Field Robotics.

[67] Jaime F. Fisac,et al. Reachability-based safe learning with Gaussian processes , 2014, 53rd IEEE Conference on Decision and Control.

[68] Mehryar Mohri,et al. Time series prediction and online learning , 2016, COLT.

[69] Germán Ros,et al. CARLA: An Open Urban Driving Simulator , 2017, CoRL.

[70] Yishay Mansour,et al. Learning Linear-Quadratic Regulators Efficiently with only $\sqrt{T}$ Regret , 2019, ICML.

[71] Salar Fattahi,et al. Data-Driven Sparse System Identification , 2018, 2018 56th Annual Allerton Conference on Communication, Control, and Computing (Allerton).