Finite-Data Performance Guarantees for the Output-Feedback Control of an Unknown System

As the systems we control become more complex, first-principle modeling becomes either impossible or intractable, motivating the use of machine learning techniques for the control of systems with continuous action spaces. As impressive as the empirical successes of these methods have been, strong theoretical guarantees of performance, safety, or robustness are few and far between. This paper takes a step towards providing such guarantees by establishing finite-data performance guarantees for the robust output-feedback control of an unknown FIR SISO system. In particular, we introduce the “Coarse-ID control” pipeline, which is composed of a system identification step followed by a robust controller synthesis procedure, and analyze its end-to-end performance, providing quantitative bounds on the performance degradation suffered due to model uncertainty as a function of the number of experiments used to identify the system. We conclude with numerical examples demonstrating the effectiveness of our method.

[1]  Biao Huang,et al.  System Identification , 2000, Control Theory for Physicists.

[2]  Benjamin Recht,et al.  Non-Asymptotic Analysis of Robust Control from Coarse-Grained Identification , 2017, ArXiv.

[3]  Alexander Goldenshluger,et al.  Nonparametric Estimation of Transfer Functions: Rates of Convergence and Adaptation , 1998, IEEE Trans. Inf. Theory.

[4]  Gábor Lugosi,et al.  Concentration Inequalities - A Nonasymptotic Theory of Independence , 2013, Concentration Inequalities.

[5]  Parikshit Shah,et al.  Linear system identification via atomic norm regularization , 2012, 2012 IEEE 51st IEEE Conference on Decision and Control (CDC).

[6]  Stephen P. Boyd,et al.  Convex Optimization , 2004, Algorithms and Theory of Computation Handbook.

[7]  Nikolai Matni,et al.  Scalable system level synthesis for virtually localizable systems , 2017, 2017 IEEE 56th Annual Conference on Decision and Control (CDC).

[8]  Aaas News,et al.  Book Reviews , 1893, Buffalo Medical and Surgical Journal.

[9]  Stefan Schaal,et al.  On the design of LQR kernels for efficient controller learning , 2017, 2017 IEEE 56th Annual Conference on Decision and Control (CDC).

[10]  Nikolai Matni,et al.  A System-Level Approach to Controller Synthesis , 2016, IEEE Transactions on Automatic Control.

[11]  Petre Stoica,et al.  Decentralized Control , 2018, The Control Systems Handbook.

[12]  Nikolai Matni,et al.  On the Sample Complexity of the Linear Quadratic Regulator , 2017, Foundations of Computational Mathematics.

[13]  Erik Weyer,et al.  Finite sample properties of system identification methods , 2002, IEEE Trans. Autom. Control..

[14]  Pieter Abbeel,et al.  Benchmarking Deep Reinforcement Learning for Continuous Control , 2016, ICML.

[15]  Sergey Levine,et al.  Goal-driven dynamics learning via Bayesian optimization , 2017, 2017 IEEE 56th Annual Conference on Decision and Control (CDC).

[16]  W. Marsden I and J , 2012 .

[17]  B. Dumitrescu Positive Trigonometric Polynomials and Signal Processing Applications , 2007 .

[18]  Sergey Levine,et al.  End-to-End Training of Deep Visuomotor Policies , 2015, J. Mach. Learn. Res..

[19]  Sham M. Kakade,et al.  Global Convergence of Policy Gradient Methods for Linearized Control Problems , 2018, ICML 2018.