Robust Online Control with Model Misspecification

We study online control of an unknown nonlinear dynamical system that is approximated by a time-invariant linear system with model misspecification. Our study focuses on robustness , a measure of how much deviation from the assumed linear approximation can be tolerated by a controller while maintaining finite ℓ 2 -gain. A basic methodology to analyze robustness is via the small gain theorem. However, as an implication of recent lower bounds on adaptive control, this method can only yield robustness that is exponentially small in the dimension of the system and its parametric uncertainty. The work of Cusumano and Poolla (1988a) shows that much better robustness can be obtained, but the control algorithm is inefficient, taking exponential time in the worst case. In this paper we investigate whether there exists an efficient algorithm with provable robustness beyond the small gain theorem. We demonstrate that for a fully actuated system, this is indeed attainable. We give an efficient controller that can tolerate robustness that is polynomial in the dimension and independent of the parametric uncertainty; furthermore, the controller obtains an ℓ 2 -gain whose dimension dependence is near optimal.

[1]  G. Zames On the input-output stability of time-varying nonlinear feedback systems Part one: Conditions derived using concepts of loop gain, conicity, and positivity , 1966 .

[2]  L. Valavani,et al.  Robustness of adaptive control algorithms in the presence of unmodeled dynamics , 1982, 1982 21st IEEE Conference on Decision and Control.

[3]  Anuradha M. Annaswamy,et al.  Robust Adaptive Control , 1984, 1984 American Control Conference.

[4]  K. Poolla,et al.  Nonlinear feedback vs. linear feedback for robust stabilization , 1988, Proceedings of the 27th IEEE Conference on Decision and Control.

[5]  Dennis S. Bernstein,et al.  Is there More to Robust Control Theory than Small Gain? , 1992, 1992 American Control Conference.

[6]  Roman Vershynin,et al.  Introduction to the non-asymptotic analysis of random matrices , 2010, Compressed Sensing.

[7]  Mi-Ching Tsai,et al.  Robust and Optimal Control , 2014 .

[8]  Gang Tao,et al.  Multivariable adaptive control: A survey , 2014, Autom..

[9]  Michael I. Jordan,et al.  Learning Without Mixing: Towards A Sharp Analysis of Linear System Identification , 2018, COLT.

[10]  Avinatan Hassidim,et al.  Online Linear Quadratic Control , 2018, ICML.

[11]  Max Simchowitz,et al.  Learning Linear Dynamical Systems with Semi-Parametric Least Squares , 2019, COLT.

[12]  Ambuj Tewari,et al.  Finite Time Adaptive Stabilization of LQ Systems , 2018, ArXiv.

[13]  Alexander Rakhlin,et al.  Near optimal finite time identification of arbitrary linear dynamical systems , 2018, ICML.

[14]  Sham M. Kakade,et al.  Online Control with Adversarial Disturbances , 2019, ICML.

[15]  Samet Oymak,et al.  Non-asymptotic Identification of LTI Systems from a Single Trajectory , 2018, 2019 American Control Conference (ACC).

[16]  S. Kakade,et al.  The Nonstochastic Control Problem , 2019, ALT.

[17]  Karan Singh,et al.  No-Regret Prediction in Marginally Stable Systems , 2020, COLT.

[18]  Soon-Jo Chung,et al.  Online Optimization with Memory and Competitive Control , 2020, NeurIPS.

[19]  S. Kakade,et al.  Information Theoretic Regret Bounds for Online Nonlinear Control , 2020, NeurIPS.

[20]  Akshay Krishnamurthy,et al.  Learning the Linear Quadratic Regulator from Nonlinear Observations , 2020, NeurIPS.

[21]  Max Simchowitz,et al.  Improper Learning for Non-Stochastic Control , 2020, COLT.

[22]  Yisong Yue,et al.  Competitive Control with Delayed Imperfect Information , 2020, 2022 American Control Conference (ACC).

[23]  Spencer M. Richards,et al.  Adaptive Robust Model Predictive Control with Matched and Unmatched Uncertainty , 2021, 2022 American Control Conference (ACC).

[24]  Elad Hazan,et al.  Black-Box Control for Linear Dynamical Systems , 2020, COLT.

[25]  Jean-Jacques E. Slotine,et al.  Regret Bounds for Adaptive Nonlinear Control , 2020, L4DC.

[26]  Naman Agarwal,et al.  A Regret Minimization Approach to Iterative Learning Control , 2021, ICML.

[27]  Babak Hassibi,et al.  Competitive Control , 2021, ArXiv.

[28]  Michael I. Jordan,et al.  Active Learning for Nonlinear System Identification with Guarantees , 2020, J. Mach. Learn. Res..

[29]  A. Megretski Lower and upper bounds for optimal L 2 gain nonlinear robust control of first order linear system , 2022 .