Statistical Results on Filtering and Epi-convergence for Learning-Based Model Predictive Control

Abstract : Learning-based model predictive control (LBMPC) is a technique that provides deterministic guarantees on robustness, while statistical identification tools are used to identify richer models of the system in order to improve performance. This technical note provides a result that elucidates the reasons for the choice of measurement model used with LBMPC, and it gives proofs concerning the stochastic convergence of LBMPC. The first part of this note discusses simultaneous state estimation and statistical identification (or learning) of unmodeled dynamics, for dynamical systems that can be described by ordinary differential equations (ODE's). The second part provides proofs concerning the epi-convergence of different statistical estimators that can be used with the LBMPC technique. In particular, we prove results on the statistical properties of a nonparametric estimator that we have designed to have the correct deterministic and stochastic properties for numerical implementation when used in conjunction with LBMPC.

[1]  E. Malinvaud The Consistency of Nonlinear Regressions , 1970 .

[2]  Jianqing Fan,et al.  Data‐Driven Bandwidth Selection in Local Polynomial Fitting: Variable Bandwidth and Spatial Adaptation , 1995 .

[3]  W. Rudin Principles of mathematical analysis , 1964 .

[4]  Petr Lachout,et al.  On continuous convergence and epi-convergence of random functions. Part II: Sufficient conditions and applications , 2003, Kybernetika.

[5]  Karl Johan Åström,et al.  BOOK REVIEW SYSTEM IDENTIFICATION , 1994, Econometric Theory.

[6]  Petr Lachout,et al.  On continuous convergence and epi-convergence of random functions. Part I: Theory and relations , 2003, Kybernetika.

[7]  R. F.,et al.  Mathematical Statistics , 1944, Nature.

[8]  A. V. D. Vaart,et al.  Asymptotic Statistics: U -Statistics , 1998 .

[9]  Lennart Ljung,et al.  System Identification: Theory for the User , 1987 .

[10]  K. Ball CONVEX BODIES: THE BRUNN–MINKOWSKI THEORY , 1994 .

[11]  S. Shankar Sastry,et al.  Provably safe and robust learning-based model predictive control , 2011, Autom..

[12]  R. Jennrich Asymptotic Properties of Non-Linear Least Squares Estimators , 1969 .

[13]  M. Wand,et al.  Multivariate Locally Weighted Least Squares Regression , 1994 .

[14]  R. Rajendiran,et al.  Topological Spaces , 2019, A Physicist's Introduction to Algebraic Structures.

[15]  A. Savitzky,et al.  Smoothing and Differentiation of Data by Simplified Least Squares Procedures. , 1964 .

[16]  H. Müller Weighted Local Regression and Kernel Methods for Nonparametric Curve Fitting , 1987 .

[17]  Claire J. Tomlin,et al.  Extensions of learning-based model predictive control for real-time application to a quadrotor helicopter , 2012, 2012 American Control Conference (ACC).

[18]  H. Robbins,et al.  Strong consistency of least squares estimates in multiple regression. , 1979, Proceedings of the National Academy of Sciences of the United States of America.