Learning nonlinear dynamical systems from a single trajectory

We introduce algorithms for learning nonlinear dynamical systems of the form $x_{t+1}=\sigma(\Theta^{\star}x_t)+\varepsilon_t$, where $\Theta^{\star}$ is a weight matrix, $\sigma$ is a nonlinear link function, and $\varepsilon_t$ is a mean-zero noise process. We give an algorithm that recovers the weight matrix $\Theta^{\star}$ from a single trajectory with optimal sample complexity and linear running time. The algorithm succeeds under weaker statistical assumptions than in previous work, and in particular i) does not require a bound on the spectral norm of the weight matrix $\Theta^{\star}$ (rather, it depends on a generalization of the spectral radius) and ii) enjoys guarantees for non-strictly-increasing link functions such as the ReLU. Our analysis has two key components: i) we give a general recipe whereby global stability for nonlinear dynamical systems can be used to certify that the state-vector covariance is well-conditioned, and ii) using these tools, we extend well-known algorithms for efficiently learning generalized linear models to the dependent setting.
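To make the setup concrete, below is a minimal sketch, not the paper's algorithm verbatim: it rolls out a single trajectory of $x_{t+1}=\sigma(\Theta^{\star}x_t)+\varepsilon_t$ with a ReLU link, then runs GLMtron-style updates (the classical generalized-linear-model learner the abstract refers to extending) on the dependent data. The dimensions, noise scale, trajectory length, iteration count, and the spectral-norm-based stabilization of $\Theta^{\star}$ are illustrative assumptions; the paper itself requires only a weaker stability condition.

```python
# Minimal sketch (illustrative, not the paper's exact algorithm):
# simulate x_{t+1} = relu(Theta_star @ x_t) + eps_t from one trajectory,
# then estimate Theta_star with GLMtron-style updates.
import numpy as np

rng = np.random.default_rng(0)
d, T, n_iters = 5, 5000, 50  # illustrative dimension, horizon, iterations

relu = lambda z: np.maximum(z, 0.0)

# Ground-truth weights, rescaled so spectral norm < 1 (a sufficient but
# stronger stability condition than the paper needs).
Theta_star = rng.standard_normal((d, d))
Theta_star *= 0.9 / np.linalg.norm(Theta_star, 2)

# Roll out a single trajectory with mean-zero Gaussian noise.
X = np.zeros((T + 1, d))
for t in range(T):
    X[t + 1] = relu(Theta_star @ X[t]) + 0.1 * rng.standard_normal(d)

# GLMtron-style updates: no derivative of the link function is used, so a
# non-strictly-increasing link like the ReLU poses no problem for the update.
Theta = np.zeros((d, d))
for _ in range(n_iters):
    residual = X[1:] - relu(X[:-1] @ Theta.T)  # (T, d) one-step prediction errors
    Theta += residual.T @ X[:-1] / T           # averaged matrix update

print("spectral-norm estimation error:", np.linalg.norm(Theta - Theta_star, 2))
```

Running the sketch, the reported error shrinks as $T$ grows, which is the single-trajectory recovery phenomenon the abstract describes; the analysis in the paper is what certifies that the dependent covariates $x_t$ have a well-conditioned covariance, so that such updates make progress.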
