Non-linear State-space Model Identification from Video Data using Deep Encoders

Identifying systems with high-dimensional inputs and outputs, such as systems measured by video streams, is a challenging problem with numerous applications in robotics, autonomous vehicles and medical imaging. In this paper, we propose a novel non-linear state-space identification method starting from high-dimensional input and output data. Multiple computational and conceptual advances are combined to handle the high-dimensional nature of the data. An encoder function, represented by a neural network, is introduced to learn a reconstructability map to estimate the model states from past inputs and outputs. This encoder function is jointly learned with the dynamics. Furthermore, multiple computational improvements, such as an improved reformulation of multiple shooting and batch optimization, are proposed to keep the computational time under control when dealing with high-dimensional and large datasets. We apply the proposed method to a video stream of a simulated environment of a controllable ball in a unit box. The simulation study shows low simulation error with excellent long term prediction for the obtained model using the proposed method.

[1]  Thomas B. Schön,et al.  Learning deep dynamical models from image pixels , 2014, ArXiv.

[2]  片山 徹 Subspace methods for system identification , 2005 .

[3]  Maarten Schoukens,et al.  On the Initialization of Nonlinear LFR Model Identification with the Best Linear Approximation , 2020, ArXiv.

[4]  Alberto Bemporad,et al.  Learning Nonlinear State-Space Models Using Deep Autoencoders , 2018, 2018 IEEE Conference on Decision and Control (CDC).

[5]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[6]  Brett Ninness,et al.  Robust maximum-likelihood estimation of multivariable dynamic systems , 2005, Autom..

[7]  Toshiyuki Ohtsuka,et al.  Model structure simplification of Nonlinear Systems via immersion , 2005, IEEE Transactions on Automatic Control.

[8]  Thomas B. Schön,et al.  From Pixels to Torques: Policy Learning with Deep Dynamical Models , 2015, ICML 2015.

[9]  Marco Forgione,et al.  Integrated Neural Networks for Nonlinear Continuous-Time System Identification , 2020, IEEE Control Systems Letters.

[10]  Thomas Bo Schön,et al.  Variational Nonlinear State Estimation. , 2020 .

[11]  Andrew R. Barron,et al.  Universal approximation bounds for superpositions of a sigmoidal function , 1993, IEEE Trans. Inf. Theory.

[12]  Oliver Nelles,et al.  Iterative Excitation Signal Design for Nonlinear Dynamic Black-Box Models , 2017, KES.

[13]  J. Schoukens,et al.  Tuning nonlinear state-space models using unconstrained multiple shooting , 2020, IFAC-PapersOnLine.

[14]  Catholijn M. Jonker,et al.  Model-based Reinforcement Learning: A Survey , 2020, ArXiv.

[15]  Bart De Moor,et al.  Subspace Identification for Linear Systems: Theory ― Implementation ― Applications , 2011 .