The quest for the right kernel in Bayesian impulse response identification: The use of OBFs

Kernel-based regularization approaches for impulse response estimation of Linear Time-Invariant (LTI) systems have received a lot of attention recently. The reason is that regularized least-squares estimators may achieve a favorable bias/variance trade-off compared with classical Prediction Error Minimization (PEM) methods. To fully exploit this property, the kernel function needs to capture relevant aspects of the data-generating system at hand. Hence, it is important to design automatic procedures for kernel design based on data or prior knowledge. The kernel models, so far introduced, focus on encoding smoothness and BIBO-stability of the expected impulse response while other properties, like oscillatory behavior or the presence of fast and slow poles, have not been successfully implemented in kernel design. Inspired by the representation theory of dynamical systems, we show how to build stable kernels that are able to capture particular aspects of system dynamics via the use of Orthonormal Basis Functions (OBFs). In particular, desired dynamic properties can be easily encoded via the generating poles of OBFs. Such poles are seen as hyperparameters which are tuned via marginal likelihood optimization. Special cases of our kernel construction include Laguerre, Kautz, and Generalized OBFs (GOBFs)-based kernel structures. Monte-Carlo simulations show that the OBFs-based kernels perform well compared with stable spline/TC kernels, especially for slow systems with dominant poles close to the unit circle. Moreover, the capability of Kautz basis to model resonating systems is also shown.

[1]  Francesca P. Carli,et al.  Efficient algorithms for large scale linear system identification using stable spline estimators , 2012 .

[2]  A. Rukhin Bayes and Empirical Bayes Methods for Data Analysis , 1997 .

[3]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[4]  Mohamed Darwishy,et al.  Perspectives of orthonormal basis functions based kernels in Bayesian system identification? , 2015, 2015 54th IEEE Conference on Decision and Control (CDC).

[5]  Lennart Ljung,et al.  Spectral analysis of the DC kernel for regularized system identification , 2015, 2015 54th IEEE Conference on Decision and Control (CDC).

[6]  Carl E. Rasmussen,et al.  Gaussian processes for machine learning , 2005, Adaptive computation and machine learning.

[7]  Graham C. Goodwin,et al.  Estimated Transfer Functions with Application to Model Order Selection , 1992 .

[8]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[9]  Håkan Hjalmarsson,et al.  Sparse Estimation of Polynomial and Rational , 2014 .

[10]  L. Breiman Better subset regression using the nonnegative garrote , 1995 .

[11]  H. Hjalmarsson,et al.  Identification of Box-Jenkins Models Using Structured ARX Models and Nuclear Norm Relaxation , 2012 .

[12]  R. Tóth,et al.  Bayesian Frequency Domain Identification of LTI Systems with OBFs kernels , 2017 .

[13]  T. Söderström,et al.  Instrumental variable methods for system identification , 1983 .

[14]  Grace Wahba,et al.  Spline Models for Observational Data , 1990 .

[15]  Rik Pintelon,et al.  System Identification: A Frequency Domain Approach , 2012 .

[16]  P. V. D. Hof,et al.  A generalized orthonormal basis for linear dynamical systems , 1993, Proceedings of 32nd IEEE Conference on Decision and Control.

[17]  Lennart Ljung,et al.  On the design of multiple kernels for nonparametric linear system identification , 2014, 53rd IEEE Conference on Decision and Control.

[18]  B. Wahlberg System identification using Laguerre models , 1991 .

[19]  B. Wahlberg System identification using Kautz models , 1994, IEEE Trans. Autom. Control..

[20]  Lennart Ljung,et al.  Kernel methods in system identification, machine learning and function estimation: A survey , 2014, Autom..

[21]  Giuseppe De Nicolao,et al.  A new kernel-based approach for linear system identification , 2010, Autom..

[22]  Roland Toth,et al.  Modeling and Identification of Linear Parameter-Varying Systems , 2010 .

[23]  Lennart Ljung,et al.  System Identification: Theory for the User , 1987 .

[24]  R. Curtain,et al.  Realisation and approximation of linear infinite-dimensional systems with error bounds , 1988 .

[25]  Tomás Oliveira e Silva,et al.  A N-Width Result for the Generalized Orthonormal Basis Function Model , 1996 .

[26]  C. Carmeli,et al.  VECTOR VALUED REPRODUCING KERNEL HILBERT SPACES OF INTEGRABLE FUNCTIONS AND MERCER THEOREM , 2006 .

[27]  Gianluigi Pillonetto,et al.  Bayes and empirical Bayes semi-blind deconvolution using eigenfunctions of a prior covariance , 2007, Autom..

[28]  Alessandro Chiuso,et al.  Prediction error identification of linear systems: A nonparametric Gaussian regression approach , 2011, Autom..

[29]  Henrik Ohlsson,et al.  On the estimation of transfer functions, regularizations and Gaussian processes - Revisited , 2012, Autom..

[30]  Tomas Oliveira,et al.  Rational Orthonormal Functions on the Unit Circle and on the Imaginary Axis, with Applications in System Identification , 1999 .

[31]  H. Akaike A new look at the statistical model identification , 1974 .

[32]  David J. C. MacKay,et al.  Bayesian Interpolation , 1992, Neural Computation.

[33]  Alessandro Chiuso,et al.  Tuning complexity in regularized kernel-based regression and linear system identification: The robustness of the marginal likelihood estimator , 2015, Autom..

[34]  Alessandro Chiuso,et al.  The role of vector autoregressive modeling in predictor-based subspace identification , 2007, Autom..

[35]  Lennart Ljung,et al.  Implementation of algorithms for tuning parameters in regularized least squares problems in system identification , 2013, Autom..

[36]  Lennart Ljung,et al.  Regularized system identification using orthonormal basis functions , 2015, 2015 European Control Conference (ECC).

[37]  R. Pearson Discrete-time Dynamic Models , 1999 .

[38]  J. Mercer Functions of positive and negative type, and their connection with the theory of integral equations , 1909 .

[39]  Giuseppe De Nicolao,et al.  Kernel selection in linear system identification Part I: A Gaussian process perspective , 2011, IEEE Conference on Decision and Control and European Control Conference.

[40]  Roland Tóth,et al.  Asymptotically optimal orthonormal basis functions for LPV system identification , 2009, Autom..

[41]  N. Aronszajn Theory of Reproducing Kernels. , 1950 .