Multiscale Support Vector Learning With Projection Operator Wavelet Kernel for Nonlinear Dynamical System Identification

A giant leap has been made in the past couple of decades with the introduction of kernel-based learning as a mainstay for designing effective nonlinear computational learning algorithms. In view of the geometric interpretation of conditional expectation and the ubiquity of multiscale characteristics in highly complex nonlinear dynamic systems [1]-[3], this paper presents a new orthogonal projection operator wavelet kernel, aiming at developing an efficient computational learning approach for nonlinear dynamical system identification. In the framework of multiresolution analysis, the proposed projection operator wavelet kernel can fulfill the multiscale, multidimensional learning to estimate complex dependencies. The special advantage of the projection operator wavelet kernel developed in this paper lies in the fact that it has a closed-form expression, which greatly facilitates its application in kernel learning. To the best of our knowledge, it is the first closed-form orthogonal projection wavelet kernel reported in the literature. It provides a link between grid-based wavelets and mesh-free kernel-based methods. Simulation studies for identifying the parallel models of two benchmark nonlinear dynamical systems confirm its superiority in model accuracy and sparsity.

[1]  Jun Miura Support Vector Path Planning , 2006, 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[2]  Doreen Eichel,et al.  Learning And Soft Computing Support Vector Machines Neural Networks And Fuzzy Logic Models , 2016 .

[3]  Emil Levi,et al.  Identification of complex systems based on neural and Takagi-Sugeno fuzzy model , 2004, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[4]  G. Fasshauer Positive definite kernels: past, present and future , 2011 .

[5]  Adam Bobrowski,et al.  Functional analysis for probability and stochastic processes , 2013 .

[6]  Gilbert G. Walter,et al.  Wavelets in Closed Forms , 2001 .

[7]  Bernard Delyon,et al.  Nonlinear black-box models in system identification: Mathematical foundations , 1995, Autom..

[8]  Yoshua Bengio,et al.  Scaling learning algorithms towards AI , 2007 .

[9]  Christos Christodoulou,et al.  Support Vector Machines for Antenna Array Processing and Electromagnetics , 2006, Support Vector Machines for Antenna Array Processing and Electromagnetics.

[10]  I. M. Glazman,et al.  Theory of linear operators in Hilbert space , 1961 .

[11]  Zhao Lu,et al.  Linear programming support vector regression with wavelet kernel: A new approach to nonlinear dynamical systems identification , 2009, Math. Comput. Simul..

[12]  E FasshauerG Positive definite kernels: past, present and future , 2011 .

[13]  Zhao Lu,et al.  Linear Programming SVM-ARMA $_{\rm 2K}$ With Application in Engine System Identification , 2011, IEEE Transactions on Automation Science and Engineering.

[14]  Bo-Suk Yang,et al.  Wavelet support vector machine for induction machine fault diagnosis based on transient current signal , 2008, Expert Syst. Appl..

[15]  Y. Meyer,et al.  Wavelets: Calderón-Zygmund and Multilinear Operators , 1997 .

[16]  Gianluca Bontempi,et al.  Conditionally dependent strategies for multiple-step-ahead prediction in local learning , 2011 .

[17]  Zhao Lu,et al.  Non-Mercer hybrid kernel for linear programming support vector regression in nonlinear systems identification , 2009, Appl. Soft Comput..

[18]  Li Zhang,et al.  Wavelet support vector machine , 2004, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[19]  Sujit K. Ghosh,et al.  Essential Wavelets for Statistical Applications and Data Analysis , 2001, Technometrics.

[20]  A. Zayed,et al.  On the Use of Green's Function in Sampling Theory , 1998 .

[21]  Jochen Krebs,et al.  Support vector regression for the solution of linear integral equations , 2011 .

[22]  J. Lakey,et al.  Duration and Bandwidth Limiting: Prolate Functions, Sampling, and Applications , 2011 .

[23]  G. Weiss,et al.  Band-limited wavelets , 1993 .

[24]  Jun Zhang,et al.  Orthonormal wavelets with simple closed-form expressions , 1998, IEEE Trans. Signal Process..

[25]  Qi Wu,et al.  The forecasting model based on wavelet nu-support vector machine , 2009, Expert Syst. Appl..

[26]  R.G. Baraniuk,et al.  Compressive Sensing [Lecture Notes] , 2007, IEEE Signal Processing Magazine.

[27]  Massimo Fornasier,et al.  Compressive Sensing , 2015, Handbook of Mathematical Methods in Imaging.

[28]  S. Billings,et al.  Long term prediction of non-linear time series using multiresolution wavelet models , 2006 .

[29]  Hava T. Siegelmann,et al.  Computational capabilities of recurrent NARX neural networks , 1997, IEEE Trans. Syst. Man Cybern. Part B.

[30]  Hong Yan,et al.  Framelet Kernels With Applications to Support Vector Regression and Regularization Networks , 2010, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[31]  S. Frick,et al.  Compressed Sensing , 2014, Computer Vision, A Reference Guide.

[32]  José Luis Rojo-Álvarez,et al.  Support Vector Machines for Nonlinear Kernel ARMA System Identification , 2006, IEEE Transactions on Neural Networks.

[33]  Stéphane Mallat,et al.  A Wavelet Tour of Signal Processing - The Sparse Way, 3rd Edition , 2008 .

[34]  Anna T. Lawniczak,et al.  Features extraction via wavelet kernel PCA for data classification , 2010, 2010 IEEE International Workshop on Machine Learning for Signal Processing.

[35]  Charalambos D. Aliprantis,et al.  An invitation to operator theory , 2002 .

[36]  I. M. Glazman,et al.  Finite-Dimensional Linear Analysis: A Systematic Presentation in Problem Form , 1974 .

[37]  G. Walter,et al.  Wavelets and Other Orthogonal Systems , 2018 .

[38]  Chris Chatfteld,et al.  Wavelet Transforms and Time-Frequency Signal Analysis , 2002, Technometrics.

[39]  Gunnar Rätsch,et al.  An introduction to kernel-based learning algorithms , 2001, IEEE Trans. Neural Networks.

[40]  Peter Tiño,et al.  Learning long-term dependencies in NARX recurrent neural networks , 1996, IEEE Trans. Neural Networks.

[41]  Karsten Urban,et al.  Wavelet Methods for Elliptic Partial Differential Equations , 2008 .

[42]  O. Nelles Nonlinear System Identification: From Classical Approaches to Neural Networks and Fuzzy Models , 2000 .

[43]  Bernhard Schölkopf,et al.  A tutorial on support vector regression , 2004, Stat. Comput..

[44]  Zhao Lu,et al.  Multiscale Asymmetric Orthogonal Wavelet Kernel for Linear Programming Support Vector Learning and Nonlinear Dynamic Systems Identification , 2014, IEEE Transactions on Cybernetics.

[45]  Anthony Widjaja,et al.  Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond , 2003, IEEE Transactions on Neural Networks.

[46]  Oliver Nelles On the identification with neural networks as series-parallel and parallel modells , 1995 .

[47]  Anthony N. Michel,et al.  Applied Algebra and Functional Analysis , 2011 .

[48]  Lennart Ljung,et al.  Nonlinear black-box modeling in system identification: a unified overview , 1995, Autom..

[49]  Peter Wittek,et al.  Compactly Supported Basis Functions as Support Vector Kernels for Classification , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[50]  Nong Zhang,et al.  Application of evolving Takagi-Sugeno fuzzy model to nonlinear system identification , 2008, Appl. Soft Comput..

[51]  Nello Cristianini,et al.  An Introduction to Support Vector Machines and Other Kernel-based Learning Methods , 2000 .

[52]  C. B. Huijsmans,et al.  Characterizations of conditional expectation-type operators. , 1988 .

[53]  Bart M. ter Haar Romeny,et al.  Front-End Vision and Multi-Scale Image Analysis , 2003, Computational Imaging and Vision.

[54]  Sridhar Krishnan,et al.  Wavelet Kernel Principal Component Analysis in Noisy Multiscale Data Classification , 2012 .

[55]  Chris J. Harris,et al.  On the modelling of nonlinear dynamic systems using support vector neural networks , 2001 .

[56]  A General Sampling Theorem Associated with Differential Operators , 1999 .

[57]  M. Small,et al.  Towards long-term prediction , 2000 .

[58]  Xiaodong Wang,et al.  Robust identification of non-linear dynamic systems using support vector machine , 2006 .

[59]  John L. Junkins,et al.  Multi-Resolution Methods for Modeling and Control of Dynamical Systems , 2008 .

[60]  A. Gretton,et al.  Support vector regression for black-box system identification , 2001, Proceedings of the 11th IEEE Signal Processing Workshop on Statistical Signal Processing (Cat. No.01TH8563).

[61]  Guangyi Chen,et al.  Pattern recognition with SVM and dual-tree complex wavelets , 2007, Image Vis. Comput..

[62]  Robert Schaback,et al.  Nonstandard Kernels and their Applications , 2009 .

[63]  Holger Wendland,et al.  Kernel techniques: From machine learning to meshless methods , 2006, Acta Numerica.

[64]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[65]  Roland Opfer,et al.  Multiscale kernels , 2006, Adv. Comput. Math..