Recursive multikernel filters exploiting nonlinear temporal structure

In kernel methods, temporal information on the data is commonly included by using time-delayed embeddings as inputs. Recently, an alternative formulation was proposed by defining a γ-filter explicitly in a reproducing kernel Hilbert space, giving rise to a complex model where multiple kernels operate on different temporal combinations of the input signal. In the original formulation, the kernels are then simply combined to obtain a single kernel matrix (for instance by averaging), which provides computational benefits but discards important information on the temporal structure of the signal. Inspired by works on multiple kernel learning, we overcome this drawback by considering the different kernels separately. We propose an efficient strategy to adaptively combine and select these kernels during the training phase. The resulting batch and online algorithms automatically learn to process highly nonlinear temporal information extracted from the input signal, which is implicitly encoded in the kernel values. We evaluate our proposal on several artificial and real tasks, showing that it can outperform classical approaches both in batch and online settings.

[1]  José Luis Rojo-Álvarez,et al.  Explicit Recursive and Adaptive Filtering in Reproducing Kernel Hilbert Spaces , 2014, IEEE Transactions on Neural Networks and Learning Systems.

[2]  Masahiro Yukawa,et al.  Online model selection and learning by multikernel adaptive filtering , 2013, 21st European Signal Processing Conference (EUSIPCO 2013).

[3]  Wei Xing Zheng,et al.  Identification of a Class of Nonlinear Autoregressive Models With Exogenous Inputs Based on Kernel Machines , 2011, IEEE Transactions on Signal Processing.

[4]  José Carlos Príncipe,et al.  The gamma-filter-a new class of adaptive IIR filters with restricted feedback , 1993, IEEE Trans. Signal Process..

[5]  Shie Mannor,et al.  The kernel recursive least-squares algorithm , 2004, IEEE Transactions on Signal Processing.

[6]  José Luis Rojo-Álvarez,et al.  Support Vector Machines for Nonlinear Kernel ARMA System Identification , 2006, IEEE Transactions on Neural Networks.

[7]  Benjamin Schrauwen,et al.  Recurrent Kernel Machines: Computing with Infinite Echo State Networks , 2012, Neural Computation.

[8]  Ethem Alpaydin,et al.  Multiple Kernel Learning Algorithms , 2011, J. Mach. Learn. Res..

[9]  Masahiro Yukawa,et al.  Adaptive Learning in Cartesian Product of Reproducing Kernel Hilbert Spaces , 2014, IEEE Transactions on Signal Processing.

[10]  Janez Demsar,et al.  Statistical Comparisons of Classifiers over Multiple Data Sets , 2006, J. Mach. Learn. Res..

[11]  Masahiro Yukawa,et al.  Multikernel Adaptive Filtering , 2012, IEEE Transactions on Signal Processing.

[12]  Floris Ernst,et al.  Compensating for Quasi-periodic Motion in Robotic Radiosurgery , 2011 .

[13]  José Carlos Príncipe,et al.  Mixture kernel least mean square , 2013, The 2013 International Joint Conference on Neural Networks (IJCNN).

[14]  Kan Li,et al.  The Kernel Adaptive Autoregressive-Moving-Average Algorithm , 2016, IEEE Transactions on Neural Networks and Learning Systems.

[15]  J. Suykens,et al.  Recurrent least squares support vector machines , 2000 .

[16]  Badong Chen,et al.  A FIXED-BUDGET QUANTIZED KERNEL LEAST MEAN SQUARE ALGORITHM , 2012 .

[17]  S. Haykin,et al.  Kernel Least‐Mean‐Square Algorithm , 2010 .

[18]  Bernard Zenko,et al.  Is Combining Classifiers with Stacking Better than Selecting the Best One? , 2004, Machine Learning.

[19]  Geoffrey E. Hinton,et al.  Deep Learning , 2015, Nature.

[20]  Leo Breiman,et al.  Stacked regressions , 2004, Machine Learning.

[21]  R. Schaback,et al.  Recursive Kernels , 2009 .

[22]  Danilo Comminiello,et al.  Diffusion spline adaptive filtering , 2016, 2016 24th European Signal Processing Conference (EUSIPCO).

[23]  Miguel Lázaro-Gredilla,et al.  Kernel Recursive Least-Squares Tracker for Time-Varying Regression , 2012, IEEE Transactions on Neural Networks and Learning Systems.

[24]  F. Girosi,et al.  Nonlinear prediction of chaotic time series using support vector machines , 1997, Neural Networks for Signal Processing VII. Proceedings of the 1997 IEEE Signal Processing Society Workshop.

[25]  Ali H. Sayed,et al.  Combinations of Adaptive Filters: Performance and convergence properties , 2021, IEEE Signal Processing Magazine.

[26]  Richard D. Braatz,et al.  On the "Identification and control of dynamical systems using neural networks" , 1997, IEEE Trans. Neural Networks.