Online Learning with Self-tuned Gaussian Kernels: Good Kernel-initialization by Multiscale Screening

We propose an efficient adaptive update method for the kernel parameters: the kernel coefficients, scales and centers. The mirror descent and the steepest descent method for squared error cost function are employed to update the kernel scales and centers, respectively. Although the problem considered in this paper is nonconvex, we reduce the possibility of falling into local minima by using a novel multiple initialization scheme to grow the dictionary without great increases of the dictionary size. Through computer experiments, we show that the proposed algorithm enjoys a high adaptation-capability while maintaining a small dictionary size, without detailed tuning of the initial kernel parameters.

[1]  Toshihisa Tanaka,et al.  Doubly adaptive kernel filtering , 2017, 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC).

[2]  N. Aronszajn Theory of Reproducing Kernels. , 1950 .

[3]  John Darzentas,et al.  Problem Complexity and Method Efficiency in Optimization , 1983 .

[4]  Weifeng Liu,et al.  The Kernel Least-Mean-Square Algorithm , 2008, IEEE Transactions on Signal Processing.

[5]  Toshihisa Tanaka,et al.  Dictionary Learning for Gaussian Kernel Adaptive Filtering with Variablekernel Center and Width , 2018, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[6]  Badong Chen,et al.  Quantized Kernel Least Mean Square Algorithm , 2012, IEEE Transactions on Neural Networks and Learning Systems.

[7]  Sergios Theodoridis,et al.  Online Kernel-Based Classification Using Adaptive Projection Algorithms , 2008, IEEE Transactions on Signal Processing.

[8]  Alexander J. Smola,et al.  Online learning with kernels , 2001, IEEE Transactions on Signal Processing.

[9]  Miguel Lázaro-Gredilla,et al.  Kernel Recursive Least-Squares Tracker for Time-Varying Regression , 2012, IEEE Transactions on Neural Networks and Learning Systems.

[10]  Isao Yamada,et al.  A sparse adaptive filtering using time-varying soft-thresholding techniques , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[11]  Masahiro Yukawa,et al.  Online Model-Selection and Learning for Nonlinear Estimation Based on Multikernel Adaptive Filtering , 2017, IEICE Trans. Fundam. Electron. Commun. Comput. Sci..

[12]  Masahiro Yukawa,et al.  Multikernel Adaptive Filtering , 2012, IEEE Transactions on Signal Processing.

[13]  Masahiro Yukawa,et al.  Online model selection and learning by multikernel adaptive filtering , 2013, 21st European Signal Processing Conference (EUSIPCO 2013).

[14]  Masahiro Yukawa,et al.  Adaptive Nonlinear Estimation Based on Parallel Projection Along Affine Subspaces in Reproducing Kernel Hilbert Space , 2015, IEEE Transactions on Signal Processing.

[15]  Shie Mannor,et al.  The kernel recursive least-squares algorithm , 2004, IEEE Transactions on Signal Processing.

[16]  Isao Yamada,et al.  Sparsity-aware adaptive filters based on ℓp-norm inspired soft-thresholding technique , 2012, 2012 IEEE International Symposium on Circuits and Systems.

[17]  A. Asuncion,et al.  UCI Machine Learning Repository, University of California, Irvine, School of Information and Computer Sciences , 2007 .

[18]  Paul Honeine,et al.  Online Kernel Adaptive Algorithms With Dictionary Adaptation for MIMO Models , 2013, IEEE Signal Processing Letters.

[19]  Masahiro Yukawa,et al.  Adaptive Learning in Cartesian Product of Reproducing Kernel Hilbert Spaces , 2014, IEEE Transactions on Signal Processing.

[20]  Alexander J. Smola,et al.  Learning with kernels , 1998 .

[21]  Jie Chen,et al.  Online Dictionary Learning for Kernel LMS , 2014, IEEE Transactions on Signal Processing.

[22]  Masahiro Yukawa,et al.  Efficient Dictionary-Refining Kernel Adaptive Filter With Fundamental Insights , 2016, IEEE Transactions on Signal Processing.

[23]  Paul Honeine,et al.  Online Prediction of Time Series Data With Kernels , 2009, IEEE Transactions on Signal Processing.

[24]  Nanning Zheng,et al.  Kernel least mean square with adaptive kernel size , 2014, Neurocomputing.