Sliding Window Generalized Kernel Affine Projection Algorithm Using Projection Mappings

Very recently, a solution to the kernel-based online classification problem has been given by the adaptive projected subgradient method (APSM). The resulting algorithm can be viewed as a generalization of the kernel affine projection algorithm (APA) and of the kernel normalized least mean squares (NLMS) algorithm. Furthermore, sparsification of the resulting kernel series expansion was achieved by imposing a closed ball (convex set) constraint on the norm of the classifiers. This paper presents an alternative sparsification method for the APSM approach to the online classification task, based on generating a sequence of linear subspaces in a reproducing kernel Hilbert space (RKHS). To cope with the inherent memory limitations of online systems and to embed tracking capabilities into the design, an upper bound on the dimension of the linear subspaces is imposed. The underlying principle of the design is the notion of projection mappings: classification is performed by metric projection mappings, sparsification is achieved by orthogonal projections, and the online system's memory requirements and tracking are handled by oblique projections. The resulting sparsification scheme shows strong similarities with classical sliding window adaptive schemes. The proposed design is validated on the adaptive equalization problem of a nonlinear communication channel, and is compared with classical and recent stochastic gradient descent techniques, as well as with the APSM solution in which sparsification is performed by a closed ball constraint on the norm of the classifiers.
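To make the ideas above concrete, the following is a minimal Python sketch of a sliding window kernel classifier, not the algorithm proposed in the paper. It assumes a Gaussian kernel, processes a single sample per step (the NLMS-like special case), and updates by a relaxed metric projection onto the halfspace {f : y f(x) >= margin} in the RKHS. The memory bound is enforced by simply discarding the oldest expansion term, which is a crude stand-in for the orthogonal and oblique projections described in the abstract. The names `gaussian_kernel`, `SlidingWindowKernelClassifier`, `window`, `margin`, `step`, and `sigma` are hypothetical and chosen only for illustration.

```python
from collections import deque

import numpy as np


def gaussian_kernel(x, y, sigma=1.0):
    """Gaussian (RBF) kernel; an assumed choice, not fixed by the abstract."""
    d = np.asarray(x, dtype=float) - np.asarray(y, dtype=float)
    return float(np.exp(-np.dot(d, d) / (2.0 * sigma ** 2)))


class SlidingWindowKernelClassifier:
    """Illustrative sketch: f(.) = sum_i a_i k(c_i, .) with at most `window` terms."""

    def __init__(self, window=50, margin=1.0, step=1.0, sigma=1.0):
        self.margin = margin
        self.step = step      # relaxation parameter, assumed to lie in (0, 2)
        self.sigma = sigma
        # deque(maxlen=...) silently drops the oldest term once the buffer is
        # full. This truncation is an assumption made for brevity; the paper
        # uses projection mappings instead.
        self.centers = deque(maxlen=window)
        self.coeffs = deque(maxlen=window)

    def decision(self, x):
        """Evaluate the current kernel expansion at x."""
        return sum(a * gaussian_kernel(c, x, self.sigma)
                   for a, c in zip(self.coeffs, self.centers))

    def update(self, x, y):
        """Relaxed projection onto {f : y * f(x) >= margin}; returns the loss."""
        loss = self.margin - y * self.decision(x)
        if loss <= 0.0:
            return 0.0  # constraint already satisfied, no new term is added
        kxx = gaussian_kernel(x, x, self.sigma)
        self.centers.append(np.asarray(x, dtype=float))
        self.coeffs.append(self.step * y * loss / kxx)
        return loss
```

With `step=1.0` and no term evicted, a single update places the classifier exactly on the boundary of the margin constraint for the current sample, which is the behavior expected of a metric projection onto a halfspace. Once the window is full, evicting the oldest term can break that guarantee, which is one reason a more careful treatment of the memory bound is of interest.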
