Accurate Probabilistic Error Bound for Eigenvalues of Kernel Matrix

The eigenvalues of the kernel matrix play an important role in a number of kernel methods. It is well known that these eigenvalues converge as the number of samples tends to infinity. We derive a probabilistic finite sample size bound on the approximation error of an individual eigenvalue, which has the important property that the bound scales with the dominate eigenvalue under consideration, reflecting the accurate behavior of the approximation error as predicted by asymptotic results and observed in numerical simulations. Under practical conditions, the bound presented here forms a significant improvement over existing non-scaling bound. Applications of this theoretical finding in kernel matrix selection and kernel target alignment are also presented.

[1]  V. Koltchinskii,et al.  Random matrix approximation of spectra of integral operators , 2000 .

[2]  Mikio L. Braun,et al.  Accurate Error Bounds for the Eigenvalues of the Kernel Matrix , 2006, J. Mach. Learn. Res..

[3]  Shawn Martin,et al.  The Numerical Stability of Kernel Methods , 2006, AI&M.

[4]  Alexander Gammerman,et al.  Ridge Regression Learning Algorithm in Dual Variables , 1998, ICML.

[5]  L. Trefethen,et al.  Numerical linear algebra , 1997 .

[6]  Bernhard Schölkopf,et al.  Kernel Principal Component Analysis , 1997, International Conference on Artificial Neural Networks.

[7]  Nello Cristianini,et al.  Kernel Methods for Pattern Analysis , 2003, ICTAI.

[8]  Michael I. Jordan,et al.  Learning with Mixtures of Trees , 2001, J. Mach. Learn. Res..

[9]  Mikio L. Braun,et al.  Spectral properties of the kernel matrix and their relation to kernel methods in machine learning , 2005 .

[10]  Charles R. Johnson,et al.  Matrix analysis , 1985, Statistical Inference for Engineers and Data Scientists.

[11]  Michael I. Jordan,et al.  Link Analysis, Eigenvectors and Stability , 2001, IJCAI.

[12]  Nello Cristianini,et al.  On the eigenspectrum of the gram matrix and the generalization error of kernel-PCA , 2005, IEEE Transactions on Information Theory.

[13]  Nello Cristianini,et al.  On the Concentration of Spectral Properties , 2001, NIPS.

[14]  V. Koltchinskii Asymptotics of Spectral Projections of Some Random Matrices Approximating Integral Operators , 1998 .

[15]  Yoshua Bengio,et al.  Spectral Clustering and Kernel PCA are Learning Eigenfunctions , 2003 .

[16]  N. Cristianini,et al.  On Kernel-Target Alignment , 2001, NIPS.

[17]  Gilles Blanchard,et al.  On the Convergence of Eigenspaces in Kernel Principal Component Analysis , 2005, NIPS.