论文信息 - Missing Data in Kernel PCA

Missing Data in Kernel PCA

Kernel Principal Component Analysis (KPCA) is a widely used technique for visualisation and feature extraction. Despite its success and flexibility, the lack of a probabilistic interpretation means that some problems, such as handling missing or corrupted data, are very hard to deal with. In this paper we exploit the probabilistic interpretation of linear PCA together with recent results on latent variable models in Gaussian Processes in order to introduce an objective function for KPCA. This in turn allows a principled approach to the missing data problem. Furthermore, this new approach can be extended to reconstruct corrupted test data using fixed kernel feature extractors. The experimental results show strong improvements over widely used heuristics.

Neil D. Lawrence | Guido Sanguinetti | Neil D. Lawrence | G. Sanguinetti

[1] Michael I. Jordan,et al. Advances in Neural Information Processing Systems 30 , 1995 .

[2] Yoshua Bengio,et al. Pattern Recognition and Neural Networks , 1995 .

[3] Christopher K. I. Williams. Computing with Infinite Networks , 1996, NIPS.

[4] Bernhard Schölkopf,et al. Kernel Principal Component Analysis , 1997, ICANN.

[5] Wulfram Gerstner,et al. Artificial Neural Networks — ICANN'97 , 1997, Lecture Notes in Computer Science.

[6] Michael E. Tipping,et al. Mixtures of Principal Component Analysers , 1997 .

[7] Christopher M. Bishop,et al. GTM: The Generative Topographic Mapping , 1998, Neural Computation.

[8] Christopher M. Bishop,et al. Bayesian PCA , 1998, NIPS.

[9] Alexander J. Smola,et al. Learning with kernels , 1998 .

[10] Michael E. Tipping,et al. Probabilistic Principal Component Analysis , 1999 .

[11] Michael E. Tipping. Sparse Kernel Principal Component Analysis , 2000, NIPS.

[12] Neil D. Lawrence,et al. Matching Kernels through K ullback- L eibler Divergence Minimisation , 2004 .

[13] Neil D. Lawrence,et al. Probabilistic Non-linear Principal Component Analysis with Gaussian Process Latent Variable Models , 2005, J. Mach. Learn. Res..