Paper Information - Bayesian Robust PCA of Incomplete Data

We present a probabilistic model for robust principal component analysis (PCA) in which the observation noise is modelled by Student-t distributions that are independent for different data dimensions. A heavy-tailed noise distribution is used to reduce the negative effect of outliers. The intractability of posterior evaluation is addressed using variational Bayesian approximation methods. We show experimentally that the proposed model can be a useful tool for PCA preprocessing of incomplete noisy data. We also demonstrate that the assumed noise model can yield more accurate reconstructions of missing values: corrupted dimensions of a "bad" sample may be reconstructed well from other dimensions of the same data vector. The model was motivated by a real-world weather dataset, which was used to compare the proposed technique with relevant probabilistic PCA models.
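The noise assumption described in the abstract can be illustrated with a minimal generative sketch. It is not the paper's variational inference procedure, only one way to simulate incomplete data whose noise is Student-t and independent per dimension, using the standard Gaussian scale-mixture representation of the Student-t distribution. The function name `sample_robust_pca` and the parameters `W`, `mu`, `nu`, `tau`, and `missing_rate` are assumptions made for illustration. The abstract does not give the paper's exact parameterisation or priors.

```python
import numpy as np

def sample_robust_pca(n_samples, W, mu, nu, tau, missing_rate, rng):
    """Simulate incomplete data from a PCA model with Student-t noise.

    Hypothetical helper: W is the (d, k) loading matrix, mu the (d,) mean,
    nu the degrees of freedom, tau the noise precision.
    """
    d, k = W.shape
    x = rng.standard_normal((n_samples, k))  # latent components

    # Student-t noise as a Gaussian scale mixture. Each entry gets its own
    # scale u, so one dimension of a sample can be an outlier while the
    # other dimensions of the same sample stay clean.
    u = rng.gamma(shape=nu / 2.0, scale=2.0 / nu, size=(n_samples, d))
    noise = rng.standard_normal((n_samples, d)) / np.sqrt(tau * u)

    y = x @ W.T + mu + noise

    # Mark entries as missing completely at random (an assumption).
    observed = rng.random((n_samples, d)) >= missing_rate
    y_incomplete = np.where(observed, y, np.nan)
    return y_incomplete, observed, x

rng = np.random.default_rng(0)
W = rng.standard_normal((5, 2))
mu = np.zeros(5)
Y, observed, X = sample_robust_pca(
    200, W, mu, nu=3.0, tau=4.0, missing_rate=0.2, rng=rng
)
```

Because the scale `u` is drawn per entry rather than per sample, a corrupted dimension does not inflate the noise of the remaining dimensions. This is the property the abstract relies on when it says that corrupted dimensions may be reconstructed from the other dimensions of the same data vector.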
