论文信息 - Clustered blockwise PCA for representing visual data

Clustered blockwise PCA for representing visual data

Principal component analysis (PCA) is extensively used in computer vision and image processing. Since it provides the optimal linear subspace in a least-square sense, it has been used for dimensionality reduction and subspace analysis in various domains. However, its scalability is very limited because of its inherent computational complexity. We introduce a new framework for applying PCA to visual data which takes advantage of the spatio-temporal correlation and localized frequency variations that are typically found in such data. Instead of applying PCA to the whole volume of data (complete set of images), we partition the volume into a set of blocks and apply PCA to each block. Then, we group the subspaces corresponding to the blocks and merge them together. As a result, we not only achieve greater efficiency in the resulting representation of the visual data, but also successfully scale PCA to handle large data sets. We present a thorough analysis of the computational complexity and storage benefits of our approach. We apply our algorithm to several types of videos. We show that, in addition to its storage and speed benefits, the algorithm results in a useful representation of the visual data.

[1] Jang-Gyu Lee,et al. On updating the singular value decomposition , 1996, Proceedings of International Conference on Communication Technology. ICCT '96.

[2] Michael Lindenbaum,et al. Partial eigenvalue decomposition of large images using spatial temporal adaptive method , 1995, IEEE Trans. Image Process..

[3] Karl Pearson F.R.S.. LIII. On lines and planes of closest fit to systems of points in space , 1901 .

[4] Amnon Shashua,et al. On Photometric Issues in 3D Visual Recognition from a Single 2D Image , 2004, International Journal of Computer Vision.

[5] Gene H. Golub,et al. Matrix computations , 1983 .

[6] Peter W. Hallinan. A low-dimensional representation of human faces for arbitrary lighting conditions , 1994, 1994 Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[7] Paul E. Debevec,et al. Acquiring the reflectance field of a human face , 2000, SIGGRAPH.

[8] B. V. K. Vijaya Kumar,et al. Efficient Calculation of Primary Images from a Set of Images , 1982, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9] V. N. Bogaevski,et al. Matrix Perturbation Theory , 1991 .

[10] Shai Avidan. EigenSegments: A Spatio-Temporal Decomposition of an Ensemble of Images , 2002, ECCV.

[11] Ronald D. Degroat,et al. Efficient, Numerically Stabilized Rank-One Eigenstructure Up.dating , 1990 .

[12] Horst Bischof,et al. Multiple eigenspaces , 2002, Pattern Recognit..

[13] Alex Pentland,et al. Mixtures of eigenfeatures for real-time structure from texture , 1998, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271).

[14] Eero P. Simoncelli,et al. Natural image statistics and neural representation. , 2001, Annual review of neuroscience.

[15] Pavel Pudil,et al. Introduction to Statistical Pattern Recognition , 2006 .

[16] Paul A. Beardsley,et al. Image-based 3D photography using opacity hulls , 2002, ACM Trans. Graph..

[17] Keinosuke Fukunaga,et al. Introduction to statistical pattern recognition (2nd ed.) , 1990 .

[18] Michael I. Jordan,et al. Advances in Neural Information Processing Systems 30 , 1995 .

[19] Harry Shum,et al. Relighting with the Reflected Irradiance Field: Representation, Sampling and Reconstruction , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[20] Li Zhao,et al. Mosaic image method: a local and global method , 1999, Pattern Recognit..

[21] Amnon Shashua,et al. Principal Component Analysis over Continuous Subspaces and Intersection of Half-Spaces , 2002, ECCV.

[22] S. Eisenstat,et al. A Stable and Efficient Algorithm for the Rank-One Modification of the Symmetric Eigenproblem , 1994, SIAM J. Matrix Anal. Appl..

[23] H. Hotelling. Analysis of a complex of statistical variables into principal components. , 1933 .

[24] L. Sirovich. Turbulence and the dynamics of coherent structures. I. Coherent structures , 1987 .

[25] Ralph R. Martin,et al. Merging and Splitting Eigenspace Models , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[26] Christopher M. Bishop,et al. Bayesian PCA , 1998, NIPS.

[27] B. S. Manjunath,et al. An Eigenspace Update Algorithm for Image Analysis , 1997, CVGIP Graph. Model. Image Process..

[28] Alex Pentland,et al. Probabilistic visual learning for object detection , 1995, Proceedings of IEEE International Conference on Computer Vision.