论文信息 - Subspaces Indexing Model on Grassmann Manifold for Image Search

Subspaces Indexing Model on Grassmann Manifold for Image Search

Conventional linear subspace learning methods like principal component analysis (PCA), linear discriminant analysis (LDA) derive subspaces from the whole data set. These approaches have limitations in the sense that they are linear while the data distribution we are trying to model is typically nonlinear. Moreover, these algorithms fail to incorporate local variations of the intrinsic sample distribution manifold. Therefore, these algorithms are ineffective when applied on large scale datasets. Kernel versions of these approaches can alleviate the problem to certain degree but face a serious computational challenge when data set is large, where the computing involves Eigen/QP problems of size N × N. When N is large, kernel versions are not computationally practical. To tackle the aforementioned problems and improve recognition/searching performance, especially on large scale image datasets, we propose a novel local subspace indexing model for image search termed Subspace Indexing Model on Grassmann Manifold (SIM-GM). SIM-GM partitions the global space into local patches with a hierarchical structure; the global model is, therefore, approximated by piece-wise linear local subspace models. By further applying the Grassmann manifold distance, SIM-GM is able to organize localized models into a hierarchy of indexed structure, and allow fast query selection of the optimal ones for classification. Our proposed SIM-GM enjoys a number of merits: 1) it is able to deal with a large number of training samples efficiently; 2) it is a query-driven approach, i.e., it is able to return an effective local space model, so the recognition performance could be significantly improved; 3) it is a common framework, which can incorporate many learning algorithms. Theoretical analysis and extensive experimental results confirm the validity of this model.

[1] Xindong Wu,et al. Manifold elastic net: a unified framework for sparse dimension reduction , 2010, Data Mining and Knowledge Discovery.

[2] Gilbert Strang,et al. Computational Science and Engineering , 2007 .

[3] Ian T. Jolliffe,et al. Principal Component Analysis , 2002, International Encyclopedia of Statistical Science.

[4] Michael I. Jordan,et al. Learning with Mixtures of Trees , 2001, J. Mach. Learn. Res..

[5] Xuelong Li,et al. General Tensor Discriminant Analysis and Gabor Features for Gait Recognition , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6] Pengfei Shi,et al. Kernel Grassmannian distances and discriminant analysis for face recognition from image sets , 2009, Pattern Recognit. Lett..

[7] Aggelos K. Katsaggelos,et al. Locally Embedded Linear Subspaces for Efficient Video Indexing and Retrieval , 2006, 2006 IEEE International Conference on Multimedia and Expo.

[8] Daniel D. Lee,et al. Grassmann discriminant analysis: a unifying view on subspace-based learning , 2008, ICML '08.

[9] Bernhard Schölkopf,et al. A kernel view of the dimensionality reduction of manifolds , 2004, ICML.

[10] Aggelos K. Katsaggelos,et al. Fast video shot retrieval based on trace geometry matching , 2005 .

[11] I. Jolliffe. Principal Component Analysis , 2002 .

[12] Meng Wang,et al. MSRA-MM 2.0: A Large-Scale Web Multimedia Dataset , 2009, 2009 IEEE International Conference on Data Mining Workshops.

[13] Xuelong Li,et al. Geometric Mean for Subspace Selection , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[14] Bülent Sankur,et al. Subspace methods for retrieval of general 3D models , 2010, Comput. Vis. Image Underst..

[15] Zhigang Luo,et al. Manifold Regularized Discriminative Nonnegative Matrix Factorization With Fast Gradient Descent , 2011, IEEE Transactions on Image Processing.

[16] Dacheng Tao,et al. Max-Min Distance Analysis by Using Sequential SDP Relaxation for Dimension Reduction , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[17] Jiawei Han,et al. Orthogonal Laplacianfaces for Face Recognition , 2006, IEEE Transactions on Image Processing.

[18] Bülent Sankur,et al. Similarity Learning for 3D Object Retrieval Using Relevance Feedback and Risk Minimization , 2010, International Journal of Computer Vision.

[19] Daniel D. Lee,et al. Subspace-based learning with grassmann kernels , 2008 .

[20] V. Strassen. Gaussian elimination is not optimal , 1969 .

[21] Xuelong Li,et al. Patch Alignment for Dimensionality Reduction , 2009, IEEE Transactions on Knowledge and Data Engineering.

[22] Ying Wu,et al. Query Driven Localized Linear Discriminant Models for Head Pose Estimation , 2007, 2007 IEEE International Conference on Multimedia and Expo.

[23] Libor Spacek,et al. Distinctive Descriptions for Face Processing , 1997, BMVC.

[24] David J. Kriegman,et al. Eigenfaces vs. Fisherfaces: Recognition Using Class Specific Linear Projection , 1996, ECCV.

[25] Mikhail Belkin,et al. Manifold Regularization: A Geometric Framework for Learning from Labeled and Unlabeled Examples , 2006, J. Mach. Learn. Res..

[26] Lawrence K. Saul,et al. Think Globally, Fit Locally: Unsupervised Learning of Low Dimensional Manifold , 2003, J. Mach. Learn. Res..

[27] Dacheng Tao,et al. Bregman Divergence-Based Regularization for Transfer Subspace Learning , 2010, IEEE Transactions on Knowledge and Data Engineering.

[28] Yun Fu,et al. Image Classification Using Correlation Tensor Analysis , 2008, IEEE Transactions on Image Processing.

[29] Xiaofei He,et al. Locality Preserving Projections , 2003, NIPS.