Group Sparse Multiview Patch Alignment Framework With View Consistency for Image Classification

No single feature can satisfactorily characterize the semantic concepts of an image. Multiview learning aims to unify different kinds of features to produce a consensual and efficient representation. This paper redefines part optimization in the patch alignment framework (PAF) and develops a group sparse multiview patch alignment framework (GSM-PAF). The new part optimization considers not only the complementary properties of different views, but also view consistency. In particular, view consistency models the correlations between all possible combinations of any two kinds of view. In contrast to conventional dimensionality reduction algorithms that perform feature extraction and feature selection independently, GSM-PAF enjoys joint feature extraction and feature selection by exploiting l2-norm on the projection matrix to achieve row sparsity, which leads to the simultaneous selection of relevant features and learning transformation, and thus makes the algorithm more discriminative. Experiments on two real-world image data sets demonstrate the effectiveness of GSM-PAF for image classification.

[1]  Cordelia Schmid,et al.  Multimodal semi-supervised learning for image classification , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[2]  TaoDacheng,et al.  Large-Margin Multi-ViewInformation Bottleneck , 2014 .

[3]  Dacheng Tao,et al.  Large-Margin Multi-ViewInformation Bottleneck , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4]  Dacheng Tao,et al.  Discriminative Locality Alignment , 2008, ECCV.

[5]  Chris H. Q. Ding,et al.  Non-negative Tri-factor tensor decomposition with applications , 2012, Knowledge and Information Systems.

[6]  H. Zha,et al.  Principal manifolds and nonlinear dimensionality reduction via tangent space alignment , 2004, SIAM J. Sci. Comput..

[7]  You-Shyang Chen,et al.  Application of rough set classifiers for determining hemodialysis adequacy in ESRD patients , 2012, Knowledge and Information Systems.

[8]  Ulf Brefeld,et al.  Co-EM support vector learning , 2004, ICML.

[9]  Anoop Sarkar,et al.  Corrected Co-training for Statistical Parsers , 2003 .

[10]  Hiroyuki Kaji,et al.  Unsupervised Word-Sense Disambiguation Using Bilingual Comparable Corpora , 2002, IEICE Trans. Inf. Syst..

[11]  Steffen Bickel,et al.  Multi-view clustering , 2004, Fourth IEEE International Conference on Data Mining (ICDM'04).

[12]  Wei Li,et al.  Multi-feature Fusion Face Recognition Based on Kernel Discriminate Local Preserve Projection Algorithm under Smart Environment , 2012, J. Comput..

[13]  Jiawei Han,et al.  Joint Feature Selection and Subspace Learning , 2011, IJCAI.

[14]  Yang Yan,et al.  Semi-supervised fuzzy co-clustering algorithm for document categorization , 2011, Knowledge and Information Systems.

[15]  Philip S. Yu,et al.  A General Model for Multiple View Unsupervised Learning , 2008, SDM.

[16]  Sanjoy Dasgupta,et al.  PAC Generalization Bounds for Co-training , 2001, NIPS.

[17]  Cordelia Schmid,et al.  TagProp: Discriminative metric learning in nearest neighbor models for image auto-annotation , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[18]  Mark J. Huiskes,et al.  The MIR flickr retrieval evaluation , 2008, MIR '08.

[19]  David J. Kriegman,et al.  Eigenfaces vs. Fisherfaces: Recognition Using Class Specific Linear Projection , 1996, ECCV.

[20]  Filip Deleus,et al.  A Connectivity-Based Method for Defining Regions-of-Interest in fMRI Data , 2009, IEEE Transactions on Image Processing.

[21]  Christopher J. C. Burges,et al.  Spectral clustering and transductive learning with multiple views , 2007, ICML '07.

[22]  Nello Cristianini,et al.  Learning the Kernel Matrix with Semidefinite Programming , 2002, J. Mach. Learn. Res..

[23]  Sham M. Kakade,et al.  Multi-view clustering via canonical correlation analysis , 2009, ICML '09.

[24]  J. Tenenbaum,et al.  A global geometric framework for nonlinear dimensionality reduction. , 2000, Science.

[25]  Sukhendu Das,et al.  A Survey of Decision Fusion and Feature Fusion Strategies for Pattern Classification , 2010, IETE Technical Review.

[26]  Trevor Darrell,et al.  Unsupervised feature selection via distributed coding for multi-view object recognition , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[27]  Siu Cheung Hui,et al.  Supervised term weighting centroid-based classifiers for text categorization , 2012, Knowledge and Information Systems.

[28]  Jian Yang,et al.  Feature fusion: parallel strategy vs. serial strategy , 2003, Pattern Recognit..

[29]  Kaizhu Huang,et al.  m-SNE: Multiview Stochastic Neighbor Embedding , 2011, IEEE Trans. Syst. Man Cybern. Part B.

[30]  M. Kloft,et al.  l p -Norm Multiple Kernel Learning , 2011 .

[31]  Charles A. Micchelli,et al.  Learning Convex Combinations of Continuously Parameterized Basic Kernels , 2005, COLT.

[32]  Yongdong Zhang,et al.  Multiview Spectral Embedding , 2010, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[33]  Jieping Ye,et al.  Multi-task Vector Field Learning , 2012, NIPS.

[34]  Daoqiang Zhang,et al.  Multi-view dimensionality reduction via canonical random correlation analysis , 2015, Frontiers of Computer Science.

[35]  Shai Avidan,et al.  Generalized spectral bounds for sparse LDA , 2006, ICML.

[36]  Cheng Soon Ong,et al.  Multiclass multiple kernel learning , 2007, ICML '07.

[37]  Xian-Sheng Hua,et al.  Active Reranking for Web Image Search , 2010, IEEE Transactions on Image Processing.

[38]  Xia De-shen Face Representation and Recognition Using Partial Least Squares Regression , 2008 .

[39]  Nitish Srivastava,et al.  Multimodal learning with deep Boltzmann machines , 2012, J. Mach. Learn. Res..

[40]  Xiaofei He,et al.  Locality Preserving Projections , 2003, NIPS.

[41]  Massimiliano Pontil,et al.  Convex multi-task feature learning , 2008, Machine Learning.

[42]  Gert R. G. Lanckriet,et al.  Learning Multi-modal Similarity , 2010, J. Mach. Learn. Res..

[43]  Meng Wang,et al.  Optimizing multi-graph learning: towards a unified video annotation scheme , 2007, ACM Multimedia.

[44]  Yuxiao Hu,et al.  Face recognition using Laplacianfaces , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[45]  Ramesh C. Jain,et al.  Image annotation by kNN-sparse graph-based label propagation over noisily tagged web images , 2011, TIST.

[46]  Avrim Blum,et al.  The Bottleneck , 2021, Monopsony Capitalism.

[47]  Vladan Velisavljevic,et al.  Multiview Image Coding Using Depth Layers and an Optimized Bit Allocation , 2012, IEEE Transactions on Image Processing.

[48]  Zhi-Hua Zhou,et al.  Tri-training: exploiting unlabeled data using three classifiers , 2005, IEEE Transactions on Knowledge and Data Engineering.

[49]  Zhi-Hua Zhou,et al.  ML-KNN: A lazy learning approach to multi-label learning , 2007, Pattern Recognit..

[50]  George Karypis,et al.  A segment-based approach to clustering multi-topic documents , 2012, Knowledge and Information Systems.

[51]  Jianguo Zhang,et al.  The PASCAL Visual Object Classes Challenge , 2006 .

[52]  Xuelong Li,et al.  Patch Alignment for Dimensionality Reduction , 2009, IEEE Transactions on Knowledge and Data Engineering.

[53]  Wei Jia,et al.  Discriminant sparse neighborhood preserving embedding for face recognition , 2012, Pattern Recognit..

[54]  Claire Cardie,et al.  Limitations of Co-Training for Natural Language Learning from Large Datasets , 2001, EMNLP.

[55]  Feiping Nie,et al.  Efficient and Robust Feature Selection via Joint ℓ2, 1-Norms Minimization , 2010, NIPS.

[56]  Bao-Liang Lu,et al.  Fast recognition of multi-view faces with feature selection , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[57]  Bernhard Schölkopf,et al.  Generalization Bounds for Convex Combinations of Kernel Functions , 1998 .

[58]  Jiawei Han,et al.  Spectral Regression: A Unified Approach for Sparse Subspace Learning , 2007, Seventh IEEE International Conference on Data Mining (ICDM 2007).

[59]  Nicolas Le Roux,et al.  Out-of-Sample Extensions for LLE, Isomap, MDS, Eigenmaps, and Spectral Clustering , 2003, NIPS.

[60]  Jian Pei,et al.  Parallel field alignment for cross media retrieval , 2013, ACM Multimedia.

[61]  Verónica Bolón-Canedo,et al.  A review of feature selection methods on synthetic data , 2013, Knowledge and Information Systems.

[62]  Sebastian Thrun,et al.  Text Classification from Labeled and Unlabeled Documents using EM , 2000, Machine Learning.

[63]  S T Roweis,et al.  Nonlinear dimensionality reduction by locally linear embedding. , 2000, Science.

[64]  Charles R. Johnson,et al.  Matrix analysis , 1985, Statistical Inference for Engineers and Data Scientists.

[65]  Thomas Brox,et al.  Multiview Deblurring for 3-D Images from Light-Sheet-Based Fluorescence Microscopy , 2012, IEEE Transactions on Image Processing.

[66]  Xiaofei He,et al.  Parallel vector field embedding , 2013, J. Mach. Learn. Res..

[67]  Chris H. Q. Ding,et al.  R1-PCA: rotational invariant L1-norm principal component analysis for robust subspace factorization , 2006, ICML.

[68]  Dean P. Foster Multi-View Dimensionality Reduction via Canonical Correlation Multi-View Dimensionality Reduction via Canonical Correlation Analysis Analysis Multi-View Dimensionality Reduction via Canonical Correlation Analysis Multi-View Dimensionality Reduction via Canonical Correlation Analysis Multi-View Dimen , 2008 .

[69]  Tom Fawcett,et al.  An introduction to ROC analysis , 2006, Pattern Recognit. Lett..

[70]  R. Tibshirani,et al.  Sparse Principal Component Analysis , 2006 .

[71]  Xindong Wu,et al.  How to Estimate the Regularization Parameter for Spectral Regression Discriminant Analysis and its Kernel Version? , 2014, IEEE Transactions on Circuits and Systems for Video Technology.

[72]  Antonio Torralba,et al.  Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope , 2001, International Journal of Computer Vision.

[73]  Pascal Frossard,et al.  Optimal Image Alignment With Random Projections of Manifolds: Algorithm and Geometric Analysis , 2011, IEEE Transactions on Image Processing.

[74]  Antonio Ortega,et al.  On Dependent Bit Allocation for Multiview Image Coding With Depth-Image-Based Rendering , 2011, IEEE Transactions on Image Processing.

[75]  Bart Thomee,et al.  New trends and ideas in visual concept detection: the MIR flickr retrieval evaluation initiative , 2010, MIR '10.

[76]  Ethem Alpaydin,et al.  Multiple Kernel Learning Algorithms , 2011, J. Mach. Learn. Res..

[77]  Jie Gui,et al.  Multi-step dimensionality reduction and semi-supervised graph-based tumor classification using gene expression data , 2010, Artif. Intell. Medicine.

[78]  Isabelle Guyon,et al.  An Introduction to Variable and Feature Selection , 2003, J. Mach. Learn. Res..

[79]  Juhan Nam,et al.  Multimodal Deep Learning , 2011, ICML.

[80]  Tat-Seng Chua,et al.  Semantic-Gap-Oriented Active Learning for Multilabel Image Annotation , 2012, IEEE Transactions on Image Processing.

[81]  Huan Liu,et al.  Multi-Source Feature Selection via Geometry-Dependent Covariance Analysis , 2008, FSDM.

[82]  Wei Wang,et al.  Learning Coupled Feature Spaces for Cross-Modal Matching , 2013, 2013 IEEE International Conference on Computer Vision.

[83]  Hongyuan Zha,et al.  Principal Manifolds and Nonlinear Dimension Reduction via Local Tangent Space Alignment , 2002, ArXiv.

[84]  Tat-Seng Chua,et al.  Image Annotation by Graph-Based Inference With Integrated Multiple/Single Instance Representations , 2010, IEEE Transactions on Multimedia.

[85]  Wei Jia,et al.  Locality preserving discriminant projections for face and palmprint recognition , 2010, Neurocomputing.

[86]  Cordelia Schmid,et al.  Coloring Local Feature Extraction , 2006, ECCV.

[87]  Dacheng Tao,et al.  A Survey on Multi-view Learning , 2013, ArXiv.