Structural max-margin discriminant analysis for feature extraction

Subclass discriminant analysis (SDA) is a recently developed dimensionality reduction technique which takes into consideration the intrinsic structure information lurking in data by approximating unknown distribution of each class with multiple Gaussian distributions, namely, subclasses. However, in SDA, the separability between heterogeneous subclasses, i.e. those from different classes, is measured by the between-subclass scatter calculated as average distance between the means of these subclasses. In this paper, in the view of maximum margin principle, we propose a novel feature extraction method coined structural max-margin discriminant analysis (SMDA), in order to enhance the performance of SDA. Specifically, SMDA targets at finding an orthogonal linear embedded subspace in which the margin, defined as the minimum pairwise between-subclasses distance, is maximized and simultaneously the within-subclasses scatter is minimized. The concrete formulation of the resulting model boils down to a nonconvex optimization problem that can be solved by combining the constrained concave-convex procedure with the column generation technique. We evaluate the proposed SMDA on several benchmark datasets and the experimental results confirm the effectiveness of the proposed method.

[1]  Jiawei Han,et al.  Isometric Projection , 2007, AAAI.

[2]  Stephen Lin,et al.  Graph Embedding and Extensions: A General Framework for Dimensionality Reduction , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3]  Xiaofei He,et al.  Locality Preserving Projections , 2003, NIPS.

[4]  Philip Chan,et al.  Determining the number of clusters/segments in hierarchical clustering/segmentation algorithms , 2004, 16th IEEE International Conference on Tools with Artificial Intelligence.

[5]  Anastasios Tefas,et al.  Exploiting subclass information in Support Vector Machines , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).

[6]  J. H. Ward Hierarchical Grouping to Optimize an Objective Function , 1963 .

[7]  Stephen P. Boyd,et al.  Convex Optimization , 2004, Algorithms and Theory of Computation Handbook.

[8]  Stephen Lin,et al.  Marginal Fisher Analysis and Its Variants for Human Gait Recognition and Content- Based Image Retrieval , 2007, IEEE Transactions on Image Processing.

[9]  Keinosuke Fukunaga,et al.  Introduction to statistical pattern recognition (2nd ed.) , 1990 .

[10]  David J. Kriegman,et al.  Eigenfaces vs. Fisherfaces: Recognition Using Class Specific Linear Projection , 1996, ECCV.

[11]  Yuan-Hai Shao,et al.  A regularization for the projection twin support vector machine , 2013, Knowl. Based Syst..

[12]  Hwann-Tzong Chen,et al.  Local discriminant embedding and its variants , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[13]  Alan L. Yuille,et al.  The Concave-Convex Procedure , 2003, Neural Computation.

[14]  Fei Wang,et al.  Unsupervised Large Margin Discriminative Projection , 2011, IEEE Transactions on Neural Networks.

[15]  Sinisa Todorovic,et al.  Local-Learning-Based Feature Selection for High-Dimensional Data Analysis , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[16]  Jian Yang,et al.  Regularized least squares fisher linear discriminant with applications to image recognition , 2013, Neurocomputing.

[17]  Jian Yang,et al.  Recursive projection twin support vector machine via within-class variance minimization , 2011, Pattern Recognit..

[18]  Dong Xu,et al.  Structural twin parametric-margin support vector machine for binary classification , 2013, Knowl. Based Syst..

[19]  Trevor Hastie,et al.  The Elements of Statistical Learning , 2001 .

[20]  J. Tenenbaum,et al.  A global geometric framework for nonlinear dimensionality reduction. , 2000, Science.

[21]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[22]  Jim Jing-Yan Wang,et al.  Discriminative sparse coding on multi-manifolds , 2013, Knowl. Based Syst..

[23]  Anastasios Tefas,et al.  Subclass discriminant Nonnegative Matrix Factorization for facial image analysis , 2012, Pattern Recognit..

[24]  Catherine Blake,et al.  UCI Repository of machine learning databases , 1998 .

[25]  Fei Wang,et al.  Efficient Maximum Margin Clustering via Cutting Plane Algorithm , 2008, SDM.

[26]  David J. Kriegman,et al.  Eigenfaces vs. Fisherfaces: Recognition Using Class Specific Linear Projection , 1996, ECCV.

[27]  Aleix M. Martínez,et al.  Subclass discriminant analysis , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[28]  P. Groenen,et al.  Modern multidimensional scaling , 1996 .

[29]  Bo Xu,et al.  Dimensionality Reduction by Minimal Distance Maximization , 2010, 2010 20th International Conference on Pattern Recognition.

[30]  Yunde Jia,et al.  Discriminant Clustering Embedding for Face Recognition with Image Sets , 2007, ACCV.

[31]  Kun Zhou,et al.  Locality Sensitive Discriminant Analysis , 2007, IJCAI.

[32]  P. Groenen,et al.  Modern Multidimensional Scaling: Theory and Applications , 1999 .

[33]  Shuicheng Yan,et al.  Neighborhood preserving embedding , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[34]  Jian Yang,et al.  Recursive robust least squares support vector regression based on maximum correntropy criterion , 2012, Neurocomputing.

[35]  Keinosuke Fukunaga,et al.  Introduction to Statistical Pattern Recognition , 1972 .

[36]  S T Roweis,et al.  Nonlinear dimensionality reduction by locally linear embedding. , 2000, Science.

[37]  Masashi Sugiyama,et al.  Dimensionality Reduction of Multimodal Labeled Data by Local Fisher Discriminant Analysis , 2007, J. Mach. Learn. Res..

[38]  Chunxia Zhao,et al.  Recursive "concave-convex" Fisher Linear Discriminant with applications to face, handwritten digit and terrain recognition , 2012, Pattern Recognit..

[39]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[40]  Gunnar Rätsch,et al.  Large Scale Multiple Kernel Learning , 2006, J. Mach. Learn. Res..

[41]  Qiang Yang,et al.  Structural Regularized Support Vector Machine: A Framework for Structural Large Margin Classifier , 2011, IEEE Transactions on Neural Networks.

[42]  Zhong Jin,et al.  Face recognition using discriminant locality preserving projections based on maximum margin criterion , 2010, Pattern Recognit..

[43]  Yong Shi,et al.  Structural twin support vector machine for classification , 2013, Knowl. Based Syst..

[44]  Ian T. Jolliffe,et al.  Principal Component Analysis , 2002, International Encyclopedia of Statistical Science.