Discriminative Unsupervised Dimensionality Reduction

As an important machine learning topic, dimensionality reduction has been widely studied and utilized in various kinds of areas. A multitude of dimensionality reduction methods have been developed, among which unsupervised dimensionality reduction is more desirable when obtaining label information requires onerous work. However, most previous unsupervised dimensionality reduction methods call for an affinity graph constructed beforehand, with which the following dimensionality reduction steps can be then performed. Separation of graph construction and dimensionality reduction leads the dimensionality reduction process highly dependent on quality of the input graph. In this paper, we propose a novel graph embedding method for unsupervised dimensionality reduction. We simultaneously conduct dimensionality reduction along with graph construction by assigning adaptive and optimal neighbors according to the projected local distances. Our method doesn't need an affinity graph constructed in advance, but instead learns the graph concurrently with dimensionality reduction. Thus, the learned graph is optimal for dimensionality reduction. Meanwhile, our learned graph has an explicit block diagonal structure, from which the clustering results could be directly revealed without any postprocessing steps. Extensive empirical results on dimensionality reduction as well as clustering are presented to corroborate the performance of our method.

[1]  A. Macallum The University of Toronto , 1907, Nature.

[2]  B. Ripley,et al.  Pattern Recognition , 1968, Nature.

[3]  Sean Hughes,et al.  Clustering by Fast Search and Find of Density Peaks , 2016 .

[4]  S T Roweis,et al.  Nonlinear dimensionality reduction by locally linear embedding. , 2000, Science.

[5]  K. Fan On a Theorem of Weyl Concerning Eigenvalues of Linear Transformations I. , 1949, Proceedings of the National Academy of Sciences of the United States of America.

[6]  Feiping Nie,et al.  Neighborhood MinMax Projections , 2007, IJCAI.

[7]  Feiping Nie,et al.  Semi-supervised orthogonal discriminant analysis via label propagation , 2009, Pattern Recognit..

[8]  K. Fan On a Theorem of Weyl Concerning Eigenvalues of Linear Transformations: II. , 1949, Proceedings of the National Academy of Sciences of the United States of America.

[9]  B. Mohar THE LAPLACIAN SPECTRUM OF GRAPHS y , 1991 .

[10]  Feiping Nie,et al.  Clustering and projected clustering with adaptive neighbors , 2014, KDD.

[11]  Cheng Li,et al.  Fisher Linear Discriminant Analysis , 2014 .

[12]  Edward Y. Chang,et al.  Parallel Spectral Clustering in Distributed Systems , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[13]  Xiaofei He,et al.  Locality Preserving Projections , 2003, NIPS.

[14]  Michael William Newman,et al.  The Laplacian spectrum of graphs , 2001 .

[15]  Mikhail Belkin,et al.  Laplacian Eigenmaps and Spectral Techniques for Embedding and Clustering , 2001, NIPS.

[16]  A. Martínez,et al.  The AR face databasae , 1998 .

[17]  Ja-Chen Lin,et al.  A new LDA-based face recognition system which can solve the small sample size problem , 1998, Pattern Recognit..

[18]  Rob H. Bisseling,et al.  Parallel Scientific Computation , 2004 .

[19]  Hua Yu,et al.  A direct LDA algorithm for high-dimensional data - with application to face recognition , 2001, Pattern Recognit..

[20]  Feiping Nie,et al.  Unsupervised Feature Selection via Unified Trace Ratio Formulation and K-means Clustering (TRACK) , 2014, ECML/PKDD.

[21]  Michael J. Lyons,et al.  Coding facial expressions with Gabor wavelets , 1998, Proceedings Third IEEE International Conference on Automatic Face and Gesture Recognition.

[22]  Feiping Nie,et al.  Flexible Shift-Invariant Locality and Globality Preserving Projections , 2014, ECML/PKDD.

[23]  Jiri Matas,et al.  XM2VTSDB: The Extended M2VTS Database , 1999 .

[24]  John B. Shoven,et al.  I , Edinburgh Medical and Surgical Journal.

[25]  Joaquin Vanschoren,et al.  EUROPEAN CONFERENCE ON MACHINE LEARNING AND PRINCIPLES AND PRACTICE OF KNOWLEDGE DISCOVERY IN DATABASES , 2012 .

[26]  Feiping Nie,et al.  A general kernelization framework for learning algorithms based on kernel PCA , 2010, Neurocomputing.

[27]  Sameer A. Nene,et al.  Columbia Object Image Library (COIL100) , 1996 .

[28]  James C. Bezdek,et al.  Convergence of Alternating Optimization , 2003, Neural Parallel Sci. Comput..

[29]  H. Damasio,et al.  IEEE Transactions on Pattern Analysis and Machine Intelligence: Special Issue on Perceptual Organization in Computer Vision , 1998 .

[30]  Yousef Saad,et al.  Orthogonal Neighborhood Preserving Projections: A Projection-Based Dimensionality Reduction Technique , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[31]  P. Cochat,et al.  Et al , 2008, Archives de pediatrie : organe officiel de la Societe francaise de pediatrie.