Supervised dimensionality reduction that preserves both global and local information

Dimensionality reduction is a problem of fundamental importance in both machine learning and data mining. In this paper, we develop a new approach that can process labeled datasets and accurately reduce their dimensionalities. The approach is based on a new objective that contains information from both the global and local structures of a data set. An iterative approach is used to compute the directions along which the objective can be optimized. Experiments on benchmark data sets show that both the global and local information in a dataset can be effectively captured by this approach and it is thus able to provide more accurate results for dimensionality reduction than some existing approaches.

[1]  David J. Kriegman,et al.  Eigenfaces vs. Fisherfaces: Recognition Using Class Specific Linear Projection , 1996, ECCV.

[2]  Jieping Ye,et al.  Integrating Global and Local Structures: A Least Squares Framework for Dimensionality Reduction , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[3]  Yinglei Song On the independent set problem in random graphs , 2015, Int. J. Comput. Math..

[4]  Yinglei Song,et al.  An improved parameterized algorithm for the independent feedback vertex set problem , 2013, Theor. Comput. Sci..

[5]  S. Dudoit,et al.  Comparison of Discrimination Methods for the Classification of Tumors Using Gene Expression Data , 2002 .

[6]  Yinglei Song,et al.  A New Parameterized Algorithm for Rapid Peptide Sequencing , 2014, PloS one.

[7]  S T Roweis,et al.  Nonlinear dimensionality reduction by locally linear embedding. , 2000, Science.

[8]  D. Ruppert The Elements of Statistical Learning: Data Mining, Inference, and Prediction , 2004 .

[9]  Joshua B. Tenenbaum,et al.  Global Versus Local Methods in Nonlinear Dimensionality Reduction , 2002, NIPS.

[10]  David G. Stork,et al.  Pattern classification, 2nd Edition , 2000 .

[11]  R. Fisher THE USE OF MULTIPLE MEASUREMENTS IN TAXONOMIC PROBLEMS , 1936 .

[12]  Mikhail Belkin,et al.  Laplacian Eigenmaps and Spectral Techniques for Embedding and Clustering , 2001, NIPS.

[13]  David G. Stork,et al.  Pattern Classification , 1973 .

[14]  Michael I. Jordan,et al.  Advances in Neural Information Processing Systems 30 , 1995 .

[15]  Xiaofei He,et al.  Locality Preserving Projections , 2003, NIPS.