Discriminative and Uncorrelated Feature Selection With Constrained Spectral Analysis in Unsupervised Learning

The existing unsupervised feature extraction methods frequently explore low-redundant features by an uncorrelated constraint. However, the constrained models might incur trivial solutions, due to the singularity of scatter matrix triggered by high-dimensional data. In this paper, we propose a regularized regression model with a generalized uncorrelated constraint for feature selection, which leads to three merits: 1) exploring the low-redundant and discriminative features; 2) avoiding the trivial solutions and 3) simplifying the optimization. Besides that, the local cluster structure is achieved via a novel constrained spectral analysis for the unsupervised learning, where Must-Links and Cannot-Links are transformed into a intrinsic graph and a penalty graph respectively, rather than incorporated into a mixed affinity graph. Accordingly, a discriminative and uncorrelated feature selection with constrained spectral analysis (DUCFS) is proposed with adopting $\sigma $ -norm regularization for interpolating between F-norm and $\ell _{2,1}$ -norm. Due to the flexible gradient and global differentiability, our model converges fast. Extensive experiments on benchmark datasets among several state-of-the-art approaches verify the effectiveness of the proposed method.

[1]  Feiping Nie,et al.  Clustering and projected clustering with adaptive neighbors , 2014, KDD.

[2]  Feiping Nie,et al.  Robust Manifold Nonnegative Matrix Factorization , 2014, ACM Trans. Knowl. Discov. Data.

[3]  Xuelong Li,et al.  Unsupervised Feature Selection via Adaptive Multimeasure Fusion , 2019, IEEE Transactions on Neural Networks and Learning Systems.

[4]  Deng Cai,et al.  Laplacian Score for Feature Selection , 2005, NIPS.

[5]  Ming Shao,et al.  Consensus Guided Unsupervised Feature Selection , 2016, AAAI.

[6]  Xuelong Li,et al.  A generalized power iteration method for solving quadratic problem on the Stiefel manifold , 2017, Science China Information Sciences.

[7]  Sohail Asghar,et al.  A REVIEW OF FEATURE SELECTION TECHNIQUES IN STRUCTURE LEARNING , 2013 .

[8]  Ian Davidson,et al.  On constrained spectral clustering and its applications , 2012, Data Mining and Knowledge Discovery.

[9]  Xuelong Li,et al.  Joint Embedding Learning and Sparse Regression: A Framework for Unsupervised Feature Selection , 2014, IEEE Transactions on Cybernetics.

[10]  Feiping Nie,et al.  Efficient and Robust Feature Selection via Joint ℓ2, 1-Norms Minimization , 2010, NIPS.

[11]  Lei Shi,et al.  Robust Spectral Learning for Unsupervised Feature Selection , 2014, 2014 IEEE International Conference on Data Mining.

[12]  Junmo Kim,et al.  Unsupervised Simultaneous Orthogonal basis Clustering Feature Selection , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[13]  Xuelong Li,et al.  Unsupervised Feature Selection with Structured Graph Optimization , 2016, AAAI.

[14]  Ian Davidson,et al.  Flexible constrained spectral clustering , 2010, KDD.

[15]  Hiroshi Motoda,et al.  Feature Selection for Knowledge Discovery and Data Mining , 1998, The Springer International Series in Engineering and Computer Science.

[16]  Jin Zhong AN OPTIMAL SET OF UNCORRELATED DISCRIMINANT FEATURES , 1999 .

[17]  Jonathan J. Hull,et al.  A Database for Handwritten Text Recognition Research , 1994, IEEE Trans. Pattern Anal. Mach. Intell..

[18]  Xuelong Li,et al.  Generalized Uncorrelated Regression with Adaptive Graph for Unsupervised Feature Selection , 2019, IEEE Transactions on Neural Networks and Learning Systems.

[19]  Xuelong Li,et al.  A Generalized Uncorrelated Ridge Regression with Nonnegative Labels for Unsupervised Feature Selection , 2018, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[20]  Mark A. Hall,et al.  Correlation-based Feature Selection for Machine Learning , 2003 .

[21]  Anil K. Jain,et al.  Small Sample Size Effects in Statistical Pattern Recognition: Recommendations for Practitioners , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[22]  Feiping Nie,et al.  Self-Weighted Supervised Discriminative Feature Selection , 2018, IEEE Transactions on Neural Networks and Learning Systems.

[23]  Andrew B. Kahng,et al.  New spectral methods for ratio cut partitioning and clustering , 1991, IEEE Trans. Comput. Aided Des. Integr. Circuits Syst..

[24]  Mikkel B. Stegmann,et al.  The IMM Face Database, An Annotated Dataset of 240 Face Images , 2004 .

[25]  Ronald A. Cole,et al.  Spoken Letter Recognition , 1990, HLT.

[26]  Sameer A. Nene,et al.  Columbia Object Image Library (COIL100) , 1996 .

[27]  Xuelong Li,et al.  Self-Tuned Discrimination-Aware Method for Unsupervised Feature Selection , 2019, IEEE Transactions on Neural Networks and Learning Systems.

[28]  Ivor W. Tsang,et al.  Flexible Manifold Embedding: A Framework for Semi-Supervised and Unsupervised Dimension Reduction , 2010, IEEE Transactions on Image Processing.

[29]  Ron Kohavi,et al.  Wrappers for Feature Subset Selection , 1997, Artif. Intell..

[30]  Stephen P. Boyd,et al.  Convex Optimization , 2004, Algorithms and Theory of Computation Handbook.

[31]  Jing Liu,et al.  Unsupervised Feature Selection Using Nonnegative Spectral Analysis , 2012, AAAI.

[32]  Kenneth Steiglitz,et al.  Combinatorial Optimization: Algorithms and Complexity , 1981 .

[33]  Yurii Nesterov,et al.  Generalized Power Method for Sparse Principal Component Analysis , 2008, J. Mach. Learn. Res..

[34]  Pedro Larrañaga,et al.  A review of feature selection techniques in bioinformatics , 2007, Bioinform..

[35]  Pietro Perona,et al.  Learning Generative Visual Models from Few Training Examples: An Incremental Bayesian Approach Tested on 101 Object Categories , 2004, 2004 Conference on Computer Vision and Pattern Recognition Workshop.

[36]  Alain Rakotomamonjy,et al.  Variable Selection Using SVM-based Criteria , 2003, J. Mach. Learn. Res..

[37]  Feiping Nie,et al.  Trace Ratio Criterion for Feature Selection , 2008, AAAI.

[38]  Huan Liu,et al.  Embedded Unsupervised Feature Selection , 2015, AAAI.

[39]  Pat Langley,et al.  Selection of Relevant Features and Examples in Machine Learning , 1997, Artif. Intell..

[40]  Deng Cai,et al.  Unsupervised feature selection for multi-cluster data , 2010, KDD.

[41]  B. Green THE ORTHOGONAL APPROXIMATION OF AN OBLIQUE STRUCTURE IN FACTOR ANALYSIS , 1952 .