Structured learning for unsupervised feature selection with high-order matrix factorization

Abstract Feature selection aims at searching the most discriminative and relevant features from high-dimensional data to improve the performance of certain learning tasks. Whereas, irrelevant or redundant features may increase the over-fitting risk of consequent learning algorithms. Structured learning of feature selection is to embed intrinsic structures of data, such as geometric structures and manifold structures, resulting in the improvement of learning performance. In this paper, three types of structured regularizers are embedded into the feature selection framework and an iterative algorithm with proved convergence for feature selection problem is proposed. First, serving as crucial representation pipelines of local structures, three types of local learning regularizers, including graph Laplacian, neighborhood preservation and sparsity regularizer, are defined. Second, the local and global structures are integrated into one joint framework for the feature selection problem. Third, the framework is formulated as the canonical form of high-order matrix factorizations and then an efficient convergent iterative algorithm is proposed for the problem. Besides, the proposed framework is further extended to multi-view feature selection and fusion problems from an algorithmic view. Finally, the proposed algorithm is tested on eight publicly available datasets and compared to several state-of-the-art feature selection methods. Experimental results demonstrate the superiority of the proposed method against the compared algorithms in terms of clustering performance.

[1]  Ron Kohavi,et al.  Wrappers for Feature Subset Selection , 1997, Artif. Intell..

[2]  Jing Liu,et al.  Unsupervised Feature Selection Using Nonnegative Spectral Analysis , 2012, AAAI.

[3]  Lei Wang,et al.  Global and Local Structure Preservation for Feature Selection , 2014, IEEE Transactions on Neural Networks and Learning Systems.

[4]  Feiping Nie,et al.  Proceedings of the Twenty-Second International Joint Conference on Artificial Intelligence Feature Selection via Joint Embedding Learning and Sparse Regression , 2022 .

[5]  Mikhail F. Kanevski,et al.  Feature Selection for Regression Problems Based on the Morisita Estimator of Intrinsic Dimension: Concept and Case Studies , 2016, Pattern Recognit..

[6]  Zi Huang,et al.  Proceedings of the Twenty-Second International Joint Conference on Artificial Intelligence ℓ2,1-Norm Regularized Discriminative Feature Selection for Unsupervised Learning , 2022 .

[7]  R. Gray Entropy and Information Theory , 1990, Springer New York.

[8]  Huan Liu,et al.  Feature selection for clustering - a filter solution , 2002, 2002 IEEE International Conference on Data Mining, 2002. Proceedings..

[9]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[10]  Ya-Feng Liu,et al.  LLE Score: A New Filter-Based Unsupervised Feature Selection Method Based on Nonlinear Manifold Embedding and Its Application to Image Recognition , 2017, IEEE Transactions on Image Processing.

[11]  Hossam Faris,et al.  Binary grasshopper optimisation algorithm approaches for feature selection problems , 2019, Expert Syst. Appl..

[12]  Huan Liu,et al.  Challenges of Feature Selection for Big Data Analytics , 2016, IEEE Intelligent Systems.

[13]  Fan Chung,et al.  Spectral Graph Theory , 1996 .

[14]  Mikhail Belkin,et al.  Laplacian Eigenmaps for Dimensionality Reduction and Data Representation , 2003, Neural Computation.

[15]  Tieniu Tan,et al.  Feature Selection Based on Structured Sparsity: A Comprehensive Study , 2017, IEEE Transactions on Neural Networks and Learning Systems.

[16]  Qinghua Hu,et al.  Subspace clustering guided unsupervised feature selection , 2017, Pattern Recognit..

[17]  Jingjing Li,et al.  Adaptive Collaborative Similarity Learning for Unsupervised Multi-view Feature Selection , 2018, IJCAI.

[18]  Xiaofeng Zhu,et al.  Local and Global Structure Preservation for Robust Unsupervised Spectral Feature Selection , 2018, IEEE Transactions on Knowledge and Data Engineering.

[19]  Jean Paul Barddal,et al.  Merit-guided dynamic feature selection filter for data streams , 2019, Expert Syst. Appl..

[20]  Witold Pedrycz,et al.  Subspace learning for unsupervised feature selection via matrix factorization , 2015, Pattern Recognit..

[21]  Shuicheng Yan,et al.  Neighborhood preserving embedding , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[22]  Feiping Nie,et al.  Efficient and Robust Feature Selection via Joint ℓ2, 1-Norms Minimization , 2010, NIPS.

[23]  Mohamed F. Ghalwash,et al.  Structured feature selection using coordinate descent optimization , 2016, BMC Bioinformatics.

[24]  Feiping Nie,et al.  Nonnegative Matrix Tri-factorization Based High-Order Co-clustering and Its Fast Implementation , 2011, 2011 IEEE 11th International Conference on Data Mining.

[25]  Qinghua Zheng,et al.  Adaptive Unsupervised Feature Selection With Structure Regularization , 2018, IEEE Transactions on Neural Networks and Learning Systems.

[26]  S T Roweis,et al.  Nonlinear dimensionality reduction by locally linear embedding. , 2000, Science.

[27]  Shichao Zhang,et al.  Robust Joint Graph Sparse Coding for Unsupervised Spectral Feature Selection , 2017, IEEE Transactions on Neural Networks and Learning Systems.

[28]  Fang Liu,et al.  Unsupervised feature selection based on maximum information and minimum redundancy for hyperspectral images , 2016, Pattern Recognit..

[29]  Yong Luo,et al.  Cost-Sensitive Feature Selection by Optimizing F-Measures , 2018, IEEE Transactions on Image Processing.

[30]  Xuelong Li,et al.  Generalized Uncorrelated Regression with Adaptive Graph for Unsupervised Feature Selection , 2019, IEEE Transactions on Neural Networks and Learning Systems.

[31]  Badong Chen,et al.  Maximum Correntropy Criterion-Based Sparse Subspace Learning for Unsupervised Feature Selection , 2019, IEEE Transactions on Circuits and Systems for Video Technology.

[32]  Jie Tian,et al.  Robust graph regularized unsupervised feature selection , 2018, Expert Syst. Appl..

[33]  William Zhu,et al.  Sparse Graph Embedding Unsupervised Feature Selection , 2018, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[34]  Chris H. Q. Ding,et al.  Convex and Semi-Nonnegative Matrix Factorizations , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[35]  ChengXiang Zhai,et al.  Robust Unsupervised Feature Selection , 2013, IJCAI.

[36]  Wenzhong Guo,et al.  Sparse Multigraph Embedding for Multimodal Feature Representation , 2017, IEEE Transactions on Multimedia.

[37]  H. Sebastian Seung,et al.  Algorithms for Non-negative Matrix Factorization , 2000, NIPS.

[38]  Lei Wang,et al.  On Similarity Preserving Feature Selection , 2013, IEEE Transactions on Knowledge and Data Engineering.

[39]  Witold Pedrycz,et al.  Global and local structure preserving sparse subspace learning: An iterative approach to unsupervised feature selection , 2015, Pattern Recognit..