A review of the current status and future directions of research on subspace clustering feature selection

Feature selection reduces the dimensionality of high-dimensional data by removing redundant or irrelevant features from the original features, thus reducing the negative impact of the “dimensionality curse.” Subspace clustering feature selection methods focus on the structure and properties within the dataset, so they perform well in unsupervised feature selection work. In this paper, we sort out and classify the research on subspace clustering feature selection and propose several future research trends based on the current status of feature selection in subspace clustering.

[1]  K. Balakrishnan,et al.  Feature selection techniques for microarray datasets: a comprehensive review, taxonomy, and future directions , 2022, Frontiers of Information Technology & Electronic Engineering.

[2]  T. Krilavičius,et al.  A Review of the Modification Strategies of the Nature Inspired Algorithms for Feature Selection Problem , 2022, Mathematics.

[3]  X. Wu,et al.  Online feature selection for multi-source streaming features , 2022, Inf. Sci..

[4]  Maritza Mera-Gaona,et al.  Framework for the Ensemble of Feature Selection Methods , 2021, Applied Sciences.

[5]  Hao Huang,et al.  Learning Associations between Features and Clusters: An Interpretable Deep Clustering Method , 2021, 2021 International Joint Conference on Neural Networks (IJCNN).

[6]  Tarik Abu-Ain,et al.  Feature-Selection and Mutual-Clustering Approaches to Improve DoS Detection and Maintain WSNs’ Lifetime , 2021, Sensors.

[7]  Mohammadreza Sadeghi,et al.  Deep Clustering with Self-supervision using Pairwise Data Similarities , 2021 .

[8]  Dae-Won Kim,et al.  Pairwise dependence-based unsupervised feature selection , 2021, Pattern Recognit..

[9]  Hossein Nezamabadi-pour,et al.  MFS-MCDM: Multi-label feature selection using multi-criteria decision making , 2020, Knowl. Based Syst..

[10]  Suqin Ji,et al.  Clustering ensemble of massive high dimensional data based on BLB and stratified sampling framework , 2020, Proceedings of the 2020 Conference on Artificial Intelligence and Healthcare.

[11]  Kamlesh Dutta,et al.  A comprehensive review on feature set used for anaphora resolution , 2020, Artificial Intelligence Review.

[12]  Muhammad Jamil Khan,et al.  An Innovative Multi-Model Neural Network Approach for Feature Selection in Emotion Recognition Using Deep Feature Clustering , 2020, Sensors.

[13]  Mengjie Zhang,et al.  A survey on swarm intelligence approaches to feature selection in data mining , 2020, Swarm Evol. Comput..

[14]  Bing Xue,et al.  A survey on feature selection approaches for clustering , 2020, Artificial Intelligence Review.

[15]  Hadi Zare,et al.  Unsupervised Feature Selection based on Adaptive Similarity Learning and Subspace Clustering , 2019, Eng. Appl. Artif. Intell..

[16]  Verónica Bolón-Canedo,et al.  Ensembles for feature selection: A review and future trends , 2019, Inf. Fusion.

[17]  2019 IEEE/ACS 16th International Conference on Computer Systems and Applications (AICCSA) , 2019 .

[18]  Nadjia Benblidia,et al.  Fundamentals of Feature Selection: An Overview and Comparison , 2019, 2019 IEEE/ACS 16th International Conference on Computer Systems and Applications (AICCSA).

[19]  Xuelong Li,et al.  Feature selection with multi-view data: A survey , 2019, Inf. Fusion.

[20]  Raúl Santos-Rodríguez,et al.  N2D: (Not Too) Deep Clustering via Clustering the Local Manifold of an Autoencoded Embedding , 2019, 2020 25th International Conference on Pattern Recognition (ICPR).

[21]  Shengxiang Yang,et al.  Dynamic Feature Selection for Clustering High Dimensional Data Streams , 2019, IEEE Access.

[22]  Zhouchen Lin,et al.  Self-Supervised Convolutional Subspace Clustering Network , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[23]  Mauro Fasano,et al.  Statistical analysis of proteomics data: A review on feature selection. , 2019, Journal of proteomics.

[24]  Azuraliza Abu Bakar,et al.  A review of feature selection techniques in sentiment analysis , 2019, Intell. Data Anal..

[25]  Steven L. Waslander,et al.  Network Uncertainty Informed Semantic Feature Selection for Visual SLAM , 2018, 2019 16th Conference on Computer and Robot Vision (CRV).

[26]  Xuelong Li,et al.  Structure preserving unsupervised feature selection , 2018, Neurocomputing.

[27]  Adriano Veloso,et al.  Learning to Rank with Deep Autoencoder Features , 2018, 2018 International Joint Conference on Neural Networks (IJCNN).

[28]  Pichao Wang,et al.  Robust unsupervised feature selection via dual self-representation and manifold regularization , 2018, Knowl. Based Syst..

[29]  Ping Zhang,et al.  Feature selection considering two types of feature relevancy and feature interdependency , 2018, Expert Syst. Appl..

[30]  Qinghua Hu,et al.  Co-regularized unsupervised feature selection , 2018, Neurocomputing.

[31]  Ming Shao,et al.  Infinite ensemble clustering , 2017, Data Mining and Knowledge Discovery.

[32]  Xiaofeng Zhu,et al.  Graph self-representation method for unsupervised feature selection , 2017, Neurocomputing.

[33]  René Vidal,et al.  Structured Sparse Subspace Clustering: A Joint Affinity Learning and Subspace Clustering Framework , 2016, IEEE Transactions on Image Processing.

[34]  Yingzhen Yang,et al.  ℓ ^0 ℓ 0 -Sparse Subspace Clustering , 2016, ECCV.

[35]  Shuai Wang,et al.  UDSFS: Unsupervised deep sparse feature selection , 2016, Neurocomputing.

[36]  Junbin Gao,et al.  Laplacian Regularized Low-Rank Representation and Its Applications , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[37]  Ali Farhadi,et al.  Unsupervised Deep Embedding for Clustering Analysis , 2015, ICML.

[38]  Beatriz de la Iglesia,et al.  Survey on Feature Selection , 2015, ArXiv.

[39]  Xiangjian He,et al.  Unsupervised Feature Selection Method for Intrusion Detection System , 2015, 2015 IEEE Trustcom/BigDataSE/ISPA.

[40]  Xiaojie Guo,et al.  Robust Subspace Segmentation by Simultaneously Learning Data Representations and Their Affinity Matrix , 2015, IJCAI.

[41]  Daniel P. Robinson,et al.  Scalable Sparse Subspace Clustering by Orthogonal Matching Pursuit , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[42]  David Zhang,et al.  Non-convex Regularized Self-representation for Unsupervised Feature Selection , 2015, IScIDE.

[43]  Liang Du,et al.  Unsupervised Feature Selection with Adaptive Structure Learning , 2015, KDD.

[44]  Simon C. K. Shiu,et al.  Unsupervised feature selection by regularized self-representation , 2015, Pattern Recognit..

[45]  Jing Liu,et al.  Clustering-Guided Sparse Structural Learning for Unsupervised Feature Selection , 2014, IEEE Transactions on Knowledge and Data Engineering.

[46]  Jian Yang,et al.  Robust Subspace Segmentation Via Low-Rank Representation , 2014, IEEE Transactions on Cybernetics.

[47]  Jianjiang Feng,et al.  Smooth Representation Clustering , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[48]  Xuelong Li,et al.  Joint Embedding Learning and Sparse Regression: A Framework for Unsupervised Feature Selection , 2014, IEEE Transactions on Cybernetics.

[49]  Yannis Avrithis,et al.  Towards large-scale geometry indexing by feature selection , 2014, Comput. Vis. Image Underst..

[50]  Ming Liu,et al.  Feature selection and pose estimation from known planar objects using monocular vision , 2013, 2013 IEEE International Conference on Robotics and Biomimetics (ROBIO).

[51]  Shuicheng Yan,et al.  Correlation Adaptive Subspace Segmentation by Trace Lasso , 2013, 2013 IEEE International Conference on Computer Vision.

[52]  R. Vidal,et al.  Sparse Subspace Clustering: Algorithm, Theory, and Applications , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[53]  Hyojin Kim,et al.  Multi-feature Vehicle Detection Using Feature Selection , 2013, 2013 IEEE International Conference on Systems, Man, and Cybernetics.

[54]  Shuicheng Yan,et al.  Robust and Efficient Subspace Segmentation via Least Squares Regression , 2012, ECCV.

[55]  Jing Liu,et al.  Unsupervised Feature Selection Using Nonnegative Spectral Analysis , 2012, AAAI.

[56]  Hans-Peter Kriegel,et al.  Subspace clustering , 2012, WIREs Data Mining Knowl. Discov..

[57]  Nenghai Yu,et al.  Non-negative low rank and sparse graph for semi-supervised learning , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[58]  Julien Mairal,et al.  Optimization with Sparsity-Inducing Penalties , 2011, Found. Trends Mach. Learn..

[59]  Yong Yu,et al.  Robust Recovery of Subspace Structures by Low-Rank Representation , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[60]  Krzysztof Michalak,et al.  Correlation based feature selection method , 2010, Int. J. Bio Inspired Comput..

[61]  Chiyuan Zhang,et al.  Unsupervised feature selection for multi-cluster data , 2010, KDD.

[62]  Lei Wang,et al.  Efficient Spectral Feature Selection with Minimum Redundancy , 2010, AAAI.

[63]  Ehsan Elhamifar,et al.  Sparse subspace clustering , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[64]  Michael Elad,et al.  Sparse Representation for Color Image Restoration , 2008, IEEE Transactions on Image Processing.

[65]  Huan Liu,et al.  Spectral feature selection for supervised and unsupervised learning , 2007, ICML '07.

[66]  R. Tibshirani,et al.  Sparse Principal Component Analysis , 2006 .

[67]  Mineichi Kudo,et al.  Non-parametric classifier-independent feature selection , 2006, Pattern Recognit..

[68]  Deng Cai,et al.  Laplacian Score for Feature Selection , 2005, NIPS.

[69]  S. Shankar Sastry,et al.  Generalized principal component analysis (GPCA) , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[70]  David J. Kriegman,et al.  Clustering appearances of objects under varying illumination conditions , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[71]  Christopher M. Bishop,et al.  Mixtures of Probabilistic Principal Component Analyzers , 1999, Neural Computation.

[72]  Takeo Kanade,et al.  A Multibody Factorization Method for Independently Moving Objects , 1998, International Journal of Computer Vision.

[73]  K. Esbensen,et al.  Principal component analysis , 1987 .

[74]  Robert C. Bolles,et al.  Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography , 1981, CACM.

[75]  J. A. Hartigan,et al.  A k-means clustering algorithm , 1979 .

[76]  Ali Wagdy Mohamed,et al.  Metaheuristic Algorithms on Feature Selection: A Survey of One Decade of Research (2009-2019) , 2021, IEEE Access.

[77]  Zhensong Chen,et al.  Unsupervised feature selection by non-convex regularized self-representation , 2021, Expert Syst. Appl..

[78]  Gang Yu,et al.  A multi-stage semi-supervised improved deep embedded clustering (MS-SSIDEC) method for bearing fault diagnosis under the situation of insufficient labeled samples , 2021, ArXiv.

[79]  Afnan M. Alhassan,et al.  Review of Feature Selection, Dimensionality Reduction and Classification for Chronic Disease Diagnosis , 2021, IEEE Access.

[80]  Muhammad Shaheen,et al.  Importance of Features Selection, Attributes Selection, Challenges and Future Directions for Medical Imaging Data: A Review , 2020, Computer Modeling in Engineering & Sciences.

[81]  D. Damen,et al.  2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) , 2019 .

[82]  Guangcan Liu,et al.  Implicit Block Diagonal Low-Rank Representation , 2018, IEEE Transactions on Image Processing.

[83]  Zhansheng Duan,et al.  Generalized Principal Component Analysis , 2017 .

[84]  Yangyang Li,et al.  Self-representation based dual-graph regularized feature selection clustering , 2016, Neurocomputing.

[85]  Lei Wang,et al.  Global and Local Structure Preservation for Feature Selection , 2014, IEEE Transactions on Neural Networks and Learning Systems.

[86]  Zongben Xu,et al.  Regularization: Convergence of Iterative Half Thresholding Algorithm , 2014 .

[87]  Ferat Sahin,et al.  A survey on feature selection methods , 2014, Comput. Electr. Eng..

[88]  P. Koistinen,et al.  Pattern Recognition , 2012, Lecture Notes in Computer Science.

[89]  Ying Cui,et al.  Convex Principal Feature Selection , 2010, SDM.

[90]  Chengqi Zhang,et al.  Missing Value Imputation Based on Data Clustering , 2008, Trans. Comput. Sci..

[91]  Frank Dellaert,et al.  EM, MCMC, and Chain Flipping for Structure from Motion with Unknown Correspondence , 2004, Machine Learning.

[92]  J. Galloway A Review of the , 1901 .