Unsupervised feature selection via adaptive hypergraph regularized latent representation learning

Abstract Due to the rapid development of multimedia technology, a large number of unlabelled data with high dimensionality need to be processed. The high dimensionality of data not only increases the computation burden of computer hardware, but also hinders algorithms to obtain optimal performance. Unsupervised feature selection, which is regarded as a means of dimensionality reduction, has been widely recognized as an important and challenging pre-step for many machine learning and data mining tasks. However, we observe that there are at least two issues in previous unsupervised feature selection methods. Firstly, traditional unsupervised feature selection algorithms usually assume that the data instances are identically distributed and there is no dependency between them. However, the data instances are not only associated with high dimensional features but also inherently interconnected with each other. Secondly, the traditional similarity graph used in previous methods can only describe the pair-wise relations of data, but cannot capture the high-order relations, so that the complex structures implied in the data cannot be sufficiently exploited. In this work, we propose a robust unsupervised feature selection method which embeds the latent representation learning into feature selection. Instead of measuring the feature importances in original data space, the feature selection is carried out in the learned latent representation space which is more robust to noises. In order to capture the local manifold geometrical structure of original data in a high-order manner, a hypergraph is adaptively learned and embedded into the resultant model. An efficient alternating algorithm is developed to optimize the problem. Experimental results on eight benchmark data sets demonstrate the effectiveness of the proposed method.

[1]  Huan Liu,et al.  Robust Unsupervised Feature Selection on Networked Data , 2016, SDM.

[2]  Deng Cai,et al.  Unsupervised feature selection for multi-cluster data , 2010, KDD.

[3]  Huan Liu,et al.  Spectral feature selection for supervised and unsupervised learning , 2007, ICML '07.

[4]  Huan Liu,et al.  Embedded Unsupervised Feature Selection , 2015, AAAI.

[5]  V. S. Shankar Sriram,et al.  Hypergraph Based Feature Selection Technique for Medical Diagnosis , 2016, Journal of Medical Systems.

[6]  Qinghua Hu,et al.  Subspace clustering guided unsupervised feature selection , 2017, Pattern Recognit..

[7]  Isabelle Guyon,et al.  An Introduction to Variable and Feature Selection , 2003, J. Mach. Learn. Res..

[8]  Jun Morimoto,et al.  Model-based reinforcement learning with dimension reduction , 2016, Neural Networks.

[9]  Chris H. Q. Ding,et al.  Symmetric Nonnegative Matrix Factorization for Graph Clustering , 2012, SDM.

[10]  Wilfried Philips,et al.  Feature Extraction of Hyperspectral Images With Semisupervised Graph Learning , 2016, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[11]  V. Milman New proof of the theorem of A. Dvoretzky on intersections of convex bodies , 1971 .

[12]  Bo Wang,et al.  Learning from label proportions on high-dimensional data , 2018, Neural Networks.

[13]  Wei-Ying Ma,et al.  An adaptive graph model for automatic image annotation , 2006, MIR '06.

[14]  Lei Wang,et al.  Efficient Spectral Feature Selection with Minimum Redundancy , 2010, AAAI.

[15]  Xinwang Liu,et al.  Cross-View Local Structure Preserved Diversity and Consensus Learning for Multi-View Unsupervised Feature Selection , 2019, AAAI.

[16]  Xiaoli Chen,et al.  Global and intrinsic geometric structure embedding for unsupervised feature selection , 2018, Expert Syst. Appl..

[17]  M E J Newman,et al.  Finding and evaluating community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[18]  Xinwang Liu,et al.  Learning a Joint Affinity Graph for Multiview Subspace Clustering , 2019, IEEE Transactions on Multimedia.

[19]  Aleksandra Pizurica,et al.  Semisupervised Local Discriminant Analysis for Feature Extraction in Hyperspectral Images , 2013, IEEE Transactions on Geoscience and Remote Sensing.

[20]  Jie Tian,et al.  Robust graph regularized unsupervised feature selection , 2018, Expert Syst. Appl..

[21]  Petros Drineas,et al.  Feature selection for linear SVM with provable guarantees , 2014, Pattern Recognit..

[22]  Richard Weber,et al.  A wrapper method for feature selection using Support Vector Machines , 2009, Inf. Sci..

[23]  Jing Liu,et al.  Unsupervised Feature Selection Using Nonnegative Spectral Analysis , 2012, AAAI.

[24]  Lei Wang,et al.  Global and Local Structure Preservation for Feature Selection , 2014, IEEE Transactions on Neural Networks and Learning Systems.

[25]  Feiping Nie,et al.  Trace Ratio Criterion for Feature Selection , 2008, AAAI.

[26]  Anil K. Jain,et al.  Feature Selection: Evaluation, Application, and Small Sample Performance , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[27]  Zi Huang,et al.  Proceedings of the Twenty-Second International Joint Conference on Artificial Intelligence ℓ2,1-Norm Regularized Discriminative Feature Selection for Unsupervised Learning , 2022 .

[28]  Xiaofeng Zhu,et al.  Local and Global Structure Preservation for Robust Unsupervised Spectral Feature Selection , 2018, IEEE Transactions on Knowledge and Data Engineering.

[29]  D. Hunter,et al.  Optimization Transfer Using Surrogate Objective Functions , 2000 .

[30]  Xiaojin Zhu,et al.  --1 CONTENTS , 2006 .

[31]  Pichao Wang,et al.  Robust unsupervised feature selection via dual self-representation and manifold regularization , 2018, Knowl. Based Syst..

[32]  Feiping Nie,et al.  Efficient and Robust Feature Selection via Joint ℓ2, 1-Norms Minimization , 2010, NIPS.

[33]  Zhaoshui He,et al.  Symmetric Nonnegative Matrix Factorization: Algorithms and Applications to Probabilistic Clustering , 2011, IEEE Transactions on Neural Networks.

[34]  Simon C. K. Shiu,et al.  Unsupervised feature selection by regularized self-representation , 2015, Pattern Recognit..

[35]  Witold Pedrycz,et al.  Subspace learning for unsupervised feature selection via matrix factorization , 2015, Pattern Recognit..

[36]  Xiaofeng Zhu,et al.  Graph self-representation method for unsupervised feature selection , 2017, Neurocomputing.

[37]  Gustavo Camps-Valls,et al.  Semi-Supervised Graph-Based Hyperspectral Image Classification , 2007, IEEE Transactions on Geoscience and Remote Sensing.

[38]  C. A. Murthy,et al.  Unsupervised Feature Selection Using Feature Similarity , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[39]  Christos Boutsidis,et al.  Deterministic Feature Selection for K-Means Clustering , 2011, IEEE Transactions on Information Theory.

[40]  Witold Pedrycz,et al.  Unsupervised feature selection via maximum projection and minimum redundancy , 2015, Knowl. Based Syst..

[41]  Carla E. Brodley,et al.  Feature Selection for Unsupervised Learning , 2004, J. Mach. Learn. Res..

[42]  Xinwang Liu,et al.  Unsupervised feature selection via latent representation learning and manifold regularization , 2019, Neural Networks.

[43]  Howard J. Hamilton,et al.  Interestingness measures for data mining: A survey , 2006, CSUR.

[44]  Jiawei Han,et al.  Joint Feature Selection and Subspace Learning , 2011, IJCAI.

[45]  Richa Singh,et al.  Adaptive latent fingerprint segmentation using feature selection and random decision forest classification , 2017, Inf. Fusion.

[46]  Aristidis Likas,et al.  Bayesian feature and model selection for Gaussian mixture models , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[47]  Marco Cristani,et al.  Infinite Feature Selection , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[48]  Qinghua Hu,et al.  Non-convex regularized self-representation for unsupervised feature selection , 2015, Image Vis. Comput..

[49]  Chao Xu,et al.  Autoencoder Inspired Unsupervised Feature Selection , 2017, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[50]  Chang Tang,et al.  Gene selection for microarray data classification via subspace learning and manifold regularization , 2017, Medical & Biological Engineering & Computing.

[51]  Witold Pedrycz,et al.  Global and local structure preserving sparse subspace learning: An iterative approach to unsupervised feature selection , 2015, Pattern Recognit..

[52]  Xinwang Liu,et al.  Feature Selective Projection with Low-Rank Embedding and Dual Laplacian Regularization , 2020, IEEE Transactions on Knowledge and Data Engineering.

[53]  R. Abseher,et al.  Microarray gene expression profiling of B-cell chronic lymphocytic leukemia subgroups defined by genomic aberrations and VH mutation status. , 2004, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[54]  M. McPherson,et al.  Birds of a Feather: Homophily in Social Networks , 2001 .

[55]  Lei Wang,et al.  On Similarity Preserving Feature Selection , 2013, IEEE Transactions on Knowledge and Data Engineering.

[56]  Pichao Wang,et al.  Adaptive Hypergraph Embedded Semi-Supervised Multi-Label Image Annotation , 2019, IEEE Transactions on Multimedia.

[57]  Ludovic Denoyer,et al.  Learning latent representations of nodes for classifying in heterogeneous social networks , 2014, WSDM.

[58]  Lior Wolf,et al.  Feature selection for unsupervised and supervised inference: the emergence of sparsity in a weighted-based approach , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[59]  Peng Liu,et al.  Link the remote sensing big data to the image features via wavelet transformation , 2016, Cluster Computing.

[60]  Y. Munakata,et al.  Active versus latent representations: a neural network model of perseveration, dissociation, and decalage. , 2002, Developmental psychobiology.

[61]  Pichao Wang,et al.  Consensus learning guided multi-view unsupervised feature selection , 2018, Knowl. Based Syst..

[62]  Ronghua Shang,et al.  Subspace learning-based graph regularized feature selection , 2016, Knowl. Based Syst..

[63]  Nicoletta Dessì,et al.  Exploiting the ensemble paradigm for stable feature selection: A case study on high-dimensional genomic data , 2017, Inf. Fusion.

[64]  Gang Hua,et al.  Hyperspectral Image Classification Through Bilayer Graph-Based Learning , 2014, IEEE Transactions on Image Processing.

[65]  Huan Liu,et al.  Feature Selection with Linked Data in Social Media , 2012, SDM.

[66]  Xindong Wu,et al.  Feature selection using hierarchical feature clustering , 2011, CIKM '11.

[67]  Gang Chen,et al.  Comparison and integration of feature reduction methods for land cover classification with RapidEye imagery , 2017, Multimedia Tools and Applications.

[68]  Bernhard Schölkopf,et al.  Learning with Hypergraphs: Clustering, Classification, and Embedding , 2006, NIPS.

[69]  Huan Liu,et al.  Relational learning via latent social dimensions , 2009, KDD.

[70]  Christos Boutsidis,et al.  Unsupervised Feature Selection for the $k$-means Clustering Problem , 2009, NIPS.

[71]  George D. C. Cavalcanti,et al.  META-DES.Oracle: Meta-learning and feature selection for dynamic ensemble selection , 2017, Inf. Fusion.

[72]  Pichao Wang,et al.  Online human action recognition based on incremental learning of weighted covariance descriptors , 2018, Inf. Sci..

[73]  Thomas P. Trappenberg,et al.  A biological mechanism for Bayesian feature selection: Weight decay and raising the LASSO , 2015, Neural Networks.

[74]  Huan Liu,et al.  Feature selection for classification: A review , 2014 .

[75]  Anil K. Jain,et al.  Simultaneous feature selection and clustering using mixture models , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[76]  Qinghua Zheng,et al.  Adaptive Unsupervised Feature Selection With Structure Regularization , 2018, IEEE Transactions on Neural Networks and Learning Systems.

[77]  Xuelong Li,et al.  Unsupervised Feature Selection with Structured Graph Optimization , 2016, AAAI.

[78]  Lina Yao,et al.  Unsupervised Feature Analysis with Class Margin Optimization , 2015, ECML/PKDD.

[79]  Petros Drineas,et al.  Feature Selection for Ridge Regression with Provable Guarantees , 2016, Neural Computation.