A robust graph-based semi-supervised sparse feature selection method

Abstract Feature selection is used for excluding redundant features and enhancing learning performance. Abundant unlabeled data are existed in many applications which can be used in semi-supervised feature selection. Semi-supervised sparse feature selection methods have been presented to apply labeled and unlabeled data and consider the correlation among features. Most of the sparse methods utilize square-norm based loss function which is sensitive to outliers. In this paper, we propose a robust Graph-based Semi-Supervised Sparse Feature Selection (GS3FS) method based on the mixed convex and non-convex l2,p-norm (0

[1]  K. Roy,et al.  On Two Novel Parameters for Validation of Predictive QSAR Models , 2009, Molecules.

[2]  Mohammad Ali Zare Chahooki,et al.  Feature selection based on graph Laplacian by using compounds with known and unknown activities , 2017 .

[3]  Changsheng Xu,et al.  A Generic Framework for Video Annotation via Semi-Supervised Learning , 2012, IEEE Transactions on Multimedia.

[4]  Yi Yang,et al.  Semisupervised Feature Selection via Spline Regression for Video Semantic Recognition , 2015, IEEE Transactions on Neural Networks and Learning Systems.

[5]  A. Haji Agha Bozorgi,et al.  A structure–activity relationship survey of histone deacetylase (HDAC) inhibitors , 2012 .

[6]  Mohammad Ali Zare Chahooki,et al.  A Survey on semi-supervised feature selection methods , 2017, Pattern Recognit..

[7]  Li Jiang,et al.  Feature extraction based on semi-supervised kernel Marginal Fisher analysis and its application in bearing fault diagnosis , 2013 .

[8]  Wang Yao,et al.  L 1/2 regularization , 2010 .

[9]  C. Grady,et al.  Event-related fMRI studies of episodic encoding and retrieval: Meta-analyses using activation likelihood estimation , 2009, Neuropsychologia.

[10]  Alexander Zien,et al.  Semi-Supervised Learning , 2006 .

[11]  Deng Cai,et al.  Laplacian Score for Feature Selection , 2005, NIPS.

[12]  Songcan Chen,et al.  $l_{2,p}$ Matrix Norm and Its Application in Feature Selection , 2013, ArXiv.

[13]  Jiamin Liu,et al.  Enhanced semi-supervised local Fisher discriminant analysis for face recognition , 2012, Future Gener. Comput. Syst..

[14]  Nicu Sebe,et al.  Web Image Annotation Via Subspace-Sparsity Collaborated Feature Selection , 2012, IEEE Transactions on Multimedia.

[15]  Xiaojun Chang,et al.  Semisupervised Feature Analysis by Mining Correlations Among Multiple Tasks , 2014, IEEE Transactions on Neural Networks and Learning Systems.

[16]  Jian Zhang,et al.  Semi-supervised feature selection based on local discriminative information , 2016, Neurocomputing.

[17]  Yan Zhang,et al.  Semi-supervised local multi-manifold Isomap by linear embedding for feature extraction , 2018, Pattern Recognit..

[18]  Mingze Li,et al.  Forest type identification by random forest classification combined with SPOT and multitemporal SAR data , 2017, Journal of Forestry Research.

[19]  Ferat Sahin,et al.  A survey on feature selection methods , 2014, Comput. Electr. Eng..

[20]  Diego R. Amancio,et al.  Probing the Topological Properties of Complex Networks Modeling Short Written Texts , 2014, PloS one.

[21]  Razieh Sheikhpour,et al.  Semi-supervised sparse feature selection via graph Laplacian based scatter matrix for regression problems , 2018, Inf. Sci..

[22]  Yvan Vander Heyden,et al.  Towards better understanding of feature-selection or reduction techniques for Quantitative Structure–Activity Relationship models , 2013 .

[23]  Li Yan,et al.  Geometric-constrained multi-view image matching method based on semi-global optimization , 2018, Geo spatial Inf. Sci..

[24]  Qiuqi Ruan,et al.  Sparse feature selection based on graph Laplacian for web image annotation , 2014, Image Vis. Comput..

[25]  Hossam Faris,et al.  Asynchronous accelerating multi-leader salp chains for feature selection , 2018, Appl. Soft Comput..

[26]  David J. Sheskin,et al.  Handbook of Parametric and Nonparametric Statistical Procedures , 1997 .

[27]  Zoubin Ghahramani,et al.  Combining active learning and semi-supervised learning using Gaussian fields and harmonic functions , 2003, ICML 2003.

[28]  Gholamreza Haffari,et al.  Transductive learning for statistical machine translation , 2007, ACL.

[29]  Bahram Hemmateenejad,et al.  Quantitative structure-retention relationship for the Kovats retention indices of a large set of terpenes: a combined data splitting-feature selection strategy. , 2007, Analytica chimica acta.

[30]  Ivor W. Tsang,et al.  Flexible Manifold Embedding: A Framework for Semi-Supervised and Unsupervised Dimension Reduction , 2010, IEEE Transactions on Image Processing.

[31]  Ethem Alpaydin,et al.  Introduction to machine learning , 2004, Adaptive computation and machine learning.

[32]  Feiping Nie,et al.  Efficient and Robust Feature Selection via Joint ℓ2, 1-Norms Minimization , 2010, NIPS.

[33]  Hossam Faris,et al.  An evolutionary gravitational search-based feature selection , 2019, Inf. Sci..

[34]  Nicu Sebe,et al.  Exploiting the entire feature space with sparsity for automatic image annotation , 2011, ACM Multimedia.

[35]  Qiuqi Ruan,et al.  Hessian Semi-Supervised Sparse Feature Selection Based on ${L_{2,1/2}}$ -Matrix Norm , 2015, IEEE Transactions on Multimedia.

[36]  Razieh Sheikhpour,et al.  A combined Fisher and Laplacian score for feature selection in QSAR based drug design using compounds with known and unknown activities , 2018, Journal of Computer-Aided Molecular Design.

[37]  Tommy W. S. Chow,et al.  Trace ratio criterion based generalized discriminative learning for semi-supervised dimensionality reduction , 2012, Pattern Recognit..

[38]  David J. Kriegman,et al.  Eigenfaces vs. Fisherfaces: Recognition Using Class Specific Linear Projection , 1996, ECCV.

[39]  Mikhail Belkin,et al.  Manifold Regularization: A Geometric Framework for Learning from Labeled and Unlabeled Examples , 2006, J. Mach. Learn. Res..

[40]  Michel Verleysen,et al.  Graph Laplacian for Semi-supervised Feature Selection in Regression Problems , 2011, IWANN.

[41]  Hossam Faris,et al.  Efficient Hybrid Nature-Inspired Binary Optimizers for Feature Selection , 2019, Cognitive Computation.

[42]  Stephen P. Boyd,et al.  Enhancing Sparsity by Reweighted ℓ1 Minimization , 2007, 0711.1612.

[43]  Michel Verleysen,et al.  A graph Laplacian based approach to semi-supervised feature selection for regression problems , 2013, Neurocomputing.

[44]  Xizhao Wang,et al.  Sensitivity analysis on initial classifier accuracy in fuzziness based semi-supervised learning , 2019, Inf. Sci..

[45]  Ronen Feldman,et al.  Using Corpus Statistics on Entities to Improve Semi-supervised Relation Extraction from the Web , 2007, ACL.

[46]  Nicu Sebe,et al.  Discriminating Joint Feature Analysis for Multimedia Data Understanding , 2012, IEEE Transactions on Multimedia.

[47]  Guandong Xu,et al.  Constrained nonnegative matrix factorization-based semi-supervised multilabel learning , 2019, Int. J. Mach. Learn. Cybern..

[48]  Jing-Yu Yang,et al.  Semi-supervised linear discriminant analysis for dimension reduction and classification , 2016, Pattern Recognit..

[49]  Hossam Faris,et al.  Evolutionary Population Dynamics and Grasshopper Optimization approaches for feature selection problems , 2017, Knowl. Based Syst..