Optimization model for Sub-feature selection in data mining

This paper proposes sub-feature selection based classification in data mining. Many existing feature selection methods are used to analyze traditional feature values as well as class values. But all feature's values don't support to distinguish the unique class from existing datasets, because these selected features are irrelevant towards traditional class. Thus it needs to analyze the feature's tiny data such as sub-feature data which help to find distinguish class. So this paper has considered identifying the new class from usual class. It selects needed sub-features from existing feature's data using optimization model. In this paper, it proposed Lagrangian functional method of optimization model for sub-feature selection. This method finds appropriate sub-feature data based on optimal evaluation value. It demonstrates the experiment using different dataset to get sub-feature data for novel class.

[1]  Jing Liu,et al.  Clustering-Guided Sparse Structural Learning for Unsupervised Feature Selection , 2014, IEEE Transactions on Knowledge and Data Engineering.

[2]  Nikhil R. Pal,et al.  Feature selection with SVD entropy: Some modification and extension , 2014, Inf. Sci..

[3]  Deng Cai,et al.  Laplacian Score for Feature Selection , 2005, NIPS.

[4]  Xin Yao,et al.  A Survey on Evolutionary Computation Approaches to Feature Selection , 2016, IEEE Transactions on Evolutionary Computation.

[5]  Pengfei Wei,et al.  A General Domain Specific Feature Transfer Framework for Hybrid Domain Adaptation , 2019, IEEE Transactions on Knowledge and Data Engineering.

[6]  C. A. Murthy,et al.  Unsupervised Feature Selection Using Feature Similarity , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[7]  Michal Linial,et al.  Novel Unsupervised Feature Filtering of Biological Data , 2006, ISMB.

[8]  Nikhil R. Pal,et al.  Unsupervised Feature Selection with Controlled Redundancy (UFeSCoR) , 2015, IEEE Transactions on Knowledge and Data Engineering.

[9]  Yang Weng,et al.  A Feature Selection Method for High Impedance Fault Detection , 2019, IEEE Transactions on Power Delivery.

[10]  Fernando Jiménez,et al.  Multiobjective Evolutionary Feature Selection for Fuzzy Classification , 2019, IEEE Transactions on Fuzzy Systems.

[11]  Xuelong Li,et al.  Hierarchical Feature Selection for Random Projection , 2019, IEEE Transactions on Neural Networks and Learning Systems.

[12]  Chin-Teng Lin,et al.  Discovery of dominant and dormant genes from expression data using a novel generalization of SNR for multi-class problems , 2008, BMC Bioinformatics.

[13]  Keshab K. Parhi,et al.  MUSE: Minimum Uncertainty and Sample Elimination Based Binary Feature Selection , 2019, IEEE Transactions on Knowledge and Data Engineering.

[14]  Chaoqun Li,et al.  Wrapper Framework for Test-Cost-Sensitive Feature Selection , 2019, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[15]  Daphne Koller,et al.  Toward Optimal Feature Selection , 1996, ICML.

[16]  I. Chung,et al.  Identification of Single- and Multiple-Class Specific Signature Genes from Gene Expression Profiles by Group Marker Index , 2011, PloS one.

[17]  Fuhui Long,et al.  Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy , 2003, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[18]  Hemanta Kumar Bhuyan,et al.  Privacy preserving sub-feature selection in distributed data mining , 2015, Appl. Soft Comput..

[19]  Nicolaj Søndberg-Madsen,et al.  Unsupervised Feature Subset Selection , 2003 .

[20]  Lei Wang,et al.  Feature Selection With Redundancy-Constrained Class Separability , 2010, IEEE Transactions on Neural Networks.

[21]  J. Mesirov,et al.  Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. , 1999, Science.

[22]  Sanjit Kumar Dash,et al.  An Approach for Privacy Preservation of Distributed Data in Peer-to-Peer Network using Multiparty Computation , 2011 .

[23]  Biju Patanaik,et al.  Privacy Preserving for Feature Selection in Data Mining Using Centralized Network , 2012 .

[24]  Nikhil R. Pal A fuzzy rule based approach to identify biomarkers for diagnostic classification of cancers , 2007, 2007 IEEE International Fuzzy Systems Conference.

[25]  Chris H. Q. Ding,et al.  Minimum redundancy feature selection from microarray gene expression data , 2003, Computational Systems Bioinformatics. CSB2003. Proceedings of the 2003 IEEE Bioinformatics Conference. CSB2003.

[26]  Hemanta Kumar Bhuyan,et al.  Privacy preserving sub-feature selection based on fuzzy probabilities , 2014, Cluster Computing.

[27]  Hemanta Kumar Bhuyan,et al.  Pareto-based multi-objective optimization for classification in data mining , 2016, Cluster Computing.

[28]  Sanjit Kumar Dash,et al.  Privacy Preservation with Penalty in Decentralized Network using Multiparty Computation , 2012 .

[29]  Feiping Nie,et al.  Feature Selection via Global Redundancy Minimization , 2015, IEEE Transactions on Knowledge and Data Engineering.

[30]  Huan Liu,et al.  Unsupervised feature selection for linked social media data , 2012, KDD.

[31]  Hemanta Kumar Bhuyan,et al.  Sub-Feature Selection for Novel Classification , 2018, 2018 Second International Conference on Inventive Communication and Computational Technologies (ICICCT).

[32]  Nikhil R. Pal,et al.  Discovering biomarkers from gene expression data for predicting cancer subgroups using neural networks and relational fuzzy clustering , 2007, BMC Bioinformatics.