Ensemble Method Using Correlation-Based Feature Selection with Stratified Sampling for Classification

Ensemble methods are preferred as they represent good significance over specific predictor regarding accuracy and confidence in classification. This paper proposes here the ensemble method with multiple independent feature subsets in order to classify high-dimensional data in the area of the biomedicine using Correlation feature selection with Stratified Sampling and Radial Basis Functions Neural Network. First, the method selects the feature subsets using Correlation-based feature Selection with Stratified Sampling. It minimizes the redundancy in the features. After generating the feature subsets, each feature subset is trained using base classifier and then these results are combined using majority voting. The proposed method uses CFS-SS in ensemble classification method.

[1]  Jiri Matas,et al.  On Combining Classifiers , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[2]  Jason Weston,et al.  Gene Selection for Cancer Classification using Support Vector Machines , 2002, Machine Learning.

[3]  Keun Ho Ryu,et al.  An ensemble correlation-based gene selection algorithm for cancer classification with gene expression data , 2012, Bioinform..

[4]  Aboul Ella Hassanien,et al.  Ensemble-based classifiers for prostate cancer diagnosis , 2013, 2013 9th International Computer Engineering Conference (ICENCO).

[5]  Ashfaqur Rahman,et al.  Ensemble classifier generation using non-uniform layered clustering and Genetic Algorithm , 2013, Knowl. Based Syst..

[6]  Yong Deng,et al.  A novel feature selection method based on CFS in cancer recognition , 2012, 2012 IEEE 6th International Conference on Systems Biology (ISB).

[7]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[8]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[9]  Yoav Freund,et al.  Experiments with a New Boosting Algorithm , 1996, ICML.

[10]  Wlodzislaw Duch,et al.  A Kolmogorov-Smirnov Correlation-Based Filter for Microarray Data , 2007, ICONIP.