GT-kernelPLS: Game theory based hybrid gene selection method for microarray data classification

Gene selection prior to classification has been an important topic in bioinformatics, since last decade. Small sample size and high dimensionality in microarray data pose great challenges for performing efficient classification. In this paper we propose efficient hybrid method (GTkernelPLS) with a combination of wrapper like technique coalitional game theory and kernel partial least square (kernelPLS) filter method. Experimental results on ten microarray data sets ensure that GTkernelPLS achieve higher accuracy than several state of the art feature selection methods, and it exhibits a reasonable execution time, even for the data sets having more than twenty thousand genes.

[1]  Huan Liu,et al.  Efficient Feature Selection via Analysis of Relevance and Redundancy , 2004, J. Mach. Learn. Res..

[2]  M. Xiong,et al.  Biomarker Identification by Feature Wrappers , 2022 .

[3]  Shiquan Sun,et al.  A Kernel-Based Multivariate Feature Selection Method for Microarray Data Classification , 2014, PloS one.

[4]  BMC Bioinformatics , 2005 .

[5]  김삼묘,et al.  “Bioinformatics” 특집을 내면서 , 2000 .

[6]  N. Ramaraj,et al.  A novel hybrid feature selection via Symmetrical Uncertainty ranking based local memetic search algorithm , 2010, Knowl. Based Syst..

[7]  Paul Horton,et al.  Network-based de-noising improves prediction from microarray data , 2006, BMC Bioinformatics.

[8]  Eytan Ruppin,et al.  Feature Selection via Coalitional Game Theory , 2007, Neural Computation.

[9]  Ramón Díaz-Uriarte,et al.  Gene selection and classification of microarray data using random forest , 2006, BMC Bioinformatics.

[10]  Yi Liu,et al.  FS_SFS: A novel feature selection method for support vector machines , 2006, Pattern Recognit..

[11]  Marko Robnik-Sikonja,et al.  An adaptation of Relief for attribute estimation in regression , 1997, ICML.

[12]  Thomas A. Darden,et al.  Gene selection for sample classification based on gene expression data: study of sensitivity to choice of parameters of the GA/KNN method , 2001, Bioinform..

[13]  Fuhui Long,et al.  Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy , 2003, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[14]  Ron Kohavi,et al.  Wrappers for Feature Subset Selection , 1997, Artif. Intell..