Comparison of Support Vector Machine Recursive Feature Elimination and Kernel Function as feature selection using Support Vector Machine for lung cancer classification

Cancer is the uncontrolled growth of abnormal cell that need a proper treatment. Cancer is second leading cause of death according to the World Health Organization in 2018. There are more than 120 types of cancer, one of them is lung cancer. Cancer classification has been able to maximize diagnosis, treatment, and management of cancer. Many studies have examined the classification of cancer using microarrays data. Microarray data consists of thousands of features (genes) but only has dozens or hundreds of samples. This can reduce the accuracy of classification so that the selection of features is needed before the classification process. In this research, the feature selection methods are Support Vector Machine Recursive Feature Elimination (SVM-RFE) and Kernel Function and the classification method is Support Vector Machine (SVM). The results showed SVM using SVM-RFE as feature selection is better than SVM method without using feature selection and Gaussian Kernel Function.

[1]  C. Carpenter,et al.  DNA methylation analysis: a powerful new tool for lung cancer diagnosis , 2002, Oncogene.

[2]  J. Mesirov,et al.  Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. , 1999, Science.

[3]  Jason Weston,et al.  Gene Selection for Cancer Classification using Support Vector Machines , 2002, Machine Learning.

[4]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[5]  C. Mountain,et al.  Regional lymph node classification for lung cancer staging. , 1997, Chest.

[6]  Bernhard Schölkopf,et al.  Nonlinear Component Analysis as a Kernel Eigenvalue Problem , 1998, Neural Computation.

[7]  Yanqing Zhang,et al.  Recursive Fuzzy Granulation for Gene Subsets Extraction and Cancer Classification , 2008, IEEE Transactions on Information Technology in Biomedicine.

[8]  Nello Cristianini,et al.  An Introduction to Support Vector Machines and Other Kernel-based Learning Methods , 2000 .

[9]  M. H. Shaheed,et al.  Cancer classification using clustering based gene selection and artificial neural networks , 2011, The 2nd International Conference on Control, Instrumentation and Automation.

[10]  E. C. Hammond,et al.  Smoking and lung cancer: recent evidence and a discussion of some questions. 1959. , 2009, International journal of epidemiology.

[11]  F. Azuaje,et al.  Multiple SVM-RFE for gene selection in cancer classification with expression data , 2005, IEEE Transactions on NanoBioscience.

[12]  Nello Cristianini,et al.  An Introduction to Support Vector Machines and Other Kernel-based Learning Methods , 2000 .