Improvement of SVM Algorithm for Microarray Analysis Using Intelligent Parameter Selection

Identification of genetic markers is a crucial step in the diagnosis, prognosis, and treatment of disease. This paper focuses on the application of a supervised classification technique, support vector machines (SVM), to high dimensional microarrays for marker identification. A case study of renal cell carcinoma (RCC) is used here to demonstrate and test the ability of SVMs to identify real biological markers. SVMs are known to be suitable for high dimensional microarray data and are able to classify non-linear relationships in the data through the use of kernel functions specific to the datasets. This paper compares multiple SVM kernel functions, both linear and nonlinear, to determine which form is best suited for a particular dataset. Additionally, each SVM is tested across a range of parameters and normalization schemes to further identify a specific optimal classifier. Empirical results are then used to determine the optimum parameters for the SVM to efficiently find biologically important predictive markers for differentiation between RCC subtypes for the purpose of diagnosis and prognosis

[1]  A. Jemal,et al.  Cancer Statistics, 2004 , 2004, CA: a cancer journal for clinicians.

[2]  Nello Cristianini,et al.  An Introduction to Support Vector Machines and Other Kernel-based Learning Methods , 2000 .

[3]  R. Figlin,et al.  Pathobiology, prognosis, and targeted therapy for renal cell carcinoma: exploiting the hypoxia-induced pathway. , 2003, Clinical cancer research : an official journal of the American Association for Cancer Research.

[4]  Mahul B. Amin,et al.  Prognostic Impact of Histologic Subtyping of Adult Renal Epithelial Neoplasms: An Experience of 405 Cases , 2002, The American journal of surgical pathology.

[5]  R. Ve,et al.  Contemporary approach to the classification of renal epithelial tumors. , 2000 .

[6]  Mark A. Rubin,et al.  Expression profiling of human renal carcinomas with functional taxonomic analysis , 2002, BMC Bioinformatics.

[7]  Y. Homma,et al.  INCREASED INCIDENTAL DETECTION AND REDUCED MORTALITY IN RENAL CANCER—RECENT RETROSPECTIVE ANALYSIS AT EIGHT INSTITUTIONS , 1995, International journal of urology : official journal of the Japanese Urological Association.

[8]  V. Reuter,et al.  Contemporary approach to the classification of renal epithelial tumors. , 2000, Seminars in oncology.

[9]  Constantin F. Aliferis,et al.  A comprehensive evaluation of multicategory classification methods for microarray gene expression cancer diagnosis , 2004, Bioinform..

[10]  R. Figlin,et al.  Improved prognostication of renal cell carcinoma using an integrated staging system. , 2001, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[11]  Chih-Jen Lin,et al.  Radius Margin Bounds for Support Vector Machines with the RBF Kernel , 2002, Neural Computation.

[12]  M. S. Brown,et al.  Support Vector Machine Classification of Microarray from Gene Expression Data , 1999 .