A group-specific tuning parameter for hybrid of SVM and SCAD in identification of informative genes and pathways

The pathway-based microarray classification approach leads to a new era of genomic research. However, this approach is limited by the issues in quality of pathway data. Usually the pathway data are curated from biological literatures and in specific biological experiment (e.g., lung cancer experiment), context free pathway information collection process takes place leading to the presence of uninformative genes in the pathways. Many methods in this approach neglect these limitations by treating all genes in a pathway as significant. In this paper, we proposed a hybrid of support vector machine and smoothly clipped absolute deviation with group-specific tuning parameters (gSVM-SCAD) to select informative genes within pathways before the pathway evaluation process. Our experiment on canine, gender and lung cancer datasets shows that gSVM-SCAD obtains significant results in identifying significant genes and pathways and in classification accuracy.

[1]  Reuben Lotan,et al.  Increased Retinoic Acid Responsiveness in Lung Carcinoma Cells that Are Nonresponsive Despite the Presence of Endogenous Retinoic Acid Receptor (RAR) β by Expression of Exogenous Retinoid Receptors Retinoid X Receptor α, RARα, and RARγ , 2001 .

[2]  Runze Li,et al.  Tuning parameter selectors for the smoothly clipped absolute deviation method. , 2007, Biometrika.

[3]  Marta Ruiz-Ortega,et al.  TGF-beta signaling in vascular fibrosis. , 2007, Cardiovascular research.

[4]  Jason Tsong-Li Wang,et al.  Kernel design for RNA classification using Support Vector Machines , 2006, Int. J. Data Min. Bioinform..

[5]  T Mirejovský,et al.  [Expression of p53, p21 and bcl-2 in prognosis of lung carcinomas]. , 1999, Ceskoslovenska patologie.

[6]  Raymond R Tubbs,et al.  Prospective Evaluation of TLE1 as a Diagnostic Immunohistochemical Marker in Synovial Sarcoma , 2009, The American journal of surgical pathology.

[7]  Yoshitaka Fujii,et al.  Histone deacetylase 1 mRNA expression in lung cancer. , 2004, Lung cancer.

[8]  W. Hong,et al.  Increased retinoic acid responsiveness in lung carcinoma cells that are nonresponsive despite the presence of endogenous retinoic acid receptor (RAR) beta by expression of exogenous retinoid receptors retinoid X receptor alpha, RAR alpha, and RAR gamma. , 2001, Cancer research.

[9]  R. Caltabiano,et al.  Expression of thyroid transcription factor 1 (TTF-1) in extra thyroidal sites: papillary thyroid carcinoma of branchial cleft cysts and thyroglossal duct cysts and struma ovarii. , 2006, Pathologica.

[10]  Hansong Zhang,et al.  Gacv for support vector machines , 2000 .

[11]  Young Chul Kim,et al.  Identification of polymorphisms in the XIAP gene and analysis of association with lung cancer risk in a Korean population. , 2008, Cancer genetics and cytogenetics.

[12]  G. Wahba Support vector machines, reproducing kernel Hilbert spaces, and randomized GACV , 1999 .

[13]  Monika Kosacka,et al.  [The evaluation of prognostic value of cyclin B1 expression in patients with resected non-small-cell lung cancer stage I-IIIA--preliminary report]. , 2010, Polski merkuriusz lekarski : organ Polskiego Towarzystwa Lekarskiego.

[14]  Chi-Ying F. Huang,et al.  Selection of DDX5 as a novel internal control for Q-RT-PCR from microarray data using a block bootstrap re-sampling scheme , 2007, BMC Genomics.

[15]  Pei-Ting Lee,et al.  7-Chloro-6-piperidin-1-yl-quinoline-5,8-dione (PT-262), a novel synthetic compound induces lung carcinoma cell death associated with inhibiting ERK and CDC2 phosphorylation via a p53-independent pathway , 2008, Cancer Chemotherapy and Pharmacology.

[16]  Yili Yang,et al.  TNF-RII and c-IAP1 mediate ubiquitination and degradation of TRAF2 , 2002, Nature.

[17]  Chang Ho Kim,et al.  Polymorphisms in the Caspase Genes and the Risk of Lung Cancer , 2010, Journal of thoracic oncology : official publication of the International Association for the Study of Lung Cancer.

[18]  Philippe Collas,et al.  The a-Kinase–Anchoring Protein Akap95 Is a Multivalent Protein with a Key Role in Chromatin Condensation at Mitosis , 1999, The Journal of cell biology.

[19]  Jing Nie,et al.  Lung cancer in humans is not associated with lifetime total alcohol consumption or with genetic variation in alcohol dehydrogenase 3 (ADH3). , 2003, The Journal of nutrition.

[20]  Min-Xin Guan,et al.  Pathogenic mutations of nuclear genes associated with mitochondrial disorders. , 2009, Acta biochimica et biophysica Sinica.

[21]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[22]  David L. Wheeler,et al.  GenBank , 2015, Nucleic Acids Res..

[23]  Jason Weston,et al.  Gene Selection for Cancer Classification using Support Vector Machines , 2002, Machine Learning.

[24]  Kaitai Zhang,et al.  Identification of differentially expressed genes in human lung squamous cell carcinoma using suppression subtractive hybridization. , 2004, Cancer letters.

[25]  Alnawaz Rehemtulla,et al.  Nuclear Localized Phosphorylated FADD Induces Cell Proliferation and is Associated with Aggressive Lung Cancer , 2005, Cell cycle.

[26]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[27]  Xi Chen,et al.  Supervised principal component analysis for gene set enrichment of microarray data with continuous or survival outcomes , 2008, Bioinform..

[28]  J. M. Kelley,et al.  Analysis of genetic stability at the EP300 and CREBBP loci in a panel of cancer cell lines , 2003, Genes, chromosomes & cancer.

[29]  Mihaela Campan,et al.  Identification of a panel of sensitive and specific DNA methylation markers for squamous cell lung cancer , 2008, Molecular Cancer.

[30]  Li Li,et al.  HLungDB: an integrated database of human lung cancer research , 2009, Nucleic Acids Res..

[31]  Ulrich Mansmann,et al.  GlobalANCOVA: exploration and assessment of gene group effects , 2008, Bioinform..

[32]  Zengyou He,et al.  Stable Feature Selection for Biomarker Discovery , 2010, Comput. Biol. Chem..

[33]  A M Treston,et al.  Expression in human lung cancer cell lines of genes of prohormone processing and the neuroendocine phenotype , 1996, Journal of cellular biochemistry. Supplement.

[34]  Ming Wu,et al.  Gene module level analysis: identification to networks and dynamics. , 2008, Current opinion in biotechnology.

[35]  R. Ramlau,et al.  Serum thrombopoietin levels in patients with reactive thrombocytosis due to lung cancer and in patients with essential thrombocythemia. , 2003, Neoplasma.

[36]  Biao He,et al.  Wnt signaling in lung cancer. , 2005, Cancer letters.

[37]  Axel Benner,et al.  penalizedSVM: a R-package for feature selection SVM classification , 2009, Bioinform..

[38]  Hongyu Zhao,et al.  Acute Drug-Induced Vascular Injury in Beagle Dogs: Pathology and Correlating Genomic Expression , 2006, Toxicologic pathology.

[39]  Boris Zhivotovsky,et al.  Expression of inhibitor of apoptosis proteins in small- and non-small-cell lung carcinoma cells. , 2002, Experimental cell research.

[40]  Vasilis Vasiliou,et al.  Arachidonic acid suppresses growth of human lung tumor A549 cells through down-regulation of ALDH3A1 expression. , 2006, Free radical biology & medicine.

[41]  Jianqing Fan,et al.  Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties , 2001 .

[42]  E. Lander,et al.  Classification of human lung carcinomas by mRNA expression profiling reveals distinct adenocarcinoma subclasses , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[43]  Emmanouil T Dermitzakis,et al.  Large-Scale Population Study of Human Cell Lines Indicates that Dosage Compensation Is Virtually Complete , 2007, PLoS genetics.

[44]  Wei Pan,et al.  Incorporating prior knowledge of gene functional groups into regularized discriminant analysis of microarray data , 2007, Bioinform..

[45]  H. Yoshiji,et al.  pp60c-src activation in lung adenocarcinoma. , 2003, European journal of cancer.

[46]  Xiaodong Lin,et al.  Gene expression Gene selection using support vector machines with non-convex penalty , 2005 .

[47]  Takashi Nakashima,et al.  Wnt1 overexpression promotes tumour progression in non-small cell lung cancer. , 2008, European journal of cancer.

[48]  Xiwu Lin,et al.  Smoothing spline ANOVA models for large data sets with Bernoulli observations and the randomized GACV , 2000 .

[49]  Wei Pan,et al.  Incorporating prior knowledge of predictors into penalized classifiers with multiple penalty terms , 2007, Bioinform..

[50]  Hongyu Zhao,et al.  Pathway analysis using random forests classification and regression , 2006, Bioinform..

[51]  Datong Zheng,et al.  Lung cancer risk associated with Thr495Pro polymorphism of GHR in Chinese population. , 2008, Japanese journal of clinical oncology.

[52]  Marta Ruiz-Ortega,et al.  TGF-β signaling in vascular fibrosis , 2007 .