Predict potential drug targets from the ion channel proteins based on SVM.

The identification of molecular targets is a critical step in the drug discovery and development process. Ion channel proteins are highly attractive drug targets implicated in a diverse range of disorders, particularly those of the cardiovascular and central nervous systems. Owing to the limits of experimental techniques and the low-throughput nature of patch-clamp electrophysiology, they remain a target class waiting to be exploited. In this study, we combined three types of protein features (primary sequence, secondary structure and subcellular localization) to predict potential drug targets among ion channel proteins using the classical support vector machine (SVM) method. The prediction comprised two stages. In stage 1, we predicted ion channel target proteins based on the characteristics of target proteins across the whole genome: we first performed feature selection with the Mann-Whitney U test, then used an SVM to identify potential ion channel targets, and designed a new evaluation indicator, Q, to prioritize the results. In stage 2, we made predictions based on the characteristics of known ion channel target proteins, using a genetic algorithm to select features and an SVM to predict ion channel targets. We then integrated the results of the two stages and found that five ion channel proteins appeared in both sets of predictions, including the cGMP-gated cation channel beta subunit and the gamma-aminobutyric acid receptor subunit alpha-5; four of the five are related to nervous system diseases. These results suggest that the five proteins are potential targets for drug discovery and that our prediction strategies are effective.
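The stage 1 feature filter and the final integration step can be illustrated with a minimal, standard-library-only sketch. It is not the authors' implementation: the significance threshold `alpha`, the toy feature values and the protein identifiers are hypothetical, and the p-value uses the normal approximation without tie or continuity correction. In a full pipeline, the selected feature columns would then be passed to an SVM classifier.

```python
import math

def mann_whitney_p(a, b):
    """Two-sided Mann-Whitney U p-value (normal approximation,
    no tie or continuity correction; an illustrative simplification)."""
    pooled = sorted((v, g) for g, vals in enumerate((a, b)) for v in vals)
    rank_sum_a = 0.0
    i = 0
    while i < len(pooled):
        j = i
        while j < len(pooled) and pooled[j][0] == pooled[i][0]:
            j += 1
        avg_rank = (i + 1 + j) / 2  # mean of ranks i+1 .. j for tied values
        rank_sum_a += avg_rank * sum(1 for k in range(i, j) if pooled[k][1] == 0)
        i = j
    n1, n2 = len(a), len(b)
    u = rank_sum_a - n1 * (n1 + 1) / 2
    mean = n1 * n2 / 2
    sd = math.sqrt(n1 * n2 * (n1 + n2 + 1) / 12)
    z = (u - mean) / sd
    return math.erfc(abs(z) / math.sqrt(2))

def select_features(pos, neg, alpha=0.05):
    """Return indices of features whose distributions differ between
    target (pos) and non-target (neg) proteins at level alpha."""
    n_features = len(pos[0])
    keep = []
    for f in range(n_features):
        p = mann_whitney_p([row[f] for row in pos], [row[f] for row in neg])
        if p < alpha:
            keep.append(f)
    return keep

# Hypothetical toy data: rows are proteins, columns are two features.
# Feature 0 separates the classes; feature 1 is interleaved.
targets = [[5.1, 1], [5.3, 4], [5.8, 5], [6.0, 8], [6.2, 9], [6.5, 12]]
non_targets = [[1.0, 2], [1.2, 3], [1.5, 6], [1.7, 7], [2.0, 10], [2.1, 11]]
selected = select_features(targets, non_targets)

# Integration of the two stages: keep proteins predicted by both.
# The identifiers below are placeholders, not results from the paper.
stage1_hits = {"channel_A", "channel_B", "channel_C"}
stage2_hits = {"channel_B", "channel_C", "channel_D"}
consensus = sorted(stage1_hits & stage2_hits)
```

Taking the intersection of the two prediction sets mirrors the integration described above: a protein is reported only when both the whole-genome model and the ion-channel-specific model agree on it.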
