Application of Support Vector Machines in Virtual Screening

Traditionally drug discovery has been a labor intensive effort, since it is difficult to identify a possible drug candidate from an extremely large small molecule library for any given target. Most of the small molecules fail to show any activity against the target because of electrochemical, structural and other incompatibilities. Virtual screening is an in-silico approach to identify drug candidates which are unlikely to show any activity against a given target, thus reducing an enormous amount of experimentation which is most likely to end up as failures. Important approaches in virtual screening have been through docking studies and using classification techniques. Support vector machines based classifiers, based on the principles of statistical learning theory have found several applications in virtual screening. In this paper, first the theory and main principles of SVM are briefly outlined. Thereafter a few successful applications of SVM in virtual screening have been discussed. It further underlines the pitfalls of the existing approaches and highlights the area which needs further contribution to improve the state of the art for application of SVM in virtual screening.

[1]  Michael K. Gilson,et al.  Virtual Screening of Molecular Databases Using a Support Vector Machine , 2005, J. Chem. Inf. Model..

[2]  Ujjwal Maulik,et al.  Active Site Driven Ligand Design: an Evolutionary Approach , 2005, J. Bioinform. Comput. Biol..

[3]  Samy O. Meroueh,et al.  Docking to Erlotinib Off-Targets Leads to Inhibitors of Lung Cancer Cell Proliferation with Suitable in Vitro Pharmacokinetics , 2010 .

[4]  V. Vapnik Estimation of Dependences Based on Empirical Data , 2006 .

[5]  Gunnar Rätsch,et al.  Classifying 'Drug-likeness' with Kernel-Based Learning Methods , 2005, J. Chem. Inf. Model..

[6]  Gareth Jones,et al.  A genetic algorithm for flexible molecular overlay and pharmacophore elucidation , 1995, J. Comput. Aided Mol. Des..

[7]  Gisbert Schneider,et al.  SVM-Based Feature Selection for Characterization of Focused Compound Collections , 2004, J. Chem. Inf. Model..

[8]  James A. Foster,et al.  Evolving Molecules for Drug Design Using Genetic Algorithms via Molecular Trees , 2000, GECCO.

[9]  Jun Huan,et al.  Graph wavelet alignment kernels for drug virtual screening. , 2008, Computational systems bioinformatics. Computational Systems Bioinformatics Conference.

[10]  Christian Igel,et al.  Active learning with support vector machines , 2014, WIREs Data Mining Knowl. Discov..

[11]  Thomas Lengauer,et al.  Novel technologies for virtual screening. , 2004, Drug discovery today.

[12]  Liwei Li,et al.  Target-Specific Support Vector Machine Scoring in Structure-Based Virtual Screening: Computational Validation, In Vitro Testing in Kinases, and Effects on Lung Cancer Cell Proliferation , 2011, J. Chem. Inf. Model..

[13]  Tudor I. Oprea On the information content of 2D and 3D descriptors for QSAR , 2002 .

[14]  David J. Diller,et al.  Use of Catalyst Pharmacophore Models for Screening of Large Combinatorial Libraries , 2002, J. Chem. Inf. Comput. Sci..

[15]  Sanghamitra Bandyopadhyay,et al.  IVGA3D: De novo ligand design using a variable sized tree representation. , 2010, Protein and peptide letters.

[16]  Sanghamitra Bandyopadhyay,et al.  Evolving fragments to lead molecules , 2010 .

[17]  Nanda Ghoshal,et al.  3-D-QSAR of N-substituted 4-amino-3,3-dialkyl-2(3H)-furanone GABA receptor modulators using molecular field analysis and receptor surface modelling study. , 2004, Bioorganic & medicinal chemistry letters.

[18]  Igor V. Pletnev,et al.  Drug Discovery Using Support Vector Machines. The Case Studies of Drug-likeness, Agrochemical-likeness, and Enzyme Inhibition Predictions , 2003, J. Chem. Inf. Comput. Sci..

[19]  Osman F. Güner,et al.  Pharmacophore perception, development, and use in drug design , 2000 .

[20]  Bernard F. Buxton,et al.  Drug Design by Machine Learning: Support Vector Machines for Pharmaceutical Data Analysis , 2001, Comput. Chem..

[21]  Weifeng Liu,et al.  Adaptive and Learning Systems for Signal Processing, Communication, and Control , 2010 .

[22]  Kimito Funatsu,et al.  GA Strategy for Variable Selection in QSAR Studies: GA-Based Region Selection for CoMFA Modeling , 1998, J. Chem. Inf. Comput. Sci..

[23]  James M. Briggs,et al.  Comparative molecular field analysis (CoMFA) study of epothilones – tubulin depolymerization inhibitors: Pharmacophore development using 3D QSAR methods , 2001, J. Comput. Aided Mol. Des..

[24]  J. Scott Dixon,et al.  Flexible ligand docking using a genetic algorithm , 1995, J. Comput. Aided Mol. Des..

[25]  Hugo Kubinyi,et al.  Free Wilson Analysis. Theory, Applications and its Relationship to Hansch Analysis , 1988 .

[26]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[27]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[28]  J M Blaney,et al.  A geometric approach to macromolecule-ligand interactions. , 1982, Journal of molecular biology.

[29]  Fei Liu,et al.  Pharmacophore identification of KSP inhibitors. , 2007, Bioorganic & medicinal chemistry letters.

[30]  Jean-Philippe Vert,et al.  Virtual screening of GPCRs: An in silico chemogenomics approach , 2008, BMC Bioinformatics.

[31]  Haifeng Chen,et al.  Comparative Study of QSAR/QSPR Correlations Using Support Vector Machines, Radial Basis Function Neural Networks, and Multiple Linear Regression , 2004, J. Chem. Inf. Model..

[32]  Tudor I. Oprea,et al.  On the Information Content of 2 D and 3 D Descriptors for QSAR , 2002 .

[33]  Irwin D. Kuntz,et al.  A genetic algorithm for structure-based de novo design , 2001, J. Comput. Aided Mol. Des..

[34]  Bernard F. Buxton,et al.  Support Vector Machines in Combinatorial Chemistry , 2001 .

[35]  Sean B. Holden,et al.  Support Vector Machines for ADME Property Classification , 2003 .

[36]  Thomas Hofmann,et al.  Predicting CNS Permeability of Drug Molecules: Comparison of Neural Network and Support Vector Machine Algorithms , 2002, J. Comput. Biol..

[37]  K. P. Soman,et al.  Wavelet Assignment Graph Kernel for Drug Virtual Screening , 2009, 2009 International Conference on Advances in Recent Technologies in Communication and Computing.