The Virtual Screening of the Drug Protein with a Few Crystal Structures Based on the Adaboost-SVM

Machine learning has proven to be an effective aid to virtual screening (VS). However, the training set can be contaminated with incorrect docking poses, which lowers its quality and, in turn, the screening efficiency. To address this problem, we present a method that uses ensemble learning to improve the support vector machine (SVM) that processes the generated protein-ligand interaction fingerprints (IFPs). By combining multiple classifiers, ensemble learning avoids the performance limitations of a single classifier and achieves better generalization. In virtual screening experiments with SRC and Cathepsin K as targets, the ensemble learning method effectively reduced the errors caused by low sample quality and improved the performance of the whole virtual screening process.
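The idea of boosting an SVM on interaction fingerprints can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes that fingerprints are plain bit vectors, that labels are +1 (active) and -1 (inactive), and that a small linear SVM trained by subgradient descent can stand in for the SVM used in the paper. The function names, the toy data generator, the label-noise rate (meant to mimic wrong docking poses), and all hyperparameters are hypothetical.

```python
import math
import random


def train_linear_svm(X, y, sample_weights, epochs=50, lam=0.01, lr=0.1):
    """Weighted linear SVM: subgradient descent on the hinge loss (stand-in SVM)."""
    n, d = len(X), len(X[0])
    w, b = [0.0] * d, 0.0
    for _ in range(epochs):
        for xi, yi, si in zip(X, y, sample_weights):
            margin = yi * (sum(wj * xj for wj, xj in zip(w, xi)) + b)
            w = [wj * (1.0 - lr * lam) for wj in w]  # L2 regularization step
            if margin < 1.0:
                # Scale by n so that uniform weights (1/n) give a plain step of lr.
                step = lr * si * n * yi
                w = [wj + step * xj for wj, xj in zip(w, xi)]
                b += step
    return w, b


def svm_predict(model, x):
    w, b = model
    return 1 if sum(wj * xj for wj, xj in zip(w, x)) + b >= 0 else -1


def adaboost_svm(X, y, rounds=10):
    """Discrete AdaBoost: reweight samples so later SVMs focus on earlier mistakes."""
    n = len(X)
    weights = [1.0 / n] * n
    ensemble = []  # list of (alpha, model) pairs
    for _ in range(rounds):
        model = train_linear_svm(X, y, weights)
        preds = [svm_predict(model, x) for x in X]
        err = sum(wi for wi, p, yi in zip(weights, preds, y) if p != yi)
        if err >= 0.5:  # no better than chance on the weighted set: stop
            break
        err = max(err, 1e-10)  # avoid division by zero for a perfect learner
        alpha = 0.5 * math.log((1.0 - err) / err)
        ensemble.append((alpha, model))
        weights = [wi * math.exp(-alpha * yi * p)
                   for wi, p, yi in zip(weights, preds, y)]
        total = sum(weights)
        weights = [wi / total for wi in weights]
    return ensemble


def ensemble_predict(ensemble, x):
    """Weighted vote of all member SVMs."""
    score = sum(alpha * svm_predict(model, x) for alpha, model in ensemble)
    return 1 if score >= 0 else -1


def make_toy_fingerprints(n=60, bits=8, noise=0.1, seed=0):
    """Hypothetical toy IFPs; a fraction of labels is flipped to mimic bad poses."""
    rng = random.Random(seed)
    half = bits // 2
    X, y = [], []
    for i in range(n):
        label = 1 if i % 2 == 0 else -1
        # Actives tend to set the first half of the bits, inactives the second.
        fp = [1 if rng.random() < (0.8 if (j < half) == (label == 1) else 0.2) else 0
              for j in range(bits)]
        if rng.random() < noise:
            label = -label
        X.append(fp)
        y.append(label)
    return X, y


if __name__ == "__main__":
    X, y = make_toy_fingerprints()
    ensemble = adaboost_svm(X, y, rounds=10)
    correct = sum(ensemble_predict(ensemble, x) == yi for x, yi in zip(X, y))
    print(f"members: {len(ensemble)}, training accuracy: {correct / len(X):.2f}")
```

Each round trains one SVM on the current sample weights, then raises the weights of misclassified fingerprints, so the final weighted vote depends less on any single classifier fitted to noisy poses.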
