QSAR Models for Phosphoramidate Prodrugs of 2′‐Methylcytidine as Inhibitors of Hepatitis C Virus Based on PSO Boosting

In the current study, boosting regression has been proposed to model the activities of a series of phosphoramidate prodrugs of 2′‐methylcytidine as inhibitors of hepatitis C virus. The stepwise multiple linear regression and particle swarm optimization strategies are used to select descriptors which are responsible for the inhibitory activity of these compounds. As comparisons to the boosting regression method, the multiple linear regression, back‐propagation neural networks, and support vector machine have also been investigated. Experimental results have shown that the boosting can drastically enhance the generalization performance of individual multiple linear regression model and the particle swarm optimization‐boosting method is a well‐performing technique in quantitative structure–activity relationship studies superior to support vector machine. The squared correlation coefficient and standard deviation of the best model are 0.744 and 0.438 for the training set and 0.710 and 0.748 for the test set.

[1]  Mauro Birattari,et al.  Swarm Intelligence , 2012, Lecture Notes in Computer Science.

[2]  Colin W Shepard,et al.  Global epidemiology of hepatitis C virus infection. , 2005, The Lancet. Infectious diseases.

[3]  Yoram Singer,et al.  Improved Boosting Algorithms Using Confidence-rated Predictions , 1998, COLT' 98.

[4]  J. Duca,et al.  Discovery of 4H-pyrazolo[1,5-a]pyrimidin-7-ones as potent inhibitors of hepatitis C virus polymerase. , 2009, Bioorganic & medicinal chemistry letters.

[5]  Mohammad Goodarzi,et al.  Augmented Three-mode MIA-QSAR Modeling for a Series of Anti-HIV-1 Compounds , 2008 .

[6]  Tarko Laszlo,et al.  Monte Carlo method for identification of outlier molecules in QSAR studies , 2009 .

[7]  Riccardo Poli,et al.  Particle swarm optimization , 1995, Swarm Intelligence.

[8]  Jian-hui Jiang,et al.  QSAR study of angiotensin II antagonists using robust boosting partial least squares regression. , 2007, Analytica chimica acta.

[9]  J. Friedman Greedy function approximation: A gradient boosting machine. , 2001 .

[10]  D L Massart,et al.  Boosting partial least squares. , 2005, Analytical chemistry.

[11]  Gunnar Rätsch,et al.  Barrier Boosting , 2000, COLT.

[12]  Marjan Vracko,et al.  Kohonen Network Study of Aromatic Compounds Based on Electronic and Nonelectronic Structure Descriptors , 2005, J. Chem. Inf. Model..

[13]  S. Shafran,et al.  Treatment of Hepatitis C in HIV-Coinfected Patients , 2006, The Annals of pharmacotherapy.

[14]  V. Kostina,et al.  New acridone-4-carboxylic acid derivatives as potential inhibitors of hepatitis C virus infection. , 2008, Bioorganic & medicinal chemistry.

[15]  M. Erion,et al.  Liver-targeted prodrugs of 2'-C-methyladenosine for therapy of hepatitis C virus infection. , 2007, Journal of medicinal chemistry.

[16]  John Shawe-Taylor,et al.  Towards a strategy for boosting regressors , 2000 .

[17]  Ruisheng Zhang,et al.  QSAR and classification models of a novel series of COX-2 selective inhibitors: 1, 5-diarylimidazoles based on support vector machines , 2004, J. Comput. Aided Mol. Des..

[18]  Eric Bauer,et al.  An Empirical Comparison of Voting Classification Algorithms: Bagging, Boosting, and Variants , 1999, Machine Learning.

[19]  M. Luini,et al.  Quantitative structure–activity relationship modelling of oral acute toxicity and cytotoxic activity of fragrance materials in rodents , 2009, SAR and QSAR in environmental research.

[20]  Niladri Chakraborty,et al.  Particle swarm optimization technique based short-term hydrothermal scheduling , 2008, Appl. Soft Comput..

[21]  R. Schapire The Strength of Weak Learnability , 1990, Machine Learning.

[22]  Thomas G. Dietterich An Experimental Comparison of Three Methods for Constructing Ensembles of Decision Trees: Bagging, Boosting, and Randomization , 2000, Machine Learning.

[23]  Dong-Sheng Cao,et al.  The boosting: A new idea of building models , 2010 .

[24]  J. Neyts,et al.  The phosphoramidate ProTide approach greatly enhances the activity of beta-2'-C-methylguanosine against hepatitis C virus. , 2009, Bioorganic & medicinal chemistry letters.

[25]  Toniann Pitassi,et al.  A Gradient-Based Boosting Algorithm for Regression Problems , 2000, NIPS.

[26]  R. Bazzo,et al.  Cyclic phosphoramidates as prodrugs of 2'-C-methylcytidine. , 2009, European journal of medicinal chemistry.

[27]  F. Narjes,et al.  Synthesis and evaluation of novel phosphoramidate prodrugs of 2'-methyl cytidine as inhibitors of hepatitis C virus NS5B polymerase. , 2009, Bioorganic & medicinal chemistry letters.

[28]  Jin Cai,et al.  Simultaneously Optimized Support Vector Regression Combined With Genetic Algorithm for QSAR Analysis of KDR/VEGFR‐2 Inhibitors , 2010, Chemical biology & drug design.

[29]  H. Akaike A new look at the statistical model identification , 1974 .

[30]  Hxugo Kubiny Variable Selection in QSAR Studies. I. An Evolutionary Algorithm , 1994 .

[31]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[32]  S. Harper,et al.  Phosphoramidate prodrugs of 2'-C-methylcytidine for therapy of hepatitis C virus infection. , 2009, Journal of medicinal chemistry.

[33]  Yoav Freund,et al.  A decision-theoretic generalization of on-line learning and an application to boosting , 1995, EuroCOLT.

[34]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[35]  E. Castro,et al.  Anticonvulsant Activity of Ringed Enaminones: A QSAR Study , 2009 .

[36]  John Mao,et al.  Synthesis and pharmacokinetics of valopicitabine (NM283), an efficient prodrug of the potent anti-HCV agent 2'-C-methylcytidine. , 2006, Journal of medicinal chemistry.

[37]  David P. Helmbold,et al.  Leveraging for Regression , 2000, COLT.

[38]  Mehmet Fatih Tasgetiren,et al.  A discrete particle swarm optimization algorithm for the no-wait flowshop scheduling problem , 2008, Comput. Oper. Res..

[39]  H. M. Vinkers,et al.  Multivariate adaptive regression splines—studies of HIV reverse transcriptase inhibitors , 2004 .

[40]  H. Kubinyi Variable Selection in QSAR Studies. II. A Highly Efficient Combination of Systematic Search and Evolution , 1994 .

[41]  Yue Shi,et al.  A modified particle swarm optimizer , 1998, 1998 IEEE International Conference on Evolutionary Computation Proceedings. IEEE World Congress on Computational Intelligence (Cat. No.98TH8360).

[42]  Zhengjun Cheng,et al.  Classification of 5-HT1A Receptor Ligands on the Basis of Their Binding Affinities by Using PSO-Adaboost-SVM , 2009, International journal of molecular sciences.

[43]  J. Friedman Special Invited Paper-Additive logistic regression: A statistical view of boosting , 2000 .

[44]  Zhengjun Cheng,et al.  QSAR studies of imidazopyridine derivatives as Et-PKG inhibitors using the PSO-SVM approach , 2010, Medicinal Chemistry Research.

[45]  Irini A. Doytchinova,et al.  Towards the chemometric dissection of peptide – HLA-A*0201 binding affinity: comparison of local and global QSAR models , 2005, J. Comput. Aided Mol. Des..

[46]  O. Deeb,et al.  Exploring QSARs for Inhibitory Activity of Non‐peptide HIV‐1 Protease Inhibitors by GA‐PLS and GA‐SVM , 2010, Chemical biology & drug design.