Support Vector Machine‐Based Quantitative Structure–Activity Relationship Study of Cholesteryl Ester Transfer Protein Inhibitors

To explore inhibition of cholesteryl ester transfer protein, a support vector machine in quantitative structure–activity relationship was developed for modeling cytotoxicity data for a series of cholesteryl ester transfer protein inhibitors. A large number of descriptors were calculated and genetic algorithm was used to select variables that resulted in the best‐fitted models. The data set was randomly divided into 68 molecules of training and 17 molecules of test set. The selected molecular descriptors were used as inputs for support vector machine. The obtained results using support vector machine were compared with those of multiple linear regression which revealed superiority of the support vector machine model over the multiple linear regression. The root mean square errors of the training set and the test set for support vector machine model were calculated to be 3.707, 5.273 and the correlation coefficients (r 2) were obtained to be 0.947, 0.899, respectively. The obtained statistical parameter of leave‐one‐out cross‐validation test correlation coefficients (q 2) on support vector machine model was 0.852, which indicates the reliability of the proposed model.

[1]  Johann Gasteiger,et al.  Prediction of 1H NMR chemical shifts using neural networks. , 2002, Analytical chemistry.

[2]  Roberto Todeschini,et al.  Handbook of Molecular Descriptors , 2002 .

[3]  A. Andricopulo,et al.  2D Quantitative structure-activity relationship studies on a series of cholesteryl ester transfer protein inhibitors. , 2007, Bioorganic & medicinal chemistry.

[4]  I. Gilbert,et al.  A QSAR study investigating the effect of L-alanine ester variation on the anti-HIV activity of some phosphoramidate derivatives of d4T. , 2000, Bioorganic & medicinal chemistry letters.

[5]  Jianzhong Liu,et al.  Effect of cholesterol on DMPC phospholipid membranes and QSAR model construction in membrane-interaction QSAR study through molecular dynamics simulation. , 2006, Bioorganic & medicinal chemistry.

[6]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[7]  Ruisheng Zhang,et al.  QSAR Study of Ethyl 2-[(3-Methyl-2, 5-dioxo(3-pyrrolinyl))amino]-4-(trifluoromethyl) pyrimidine-5-carboxylate: An Inhibitor of AP-1 and NF-B Mediated Gene Expression Based on Support Vector Machines , 2003, J. Chem. Inf. Comput. Sci..

[8]  P. Khadikar,et al.  QSAR prediction of toxicity of nitrobenzenes. , 2001, Bioorganic & medicinal chemistry.

[9]  Bernard F. Buxton,et al.  Drug Design by Machine Learning: Support Vector Machines for Pharmaceutical Data Analysis , 2001, Comput. Chem..

[10]  P. Khadikar,et al.  QSAR study on toxicity to aqueous organisms using the PI index. , 2002, Bioorganic & medicinal chemistry.

[11]  M. Ganjali,et al.  QSAR Study of 2‐(1‐Propylpiperidin‐4‐yl)‐1H‐Benzimidazole‐4‐Carboxamide as PARP Inhibitors for Treatment of Cancer , 2008, Chemical biology & drug design.

[12]  Jens Sadowski,et al.  Comparison of Support Vector Machine and Artificial Neural Network Systems for Drug/Nondrug Classification. , 2004 .

[13]  C. Hansch,et al.  On the role of polarizability in QSAR. , 2005, Bioorganic & medicinal chemistry.

[14]  M. Ganjali,et al.  Structural study of 2-(1-oxo-1 H-inden-3-yl)-2H-indene-1,3-dione by DFT calculations, NMR and IR spectroscopy. , 2008, Spectrochimica acta. Part A, Molecular and biomolecular spectroscopy.

[15]  D. Hill,et al.  The Biochemistry and Physiology of Tetrahymena , 1972 .

[16]  P. Wilson,et al.  Incidence of coronary heart disease and lipoprotein cholesterol levels. The Framingham Study. , 1986, JAMA.

[17]  H. Sharghi,et al.  QSAR Analysis for ADA upon Interaction with a Series of Adenine Derivatives as Inhibitors , 2004, Nucleosides, nucleotides & nucleic acids.

[18]  Mohammad Reza Ganjali,et al.  Application of GA-MLR, GA-PLS and the DFT quantum mechanical (QM) calculations for the prediction of the selectivity coefficients of a histamine-selective electrode , 2008 .

[19]  J. Hunger,et al.  Optimization and analysis of force field parameters by combination of genetic algorithms and neural networks , 1999, J. Comput. Chem..

[20]  Yi Wang,et al.  A Computational Approach to Botanical Drug Design by Modeling Quantitative Composition–activity Relationship , 2006, Chemical biology & drug design.

[21]  M. Ganjali,et al.  QUANTUM MECHANICAL DESCRIPTION OF THE INTERACTIONS BETWEEN DNA AND 9,10-ANTHRAQUINONE , 2008 .

[22]  M. Ganjali,et al.  Exploring QSARs for Antiviral Activity of 4‐Alkylamino‐6‐(2‐hydroxyethyl)‐2‐methylthiopyrimidines by Support Vector Machine , 2008, Chemical biology & drug design.

[23]  Rajani Giridhar,et al.  3D‐Quantitative Structure–Activity Relationship Studies on Benzothiadiazepine Hydroxamates as Inhibitors of Tumor Necrosis Factor‐α Converting Enzyme , 2008, Chemical biology & drug design.

[24]  Chris L. Waller,et al.  Development and Validation of a Novel Variable Selection Technique with Application to Multidimensional Quantitative Structure—Activity Relationship Studies. , 1999 .

[25]  B. Witherbee,et al.  Chiral N,N-disubstituted trifluoro-3-amino-2-propanols are potent inhibitors of cholesteryl ester transfer protein. , 2002, Journal of medicinal chemistry.

[26]  M. Ganjali,et al.  Experimental and quantum chemical study on the IR, UV and electrode potential of 6-(2,3-dihydro-1,3-dioxo-2-phenyl-1H-inden-2-yl)-2,3-dihydroxybenzaldehyde. , 2008, Spectrochimica acta. Part A, Molecular and biomolecular spectroscopy.

[27]  Jens Sadowski,et al.  Comparison of Support Vector Machine and Artificial Neural Network Systems for Drug/Nondrug Classification , 2003, J. Chem. Inf. Comput. Sci..

[28]  A. Tall,et al.  Plasma lipid transfer proteins, high-density lipoproteins, and reverse cholesterol transport. , 1998, Annual review of nutrition.

[29]  Shandar Ahmad,et al.  Design and training of a neural network for predicting the solvent accessibility of proteins , 2003, J. Comput. Chem..

[30]  Chris L. Waller,et al.  Development and Validation of a Novel Variable Selection Technique with Application to Multidimensional Quantitative Structure-Activity Relationship Studies , 1999, J. Chem. Inf. Comput. Sci..

[31]  M. Salavati‐Niasari,et al.  Determination of Vanadyl Ions by a New PVC Membrane Sensor Based on N, N'-bis-(Salicylidene)-2,2-Dimethylpropane-1,3-Diamine , 2007, IEEE Sensors Journal.

[32]  M. Ganjali,et al.  Partition Coefficient Prediction of a Large Set of Various Drugs and Poisons by a Genetic Algorithm and Artificial Neural Network , 2008 .

[33]  Corwin Hansch,et al.  Comprehensive medicinal chemistry : the rational design, mechanistic study & therapeutic application of chemical compounds , 1990 .

[34]  Wenjian Wang,et al.  Determination of the spread parameter in the Gaussian kernel for classification and regression , 2003, Neurocomputing.

[35]  M. Ganjali,et al.  Lanthanide Recognition: Monitoring of Praseodymium(III) by a Novel Praseodymium(III) Microsensor Based on N$'$-(Pyridin-2-Ylmethylene)Benzohydrazide , 2007, IEEE Sensors Journal.

[36]  B. Witherbee,et al.  Novel heteroaryl replacements of aromatic 3-tetrafluoroethoxy substituents in trifluoro-3-(tertiaryamino)-2-propanols as potent inhibitors of cholesteryl ester transfer protein. , 2001, Bioorganic & medicinal chemistry letters.

[37]  Dimitris K Agrafiotis,et al.  A QSAR Model of hERG Binding Using a Large, Diverse, and Internally Consistent Training Set , 2006, Chemical biology & drug design.

[38]  E. Benfenati,et al.  QSAR models of quail dietary toxicity based on the graph of atomic orbitals. , 2006, Bioorganic & medicinal chemistry letters.

[39]  Eslam Pourbasheer,et al.  QSRR Study of GC Retention Indices of Essential-Oil Compounds by Multiple Linear Regression with a Genetic Algorithm , 2008 .

[40]  Feng Luan,et al.  Diagnosing Breast Cancer Based on Support Vector Machines , 2003, J. Chem. Inf. Comput. Sci..

[41]  M. Shamsipur,et al.  Prediction of selectivity coefficients of a theophylline-selective electrode using MLR and ANN. , 2006, Talanta.

[42]  B. Witherbee,et al.  Discovery of chiral N,N-disubstituted trifluoro-3-amino-2-propanols as potent inhibitors of cholesteryl ester transfer protein. , 2000, Journal of medicinal chemistry.

[43]  R. Brereton,et al.  Support vector machines for classification and regression. , 2010, The Analyst.

[44]  Jie Yang,et al.  Support Vector Machine In Chemistry , 2004 .

[45]  E. Benfenati,et al.  QSAR models for Daphnia toxicity of pesticides based on combinations of topological parameters of molecular structures. , 2006, Bioorganic & medicinal chemistry.

[46]  Ruisheng Zhang,et al.  Prediction of the Isoelectric Point of an Amino Acid Based on GA-PLS and SVMs , 2004, J. Chem. Inf. Model..

[47]  R. Boggia,et al.  Genetic algorithms as a strategy for feature selection , 1992 .