Evaluation of a neural networks QSAR method based on ligand representation using substituent descriptors. Application to HIV-1 protease inhibitors.

We present here a neural networks method designed to predict biological activity based on a local representation of the ligand. The compounds of the series are represented by a vector mapping for each of four substituent properties: volume, log P, dipole moment and a simple 'steric' parameter relating to its shape. This ligand representation was tested using neural networks on a set of 42 cyclic-urea derivatives, inhibiting HIV-1 protease. The leave-one-out cross-validation using all descriptors in the input gave a correlation factor between prediction and experiment of 0.76 for the overall set and 0.88 when three outliers were left out. To rank the significance of the four descriptors, we further tested all combinations of two and three parameters for each substituent, using two disjunctive testing sets of five inhibitors. In these sets, vectors with extreme descriptor values were used either in the training or the testing set (sets A and B, respectively). The method is a very good interpolator (set A, 95+/-2% accuracy) but a less effective extrapolator (set B, 85+/-2% accuracy). Generally, the combinations including the 'steric' parameter predict better than average, while those containing the volume are less effective. The best prediction, 98.8+/-1.2%, was obtained when log P, the dipole and the steric parameter were used on set A. At the opposite end, the lowest ranked descriptor set was obtained when replacing log P with the volume, giving 92.3+/-6.7% accuracy over the set A.

[1]  Kenneth Levenberg A METHOD FOR THE SOLUTION OF CERTAIN NON – LINEAR PROBLEMS IN LEAST SQUARES , 1944 .

[2]  B. Rost,et al.  Improved prediction of protein secondary structure by use of sequence profiles and neural networks. , 1993, Proceedings of the National Academy of Sciences of the United States of America.

[3]  Ram Samudrala,et al.  Rappertk: a versatile engine for discrete restraint-based conformational sampling of macromolecules , 2007, BMC Structural Biology.

[4]  Zheng Rong Yang,et al.  Reduced bio basis function neural network for identification of protein phosphorylation sites: comparison with pattern recognition algorithms , 2004, Comput. Biol. Chem..

[5]  Tarun Khanna,et al.  Foundations of neural networks , 1990 .

[6]  S H Kim,et al.  Prediction of protein folding class from amino acid composition , 1993, Proteins.

[7]  D. Zakarya,et al.  Structure-toxicity relationships study of a series of organophosphorus insecticides , 2002, Journal of molecular modeling.

[8]  Gajendra Pal Singh Raghava,et al.  Prediction of β‐turns in proteins from multiple alignment using neural network , 2003, Protein science : a publication of the Protein Society.

[9]  Geoffrey E. Hinton,et al.  Learning representations by back-propagating errors , 1986, Nature.

[10]  Jay W. Ponder,et al.  Analysis and Application of Potential Energy Smoothing and Search Methods for Global Optimization , 1998 .

[11]  J. Hulten,et al.  Synthesis and comparative molecular field analysis (CoMFA) of symmetric and nonsymmetric cyclic sulfamide HIV-1 protease inhibitors. , 2001, Journal of medicinal chemistry.

[12]  A. Ghose,et al.  Atomic physicochemical parameters for three dimensional structure directed quantitative structure‐activity relationships III: Modeling hydrophobic interactions , 1988 .

[13]  D. Marquardt An Algorithm for Least-Squares Estimation of Nonlinear Parameters , 1963 .

[14]  Nikolaj Blom,et al.  BIOINFORMATICS APPLICATIONS NOTE Sequence analysis NetAcet: prediction of N-terminal acetylation sites , 2004 .

[15]  P A Kollman,et al.  Determination of the relative binding free energies of peptide inhibitors to the HIV-1 protease. , 1991, Journal of medicinal chemistry.

[16]  D. Connelly,et al.  Cross‐validation of protein structural class prediction using statistical clustering and neural networks , 1993, Protein science : a publication of the Protein Society.

[17]  S. Rick,et al.  Reaction path and free energy calculations of the transition between alternate conformations of HIV‐1 protease , 1998, Proteins.

[18]  T Lengauer,et al.  The particle concept: placing discrete water molecules during protein‐ligand docking predictions , 1999, Proteins.

[19]  Bahram Hemmateenejad,et al.  Genetic Algorithm Applied to the Selection of Factors in Principal Component-Artificial Neural Networks: Application to QSAR Study of Calcium Channel Antagonist Activity of 1, 4-Dihydropyridines (Nifedipine Analogous) , 2003, J. Chem. Inf. Comput. Sci..

[20]  R. Lohmann,et al.  A neural network model for the prediction of membrane‐spanning amino acid sequences , 1994, Protein science : a publication of the Protein Society.

[21]  Peter C. Jurs,et al.  Development of Quantitative Structure-Activity Relationship and Classification Models for a Set of Carbonic Anhydrase Inhibitors , 2002, J. Chem. Inf. Comput. Sci..

[22]  D. Manallack,et al.  Analysis of linear and nonlinear QSAR data using neural networks. , 1994, Journal of medicinal chemistry.

[23]  Pengyu Y. Ren,et al.  Polarizable Atomic Multipole Water Model for Molecular Mechanics Simulation , 2003 .

[24]  B. G. Rao,et al.  Structural and Kinetic Analyses of the Protease from an Amprenavir-Resistant Human Immunodeficiency Virus Type 1 Mutant Rendered Resistant to Saquinavir and Resensitized to Amprenavir , 2000, Journal of Virology.

[25]  Quantitative structure-activity relationships of some HIV-protease inhibitors. , 1999, Journal of enzyme inhibition.

[26]  Benny Lautrup,et al.  Neural Networks: Computers With Intuition , 1990 .

[27]  D. Zakarya,et al.  QSAR for anti-HIV activity of HEPT derivatives , 2002, SAR and QSAR in environmental research.

[28]  M. Jaskólski,et al.  Conserved folding in retroviral proteases: crystal structure of a synthetic HIV-1 protease. , 1989, Science.

[29]  Ray Luo,et al.  Comparison of generalized born and poisson models: Energetics and dynamics of HIV protease , 2000 .

[30]  R M Stroud,et al.  Domain flexibility in retroviral proteases: structural implications for drug resistant mutations. , 1998, Biochemistry.

[31]  Anton J. Hopfinger,et al.  Receptor-Independent 4D-QSAR Analysis of a Set of Norstatine Derived Inhibitors of HIV-1 Protease , 2003, J. Chem. Inf. Comput. Sci..

[32]  Johannes Schuchhardt,et al.  Adaptive encoding neural networks for the recognition of human signal peptide cleavage sites , 2000, Bioinform..

[33]  D. Joseph-McCarthy,et al.  Automated generation of MCSS‐derived pharmacophoric DOCK site points for searching multiconformation databases , 2003, Proteins.

[34]  S. Brunak,et al.  Prediction, conservation analysis, and structural characterization of mammalian mucin-type O-glycosylation sites. , 2005, Glycobiology.

[35]  Carlo W Boutton,et al.  Genotype dependent QSAR for HIV-1 protease inhibition. , 2005, Journal of medicinal chemistry.

[36]  David M. Skapura,et al.  Building neural networks , 1995 .

[37]  G Schneider,et al.  Artificial neural networks for computer-based molecular design. , 1998, Progress in biophysics and molecular biology.

[38]  G. Schneider,et al.  Development of artificial neural filters for pattern recognition in protein sequences , 1993, Journal of Molecular Evolution.

[39]  Rajni Garg,et al.  A mechanistic study of 3-aminoindazole cyclic urea HIV-1 protease inhibitors using comparative QSAR. , 2004, Bioorganic & medicinal chemistry.

[40]  A Wlodawer,et al.  Inhibitors of HIV-1 protease: a major success of structure-assisted drug design. , 1998, Annual review of biophysics and biomolecular structure.

[41]  P. Lam,et al.  Counteracting HIV-1 protease drug resistance: structural analysis of mutant proteases complexed with XV638 and SD146, cyclic urea amides with broad specificities. , 1998, Biochemistry.

[42]  A. K. Madan,et al.  Superpendentic Index: A Novel Topological Descriptor for Predicting Biological Activity , 1999, J. Chem. Inf. Comput. Sci..

[43]  Cathy H. Wu,et al.  Motif identification neural design for rapid and sensitive protein family search , 1996, Comput. Appl. Biosci..

[44]  Cathy H. Wu,et al.  Neural networks and genome informatics , 2000 .

[45]  Ajit Narayanan,et al.  Mining viral protease data to extract cleavage knowledge , 2002, ISMB.

[46]  C. Hutchison,et al.  Analysis of retroviral protease cleavage sites reveals two types of cleavage sites and the structural requirements of the P1 amino acid. , 1991, The Journal of biological chemistry.

[47]  Klaus L.E. Kaiser,et al.  The use of neural networks in QSARs for acute aquatic toxicological endpoints , 2003 .

[48]  Konstantin V. Balakin,et al.  Structure-Based versus Property-Based Approaches in the Design of G-Protein-Coupled Receptor-Targeted Libraries , 2003, J. Chem. Inf. Comput. Sci..

[49]  Richard Dybowski,et al.  Clinical applications of artificial neural networks: Theory , 2001 .

[50]  Peter C. Jurs,et al.  Prediction of Glycine/NMDA Receptor Antagonist Inhibition from Molecular Structure , 2002, J. Chem. Inf. Comput. Sci..

[51]  Paolo Carloni,et al.  Conformational flexibility of the catalytic Asp dyad in HIV‐1 protease: An ab initio study on the free enzyme , 2000, Proteins.

[52]  M. Flonta,et al.  Correlation between the predicted and the observed biological activity of the symmetric and nonsymmetric cyclic urea derivatives used as HIV‐1 protease inhibitors. A 3D‐QSAR‐CoMFA method for new antiviral drug design , 2003, Journal of cellular and molecular medicine.

[53]  Rajni Garg,et al.  HIV-1 protease inhibitors: a comparative QSAR analysis. , 2003, Current medicinal chemistry.

[54]  M. Karplus,et al.  Genetic neural networks for quantitative structure-activity relationships: improvements and application of benzodiazepine affinity for benzodiazepine/GABAA receptors. , 1996, Journal of medicinal chemistry.

[55]  I. Weber,et al.  Molecular mechanics analysis of inhibitor binding to HIV-1 protease. , 1992, Protein engineering.

[56]  Rajarshi Guha,et al.  Development of Linear, Ensemble, and Nonlinear Models for the Prediction and Interpretation of the Biological Activity of a Set of PDGFR Inhibitors , 2004, J. Chem. Inf. Model..

[57]  A. Gronenborn,et al.  Folded Monomer of HIV-1 Protease* , 2001, The Journal of Biological Chemistry.

[58]  O. Lund,et al.  novel sequence representations Reliable prediction of T-cell epitopes using neural networks with , 2003 .

[59]  M. Shamsipur,et al.  Multicomponent acid–base titration by principal component-artificial neural network calibration , 2002 .

[60]  Lu Wang,et al.  Does a diol cyclic urea inhibitor of HIV-1 protease bind tighter than its corresponding alcohol form? A study by free energy perturbation and continuum electrostatics calculations , 2001, J. Comput. Aided Mol. Des..

[61]  S Brunak,et al.  Protein structures from distance inequalities. , 1993, Journal of molecular biology.

[62]  R. Erb,et al.  Introduction to Backpropagation Neural Network Computation , 1993, Pharmaceutical Research.

[63]  I. Luque,et al.  Molecular basis of resistance to HIV-1 protease inhibition: a plausible hypothesis. , 1998, Biochemistry.

[64]  Ted von Hippel,et al.  Three-dimensional Spectral Classification of Low-Metallicity Stars Using Artificial Neural Networks , 2001, astro-ph/0107409.

[65]  J. Sutherland,et al.  A comparison of methods for modeling quantitative structure-activity relationships. , 2004, Journal of medicinal chemistry.

[66]  T. Niwa Prediction of biological targets using probabilistic neural networks and atom-type descriptors. , 2004, Journal of medicinal chemistry.

[67]  Anton J. Hopfinger,et al.  A Simple Clustering Technique To Improve QSAR Model Selection and Predictivity: Application to a Receptor Independent 4D-QSAR Analysis of Cyclic Urea Derived Inhibitors of HIV-1 Protease , 2003, J. Chem. Inf. Comput. Sci..

[68]  M. Navia,et al.  Three-dimensional structure of aspartyl protease from human immunodeficiency virus HIV-1 , 1989, Nature.

[69]  P. Jadhav,et al.  Preparation and structure-activity relationship of novel P1/P1'-substituted cyclic urea-based human immunodeficiency virus type-1 protease inhibitors. , 1996, Journal of medicinal chemistry.

[70]  Chong-Hwan Chang,et al.  Cyclic HIV protease inhibitors: synthesis, conformational analysis, P2/P2' structure-activity relationship, and molecular recognition of cyclic ureas. , 1996, Journal of medicinal chemistry.

[71]  Sorin Draghici,et al.  Predicting HIV drug resistance with neural networks , 2003, Bioinform..

[72]  Peter C. Jurs,et al.  Classification of Inhibitors of Protein Tyrosine Phosphatase 1B Using Molecular Structure Based Descriptors , 2003, J. Chem. Inf. Comput. Sci..

[73]  David A Winkler,et al.  Neural networks as robust tools in drug lead discovery and development , 2004, Molecular biotechnology.

[74]  A. Bax,et al.  Long-range magnetization transfer between uncoupled nuclei by dipole-dipole cross-correlated relaxation: a precise probe of beta-sheet geometry in proteins. , 2002, Journal of the American Chemical Society.

[75]  Cathy H. Wu,et al.  Gene classification artificial neural system , 1995, Proceedings First International Symposium on Intelligence in Neural and Biological Systems. INBS'95.

[76]  Carlos E. Padilla,et al.  MBO(N)D: A multibody method for long‐time molecular dynamics simulations , 2000 .

[77]  J. Topliss,et al.  Chance factors in studies of quantitative structure-activity relationships. , 1979, Journal of medicinal chemistry.

[78]  Ulf Norinder,et al.  Refinement of Catalyst hypotheses using simplex optimisation , 2000, J. Comput. Aided Mol. Des..

[79]  M G Ford,et al.  Selecting compounds for focused screening using linear discriminant analysis and artificial neural networks. , 2004, Journal of molecular graphics & modelling.

[80]  Gordon M. Crippen,et al.  Atomic physicochemical parameters for three-dimensional-structure-directed quantitative structure-activity relationships. 2. Modeling dispersive and hydrophobic interactions , 1987, J. Chem. Inf. Comput. Sci..

[81]  Johann Gasteiger,et al.  Neural Networks for Chemists: An Introduction , 1993 .

[82]  S. Miertus,et al.  A computational study of the resistance of HIV-1 aspartic protease to the inhibitors ABT-538 and VX-478 and design of new analogues. , 1998, Biochemical and biophysical research communications.

[83]  B. Rost PHD: predicting one-dimensional protein structure by profile-based neural networks. , 1996, Methods in enzymology.

[84]  John P. Overington,et al.  X-ray analysis of HIV-1 proteinase at 2.7 Å resolution confirms structural homology among retroviral enzymes , 1989, Nature.

[85]  Ian Spence,et al.  Selective descriptor pruning for QSAR/QSPR studies using artificial neural networks , 2003, J. Comput. Chem..

[86]  I. Muchnik,et al.  Prediction of protein folding class using global description of amino acid sequence. , 1995, Proceedings of the National Academy of Sciences of the United States of America.