New approach to pharmacophore mapping and QSAR analysis using inductive logic programming. Application to thermolysin inhibitors and glycogen phosphorylase B inhibitors.

A key problem in QSAR is the selection of appropriate descriptors to form accurate regression equations for the compounds under study. Inductive logic programming (ILP) algorithms are a class of machine-learning algorithms that have been successfully applied to a number of SAR problems. Unlike other QSAR methods, which use attributes to describe chemical structure, ILP uses relations. This gives ILP the advantages of not requiring explicit superimposition of individual compounds in a dataset, of dealing naturally with multiple conformations, and of using a language much closer to that used normally by chemists. We unify ILP and standard regression techniques to give a QSAR method that has the strength of ILP at describing steric structure with the familiarity and power of regression methods. Complex pharmacophores, correlating with activity, were identified and used as new indicator variables, along with the comparative molecular field analysis (CoMFA) prediction, to form predictive regression equations. We compared the formation of 3D-QSARs using standard CoMFA with the use of ILP on the well-studied thermolysin zinc protease inhibitor dataset and a glycogen phosphorylase inhibitor dataset. In each case the addition of ILP variables produced statistically better results (P < 0.01 for thermolysin and P < 0.05 for GP datasets) than the CoMFA analysis. Moreover, the new ILP variables were not found to increase the complexity of the final QSAR equations and gave possible insight into the binding mechanism of the ligand-protein complex under study.

[1]  F. Young Biochemistry , 1955, The Indian Medical Gazette.

[2]  R. M. Muir,et al.  Correlation of Biological Activity of Phenoxyacetic Acids with Hammett Substituent Constants and Partition Coefficients , 1962, Nature.

[3]  H. Tsuzuki,et al.  Thermolysin: kinetic study with oligopeptides. , 1970, European journal of biochemistry.

[4]  G J Williams,et al.  The Protein Data Bank: a computer-based archival file for macromolecular structures. , 1978, Archives of biochemistry and biophysics.

[5]  J. Powers,et al.  Peptide hydroxamic acids as inhibitors of thermolysin. , 1978, Biochemistry.

[6]  J. Powers,et al.  Inhibition of thermolysin and carboxypeptidase A by phosphoramidates. , 1979, Biochemistry.

[7]  Binding of the biproduct analog L-benzylsuccinic acid to thermolysin determined by X-ray crystallography. , 1979, The Journal of biological chemistry.

[8]  B. Matthews,et al.  Binding of hydroxamic acid inhibitors to crystalline thermolysin suggests a pentacoordinate zinc intermediate in catalysis. , 1982, Biochemistry.

[9]  B. Matthews,et al.  Structure of a mercaptan-thermolysin complex illustrates mode of inhibition of zinc proteases by substrate-analogue mercaptans. , 1982, Biochemistry.

[10]  P. Bartlett,et al.  Phosphonamidates as transition-state analogue inhibitors of thermolysin. , 1983, Biochemistry.

[11]  B. Matthews,et al.  An interactive computer graphics study of thermolysin-catalyzed peptide cleavage and inhibition by N-carboxymethyl dipeptides. , 1984, Biochemistry.

[12]  Eamonn F. Healy,et al.  Development and use of quantum mechanical molecular models. 76. AM1: a new general purpose quantum mechanical molecular model , 1985 .

[13]  D E Tronrud,et al.  Crystallographic structural analysis of phosphoramidates as inhibitors and transition-state analogs of thermolysin. , 1989, European journal of biochemistry.

[14]  H M Holden,et al.  Slow- and fast-binding inhibitors of thermolysin display different modes of binding: crystallographic analysis of extended phosphonamidate transition-state analogues. , 1989, Biochemistry.

[15]  D. E. Patterson,et al.  Crossvalidation, Bootstrapping, and Partial Least Squares Compared with Multiple Regression in Conventional QSAR Studies , 1988 .

[16]  G Klopman,et al.  Computer automated structure evaluation (CASE): a study of inhibitors of the thermolysin enzyme. , 1989, Journal of theoretical biology.

[17]  Garland R. Marshall,et al.  3D-QSAR of angiotensin-converting enzyme and thermolysin inhibitors: A comparison of CoMFA models based on deduced and experimentally determined active site geometries , 1993 .

[18]  D. R. Holland,et al.  Inhibition of thermolysin and neutral endopeptidase 24.11 by a novel glutaramide derivative: X-ray structure determination of the thermolysin-inhibitor complex. , 1994, Biochemistry.

[19]  Ian H. Witten,et al.  WEKA: a machine learning workbench , 1994, Proceedings of ANZIIS '94 - Australian New Zealnd Intelligent Information Systems Conference.

[20]  Jonathan D. Hirst,et al.  Quantitative structure-activity relationships by neural networks and inductive logic programming. I. The inhibition of dihydrofolate reductase by pyrimidines , 1994, J. Comput. Aided Mol. Des..

[21]  Jonathan D. Hirst,et al.  Quantitative structure-activity relationships by neural networks and inductive logic programming. II. The inhibition of dihydrofolate reductase by triazines , 1994, J. Comput. Aided Mol. Des..

[22]  L. Johnson,et al.  Glucose analogue inhibitors of glycogen phosphorylase: from crystallographic analysis to drug prediction using GRID force-field and GOLPE variable selection. , 1995, Acta crystallographica. Section D, Biological crystallography.

[23]  L. Johnson,et al.  POTENT INHIBITION OF GLYCOGEN PHOSPHORYLASE BY A SPIROHYDANTOIN OF GLUCOPYRANOSE : FIRST PYRANOSE ANALOGUES OF HYDANTOCIDIN , 1995 .

[24]  Ashwin Srinivasan,et al.  Drug Design by Machine Learning , 1995, Machine Intelligence 15.

[25]  L. Johnson,et al.  The structure of a glycogen phosphorylase glucopyranose spirohydantoin complex at 1.8 Å resolution and 100 K: The role of the water structure and its contribution to binding , 1998, Protein science : a publication of the Protein Society.

[26]  Shen Wang,et al.  Construction of a Virtual High Throughput Screen by 4D-QSAR Analysis: Application to a Combinatorial Library of Glucose Inhibitors of Glycogen Phosphorylase b , 1999, J. Chem. Inf. Comput. Sci..

[27]  P Venkatarangan,et al.  Prediction of ligand-receptor binding thermodynamics by free energy force field three-dimensional quantitative structure-activity relationship analysis: applications to a set of glucose analogue inhibitors of glycogen phosphorylase. , 1999, Journal of medicinal chemistry.

[28]  Anton J. Hopfinger,et al.  Prediction of Ligand-Receptor Binding Free Energy by 4D-QSAR Analysis: Application to a Set of Glucose Analogue Inhibitors of Glycogen Phosphorylase , 1999, J. Chem. Inf. Comput. Sci..

[29]  Ashwin Srinivasan,et al.  An assessment of submissions made to the Predictive Toxicology Evaluation Challenge , 1999, IJCAI.

[30]  Ashwin Srinivasan,et al.  Pharmacophore Discovery Using the Inductive Logic Programming System PROGOL , 1998, Machine Learning.