Quantitative Structure-Activity Relationship Studies of Progesterone Receptor Binding Steroids

The selection of appropriate descriptors is an important step in the successful formulation of quantitative structure-activity relationships (QSARs). This paper compares a number of feature selection routines and mapping methods that are in current use. They include forward stepping regression (FSR), genetic function approximation (GFA), generalized simulated annealing (GSA), and genetic neural network (GNN). On the basis of a data set of steroids of known in vitro binding affinity to the progsterone receptor, a number of QSAR models are constructed. A comparison of the predictive qualities for both training and test compounds demonstrates that the GNN protocol achieves the best results among the 2D QSAR that are considered. Analysis of the choice of descriptors by the GNN method shows that the results are consistent with established SARs on this series of compounds.

[1]  Peter C. Jurs,et al.  Prediction of Aqueous Solubility for a Diverse Set of Heteroatom-Containing Organic Compounds Using a Quantitative Structure-Property Relationship , 1996, J. Chem. Inf. Comput. Sci..

[2]  P A Kollman,et al.  Quantitative relationships between steroid structure and binding to putative progesterone receptors. , 1977, Journal of medicinal chemistry.

[3]  Hxugo Kubiny Variable Selection in QSAR Studies. I. An Evolutionary Algorithm , 1994 .

[4]  Gerald M. Maggiora,et al.  Computational neural networks as model-free mapping devices , 1992, J. Chem. Inf. Comput. Sci..

[5]  Kimito Funatsu,et al.  GA Strategy for Variable Selection in QSAR Studies: GA-Based PLS Analysis of Calcium Channel Antagonists , 1997, J. Chem. Inf. Comput. Sci..

[6]  James J. P. Stewart,et al.  MOPAC: A semiempirical molecular orbital program , 1990, J. Comput. Aided Mol. Des..

[7]  Desire L. Massart,et al.  Random correlation in variable selection for multivariate calibration with a genetic algorithm , 1996 .

[8]  R. Leardi Application of a genetic algorithm to feature selection under full validation conditions and to outlier detection , 1994 .

[9]  Peter C. Jurs,et al.  Prediction of Autoignition Temperatures of Organic Compounds from Molecular Structure , 1997, J. Chem. Inf. Comput. Sci..

[10]  Kimito Funatsu,et al.  GA Strategy for Variable Selection in QSAR Studies: Enhancement of Comparative Molecular Binding Energy Analysis by GA‐Based PLS Method , 1999 .

[11]  Hugo Kubinyi,et al.  Evolutionary variable selection in regression and PLS analyses , 1996 .

[12]  J. Topliss,et al.  Chance factors in studies of quantitative structure-activity relationships. , 1979, Journal of medicinal chemistry.

[13]  Peter C. Jurs,et al.  Prediction of Human Intestinal Absorption of Drug Compounds from Molecular Structure , 1998, J. Chem. Inf. Comput. Sci..

[14]  Brian T. Luke,et al.  Evolutionary Programming Applied to the Development of Quantitative Structure-Activity Relationships and Quantitative Structure-Property Relationships , 1994, J. Chem. Inf. Comput. Sci..

[15]  G. Teutsch,et al.  Influence of 6-azido and 6-thiocyanato substitution on progestational and corticoid activities and a structure-activity correlation in the delta6-6-substituted progestational series. , 1973, Journal of medicinal chemistry.

[16]  Katsuichiro Komatsu,et al.  Pattern recognition for the study of structure—activity relationships , 1981 .

[17]  Kimito Funatsu,et al.  GA Strategy for Variable Selection in QSAR Studies: GA-Based Region Selection for CoMFA Modeling , 1998, J. Chem. Inf. Comput. Sci..

[18]  John N. Weinstein,et al.  Mining the NCI Anticancer Drug Discovery Databases: Genetic Function Approximation for the QSAR Study of Anticancer Ellipticine Analogues , 1998, J. Chem. Inf. Comput. Sci..

[19]  R. Cramer,et al.  Comparative molecular field analysis (CoMFA). 1. Effect of shape on binding of steroids to carrier proteins. , 1988, Journal of the American Chemical Society.

[20]  Peter C. Jurs,et al.  Prediction of Supercritical Carbon Dioxide Solubility of Organic Compounds from Molecular Structure , 1997, J. Chem. Inf. Comput. Sci..

[21]  Peter C. Jurs,et al.  Prediction of Normal Boiling Points for a Diverse Set of Industrially Important Organic Compounds from Molecular Structure , 1995, J. Chem. Inf. Comput. Sci..

[22]  R. Boggia,et al.  Genetic algorithms as a strategy for feature selection , 1992 .

[23]  D. Manallack,et al.  Statistics using neural networks: chance effects. , 1993, Journal of medicinal chemistry.

[24]  R. Leardi,et al.  Genetic algorithms applied to feature selection in PLS regression: how and when to use them , 1998 .

[25]  Martin Fodslette Møller,et al.  A scaled conjugate gradient algorithm for fast supervised learning , 1993, Neural Networks.

[26]  M. Karplus,et al.  Genetic neural networks for quantitative structure-activity relationships: improvements and application of benzodiazepine affinity for benzodiazepine/GABAA receptors. , 1996, Journal of medicinal chemistry.

[27]  Deborah A. Loughney,et al.  A comparison of progestin and androgen receptor binding using the CoMFA technique , 1992, J. Comput. Aided Mol. Des..

[28]  Peter C. Jurs,et al.  Prediction of Aqueous Solubility of Organic Compounds from Molecular Structure , 1998, J. Chem. Inf. Comput. Sci..

[29]  D. Maddalena,et al.  Prediction of receptor properties and binding affinity of ligands to benzodiazepine/GABAA receptors using artificial neural networks. , 1995, Journal of medicinal chemistry.

[30]  J. Gasteiger,et al.  Automatic generation of 3D-atomic coordinates for organic molecules , 1990 .

[31]  Peter C. Jurs,et al.  Automated Descriptor Selection for Quantitative Structure-Activity Relationships Using Generalized Simulated Annealing , 1995, J. Chem. Inf. Comput. Sci..

[32]  James H. Wikel,et al.  The use of neural networks for variable selection in QSAR , 1993 .

[33]  N. Metropolis,et al.  Equation of State Calculations by Fast Computing Machines , 1953, Resonance.

[34]  K. Funatsu,et al.  GA strategy for variable selection in QSAR studies: GAPLS and D-optimal designs for predictive QSAR model , 1998 .

[35]  John H. Kalivas,et al.  Comparison of Forward Selection, Backward Elimination, and Generalized Simulated Annealing for Variable Selection , 1993 .

[36]  Thomas R. Kowar,et al.  Genetic Function Approximation Experimental Design (GFAXD): A New Method for Experimental Design , 1998, J. Chem. Inf. Comput. Sci..

[37]  Sung-Sau So,et al.  A comparative study of ligand-receptor complex binding affinity prediction methods based on glycogen phosphorylase inhibitors , 1999, J. Comput. Aided Mol. Des..

[38]  M Karplus,et al.  Evolutionary optimization in quantitative structure-activity relationship: an application of genetic neural networks. , 1996, Journal of medicinal chemistry.

[39]  Kimito Funatsu,et al.  GA Strategy for Variable Selection in QSAR Studies: Application of GA-Based Region Selection to a 3D-QSAR Study of Acetylcholinesterase Inhibitors , 1999, J. Chem. Inf. Comput. Sci..

[40]  A J Solo,et al.  Quantitative structure-activity relationships among steroids. Investigations of the use of steric parameters. , 1976, Journal of medicinal chemistry.

[41]  S. So,et al.  Application of neural networks: quantitative structure-activity relationships of the derivatives of 2,4-diamino-5-(substituted-benzyl)pyrimidines as DHFR inhibitors. , 1992, Journal of medicinal chemistry.

[42]  J P Raynaud,et al.  Correspondence analysis applied to steroid receptor binding. , 1986, Journal of medicinal chemistry.

[43]  J. Shao Linear Model Selection by Cross-validation , 1993 .

[44]  E W Bergink,et al.  Binding of progestagens to receptor proteins in MCF-7 cells. , 1983, Journal of steroid biochemistry.

[45]  A.J.v.d. Broek,et al.  Strategy in drug reseabch. Synthesis and study of the psogestational and ovulation inhibitory activity of a series of 11β-substituted-17α-ethynyl-4-estren-17β-ols , 1977, Steroids.

[46]  J P Raynaud,et al.  Binding of steroids to the progestin and glucocorticoid receptors analyzed by correspondence analysis. , 1988, Journal of medicinal chemistry.

[47]  H. Kubinyi Variable Selection in QSAR Studies. II. A Highly Efficient Combination of Systematic Search and Evolution , 1994 .

[48]  D. Rogers,et al.  Receptor surface models. 2. Application to quantitative structure-activity relationships studies. , 1995, Journal of medicinal chemistry.

[49]  Anton J. Hopfinger,et al.  Application of Genetic Function Approximation to Quantitative Structure-Activity Relationships and Quantitative Structure-Property Relationships , 1994, J. Chem. Inf. Comput. Sci..

[50]  C. D. Gelatt,et al.  Optimization by Simulated Annealing , 1983, Science.

[51]  V. J. Van Geerestein Structure of the ethanol solvate of 13-ethyl-11β-methyl-18-norlynestrenol , 1988 .