Alignment-independent technique for 3D QSAR analysis

Molecular biochemistry is controlled by 3D phenomena but structure–activity models based on 3D descriptors are infrequently used for large data sets because of the computational overhead for determining molecular conformations. A diverse dataset of 146 androgen receptor binders was used to investigate how different methods for defining molecular conformations affect the performance of 3D-quantitative spectral data activity relationship models. Molecular conformations tested: (1) global minimum of molecules’ potential energy surface; (2) alignment-to-templates using equal electronic and steric force field contributions; (3) alignment using contributions “Best-for-Each” template; (4) non-energy optimized, non-aligned (2D > 3D). Aggregate predictions from models were compared. Highest average coefficients of determination ranged from RTest2 = 0.56 to 0.61. The best model using 2D > 3D (imported directly from ChemSpider) produced RTest2 = 0.61. It was superior to energy-minimized and conformation-aligned models and was achieved in only 3–7 % of the time required using the other conformation strategies. Predictions averaged from models built on different conformations achieved a consensus RTest2 = 0.65. The best 2D > 3D model was analyzed for underlying structure–activity relationships. For the compound strongest binding to the androgen receptor, 10 substructural features contributing to binding were flagged. Utility of 2D > 3D was compared for two other activity endpoints, each modeling a medium sized data set. Results suggested that large scale, accurate predictions using 2D > 3D SDAR descriptors may be produced for interactions involving endocrine system nuclear receptors and other data sets in which strongest activities are produced by fairly inflexible substrates.

[1]  G Patlewicz,et al.  Toxmatch–a new software tool to aid in the development and evaluation of chemically similar groups , 2008, SAR and QSAR in environmental research.

[2]  W. Bremser Hose — a novel substructure code , 1978 .

[3]  H. Fang,et al.  Comparative molecular field analysis (CoMFA) model using a large diverse set of natural, synthetic and environmental chemicals for binding to the androgen receptor , 2003, SAR and QSAR in environmental research.

[4]  W. V. van IJcken,et al.  Modulation of Androgen Receptor Signaling in Hormonal Therapy-Resistant Prostate Cancer Cell Lines , 2011, PloS one.

[5]  Svetoslav H. Slavov,et al.  Complementary PLS and KNN algorithms for improved 3D-QSDAR consensus modeling of AhR binding , 2013, Journal of Cheminformatics.

[6]  Svetoslav H. Slavov,et al.  Computational identification of a phospholipidosis toxicophore using (13)C and (15)N NMR-distance based fingerprints. , 2014, Bioorganic & medicinal chemistry.

[7]  Svetoslav H. Slavov,et al.  Partial least square and k‐nearest neighbor algorithms for improved 3D quantitative spectral data–activity relationship consensus modeling of acute toxicity , 2014, Environmental toxicology and chemistry.

[8]  Alexander Golbraikh,et al.  Does Rational Selection of Training and Test Sets Improve the Outcome of QSAR Modeling? , 2012, J. Chem. Inf. Model..

[9]  Svetoslav H. Slavov,et al.  13C NMR-Distance Matrix Descriptors: Optimal Abstract 3D Space Granularity for Predicting Estrogen Binding , 2012, J. Chem. Inf. Model..

[10]  Svetoslav H. Slavov,et al.  Erratum: 13C NMR - Distance Matrix Descriptors: Optimal Abstract 3D Space Granularity for Predicting Estrogen Binding , 2012, J. Chem. Inf. Model..

[11]  Deborah A. Loughney,et al.  A comparison of progestin and androgen receptor binding using the CoMFA technique , 1992, J. Comput. Aided Mol. Des..

[12]  D. Feldman,et al.  The development of androgen-independent prostate cancer , 2001, Nature Reviews Cancer.

[13]  Krzysztof Józwiak,et al.  X-ray Crystallographic Structures as a Source of Ligand Alignment in 3D-QSAR , 2013, J. Chem. Inf. Model..

[14]  J. Meiler,et al.  Using neural networks for (13)c NMR chemical shift prediction-comparison with traditional methods. , 2002, Journal of magnetic resonance.

[15]  Daniel Cappel,et al.  Exploring conformational search protocols for ligand-based virtual screening and 3-D QSAR modeling , 2015, Journal of Computer-Aided Molecular Design.

[16]  Hugo Kubinyi,et al.  3D QSAR in drug design : theory, methods and applications , 2000 .

[17]  J. Dalton,et al.  Chemistry and Structural Biology of Androgen Receptor , 2005 .

[18]  Sean Ekins,et al.  Challenges Predicting Ligand-Receptor Interactions of Promiscuous Proteins: The Nuclear Receptor PXR , 2009, PLoS Comput. Biol..

[19]  Lei Zhang,et al.  How to generate reliable and predictive CoMFA models. , 2011, Current medicinal chemistry.

[20]  Lemont B. Kier,et al.  An Index of Molecular Flexibility from Kappa Shape Attributes , 1989 .

[21]  D. Tindall,et al.  Androgen Action in Prostate Cancer , 2009, Hormones & cancer.

[22]  Simon K. Kearsley,et al.  An alternative method for the alignment of molecular structures: Maximizing electrostatic and steric overlap , 1990 .

[23]  Henri Xhaard,et al.  SVM Classification and CoMSIA Modeling of UGT1A6 Interacting Molecules , 2014, J. Chem. Inf. Model..

[24]  A. Doweyko,et al.  QUANTITATIVE STRUCTURE-ACTIVITY RELATIONSHIPS FOR 2-((PHENYLMETHYL)SULFONYL)PYRIDINE 1-OXIDE HERBICIDES , 1983 .

[25]  G. Prins,et al.  Disruption of androgen receptor signaling in males by environmental chemicals , 2011, The Journal of Steroid Biochemistry and Molecular Biology.

[26]  William C. Eckelman,et al.  Molecular Nuclear Medicine , 2003, Springer Berlin Heidelberg.

[27]  Jonas S Almeida,et al.  Predictive non-linear modeling of complex data by artificial neural networks. , 2002, Current opinion in biotechnology.

[28]  Armando Rossello,et al.  Multitemplate Alignment Method for the Development of a Reliable 3D-QSAR Model for the Analysis of MMP3 Inhibitors , 2009, J. Chem. Inf. Model..

[29]  A. Soto,et al.  Developmental effects of endocrine-disrupting chemicals in wildlife and humans , 1994 .