The Comparative Molecular Surface Analysis (CoMSA) with Modified Uniformative Variable Elimination-PLS (UVE-PLS) Method: Application to the Steroids Binding the Aromatase Enzyme

The application of the CoMSA method to analyze 3D QSAR of 50 steroid aromatase inhibitors is described. The 3D QSAR model obtained, reaching a value of cross-validated q(2) = 0.96 (s = 0.31), significantly outperforms those reported in the literature for the CoMFA or CoSA (CoSASA). It is shown that the Uniformative Variable Elimination UVE-PLS or modified iterative UVE procedure (IVE-PLS) can be used for indicating the regions contributing to the binding activity. Thus, after separating the series into two groups of the training and test molecules quite correct external predictions result from the processing of the training set. We proved that the procedure of the data elimination provides stable results, if tested in 50 random runs of the IVE-PLS-CoMSA with different training/test sets. Depending upon the procedure used the quality of the predictions for 25 test molecules is given by SDEP = sum(y(pred)-y(obs))(2)/n)(1/2) = 0.321 - 0.782.

[1]  J. Zupan,et al.  REPRESENTATION OF MOLECULAR ELECTROSTATIC POTENTIALS BY TOPOLOGICAL FEATURE MAPS , 1994 .

[2]  M. Karelson,et al.  Structurally diverse quantitative structure--property relationship correlations of technologically relevant physical properties , 2000, Journal of chemical information and computer sciences.

[3]  J. Gasteiger,et al.  ITERATIVE PARTIAL EQUALIZATION OF ORBITAL ELECTRONEGATIVITY – A RAPID ACCESS TO ATOMIC CHARGES , 1980 .

[4]  D. Massart,et al.  Elimination of uninformative variables for multivariate calibration. , 1996, Analytical chemistry.

[5]  P. Törönen,et al.  Analysis of gene expression data using self‐organizing maps , 1999, FEBS letters.

[6]  A. Tropsha,et al.  Beware of q 2 , 2002 .

[7]  N M Luscombe,et al.  What is Bioinformatics? A Proposed Definition and Overview of the Field , 2001, Methods of Information in Medicine.

[8]  Johann Gasteiger,et al.  Berechnung der Ladungsverteilung in konjugierten Systemen durch eine Quantifizierung des Mesomeriekonzeptes , 1985 .

[9]  J. Gasteiger,et al.  Calculation of the Charge Distribution in Conjugated Systems by a Quantification of the Resonance Concept , 1985 .

[10]  Peter C. Jurs,et al.  Prediction of Supercritical Carbon Dioxide Solubility of Organic Compounds from Molecular Structure , 1997, J. Chem. Inf. Comput. Sci..

[11]  Jaroslaw Polanski,et al.  The Comparative Molecular Surface Analysis (COMSA): A Novel Tool for Molecular Design , 2000, Comput. Chem..

[12]  Jaroslaw Polanski,et al.  The Comparative Molecular Surface Analysis (COMSA) — A Nongrid 3D QSAR Method by a Coupled Neural Network and PLS System: Predicting pKa Values of Benzoic and Alkanoic Acids. , 2002 .

[13]  H. Kubinyi QSAR and 3D QSAR in drug design Part 1: methodology , 1997 .

[14]  Dan A. Buzatu,et al.  13C NMR Quantitative Spectrometric Data-Activity Relationship (QSDAR) Models of Steroids Binding the Aromatase Enzyme , 2001, J. Chem. Inf. Comput. Sci..

[15]  Desire L. Massart,et al.  Detection of inhomogeneities in sets of NIR spectra , 1996 .

[16]  A. Brazma,et al.  Gene expression data analysis , 2000, FEBS letters.

[17]  H. Kubinyi QSAR and 3D QSAR in drug design Part 2: applications and problems , 1997 .

[18]  Jaroslaw Polanski The Receptor‐Like Neural Network for Modeling Corticosteroid and Testosterone Binding Globulins. , 1997 .

[19]  Jens Sadowski,et al.  The Use of Self-organizing Neural Networks in Drug Design , 2002 .

[20]  J. Gasteiger,et al.  FROM ATOMS AND BONDS TO THREE-DIMENSIONAL ATOMIC COORDINATES : AUTOMATIC MODEL BUILDERS , 1993 .

[21]  A. Tropsha,et al.  Beware of q2! , 2002, Journal of molecular graphics & modelling.

[22]  Gerhard Klebe,et al.  Comparison of Automatic Three-Dimensional Model Builders Using 639 X-ray Structures , 1994, J. Chem. Inf. Comput. Sci..

[23]  Jaroslaw Polanski,et al.  The Comparative Molecular Surface Analysis (COMSA) - A Nongrid 3D QSAR Method by a Coupled Neural Network and PLS System: Predicting pKa Values of Benzoic and Alkanoic Acids , 2002, J. Chem. Inf. Comput. Sci..

[24]  J Polanski,et al.  Self-organizing neural networks for screening and development of novel artificial sweetener candidates. , 2000, Combinatorial chemistry & high throughput screening.

[25]  Johann Gasteiger,et al.  The comparison of geometric and electronic properties of molecular surfaces by neural networks: Application to the analysis of corticosteroid-binding globulin activity of steroids , 1996, J. Comput. Aided Mol. Des..

[26]  H. Kubinyi QSAR : Hansch analysis and related approaches , 1993 .