Chapter 7 Variable Selection QSAR Modeling, Model Validation, and Virtual Screening

Publisher Summary This chapter discusses and illustrates variable selection that rigorously validate quantitative structure–activity relationship (QSAR) models with statistically significant external predictive power ultimately afford their application as reliable virtual screening tools for database mining or chemical library design. Several criteria of model robustness have been presented in the chapter, particularly emphasizing on the external predictive power of the model. The traditional LOO q 2 is insufficient statistical characteristic of model predictive power. Thus, although low values of q 2 almost certainly indicate that the model is not expected to be predictive externally, high values of this parameter alone also provide no guarantee that the underlying models can be used for the reliable external prediction. The application of these principles to the development of the predictive QSAR models of anticonvulsant agents followed by the use of these models for database mining is shown to afford the discovery of novel anticonvulsant agents. The robustness and external predictive power and applicability domain as the most important parameters of QSAR models shall increase their practical use as reliable virtual screening tools for pharmaceutical lead identification.

[1]  Abby L. Parrill,et al.  Rational drug design : novel methodology and practical applications , 1999 .

[2]  Ulf Norinder,et al.  Single and domain mode variable selection in 3D QSAR applications , 1996 .

[3]  Tudor I. Oprea,et al.  3D QSAR Modeling in Drug Design , 2003 .

[4]  A Tropsha,et al.  Identification of the descriptor pharmacophores using variable selection QSAR: applications to database mining. , 2001, Current pharmaceutical design.

[5]  Nikolai S. Zefirov,et al.  QSAR for Boiling Points of "Small" Sulfides. Are the "High-Quality Structure-Property-Activity Regressions" the Real High Quality QSAR Models? , 2001, J. Chem. Inf. Comput. Sci..

[6]  M Karplus,et al.  Evolutionary optimization in quantitative structure-activity relationship: an application of genetic neural networks. , 1996, Journal of medicinal chemistry.

[7]  Ramon Carbó-Dorca,et al.  Modeling Antimalarial Activity: Application of Kinetic Energy Density Quantum Similarity Measures as Descriptors in QSAR , 2000, J. Chem. Inf. Comput. Sci..

[8]  Svante Wold,et al.  Partial least-squares method for spectrofluorimetric analysis of mixtures of humic acid and lignin sulfonate , 1983 .

[9]  L. Hall,et al.  Three new consensus QSAR models for the prediction of Ames genotoxicity. , 2004, Mutagenesis.

[10]  Z. Szántó,et al.  Comparative three-dimensional quantitative structure-activity relationship study of safeners and herbicides. , 2000, Journal of agricultural and food chemistry.

[11]  Takahiro Suzuki,et al.  Classification of Environmental Estrogens by Physicochemical Properties Using Principal Component Analysis and Hierarchical Cluster Analysis , 2001, J. Chem. Inf. Comput. Sci..

[12]  Hxugo Kubiny Variable Selection in QSAR Studies. I. An Evolutionary Algorithm , 1994 .

[13]  Alexander Tropsha,et al.  Novel Variable Selection Quantitative Structure-Property Relationship Approach Based on the k-Nearest-Neighbor Principle , 2000, J. Chem. Inf. Comput. Sci..

[14]  A. Tropsha,et al.  Beware of q2! , 2002, Journal of molecular graphics & modelling.

[15]  S. Weisberg Plots, transformations, and regression , 1985 .

[16]  Alexander Tropsha,et al.  Recent Trends in Quantitative Structure‐Activity Relationships , 2003 .

[17]  Anton J. Hopfinger,et al.  Application of Genetic Function Approximation to Quantitative Structure-Activity Relationships and Quantitative Structure-Property Relationships , 1994, J. Chem. Inf. Comput. Sci..

[18]  Alexander Golbraikh,et al.  Combinatorial QSAR of Ambergris Fragrance Compounds , 2004, J. Chem. Inf. Model..

[19]  Tudor I. Oprea,et al.  Three-dimensional quantitative structure-activity relationships of steroid aromatase inhibitors , 1996, J. Comput. Aided Mol. Des..

[20]  Han van de Waterbeemd,et al.  Chemometric methods in molecular design , 1995 .

[21]  Sung Jin Cho,et al.  Rational Combinatorial Library Design. 2. Rational Design of Targeted Combinatorial Peptide Libraries Using Chemical Similarity Probe and the Inverse QSAR Approaches , 1998, J. Chem. Inf. Comput. Sci..

[22]  Ettore Novellino,et al.  Use of comparative molecular field analysis and cluster analysis in series design , 1995 .

[23]  Alexander Golbraikh,et al.  Application of predictive QSAR models to database mining: identification and experimental validation of novel anticonvulsant compounds. , 2004, Journal of medicinal chemistry.

[24]  Alexander Golbraikh,et al.  Quantitative Structure−Activity Relationship Analysis of Functionalized Amino Acid Anticonvulsant Agents Using k Nearest Neighbor and Simulated Annealing PLS Methods , 2002 .

[25]  L. Pardo,et al.  Molecular determinants of MAO selectivity in a series of indolylmethylamine derivatives: biological activities, 3D-QSAR/CoMFA analysis, and computational simulation of ligand recognition. , 2000, Journal of medicinal chemistry.

[26]  Tudor I. Oprea,et al.  Rapid Estimation of Hydrophobicity for Virtual Combinatorial Library Analysis , 2001, SAR and QSAR in environmental research.

[27]  J L Katz,et al.  2D QSAR modeling and preliminary database searching for dopamine transporter inhibitors using genetic algorithm variable selection of Molconn Z descriptors. , 2000, Journal of medicinal chemistry.

[28]  S. J. Cho,et al.  "New tricks for an old dog": Development and application of novel QSAR methods for rational design of combinatorial chemical libraries and database mining , 1999 .

[29]  M. Karplus,et al.  Genetic neural networks for quantitative structure-activity relationships: improvements and application of benzodiazepine affinity for benzodiazepine/GABAA receptors. , 1996, Journal of medicinal chemistry.

[30]  J N Weinstein,et al.  Quantitative structure-antitumor activity relationships of camptothecin analogues: cluster analysis and genetic algorithm-based studies. , 2001, Journal of medicinal chemistry.

[31]  S. Wold,et al.  Statistical Validation of QSAR Results , 1995 .

[32]  M. Wolff,et al.  BURGER'S MEDICINAL CHEMISTRY AND DRUG DISCOVERY , 1996 .

[33]  John Mandel,et al.  The Regression Analysis of Collinear Data. , 1986, Journal of research of the National Bureau of Standards.

[34]  Osman F. Güner,et al.  Pharmacophore perception, development, and use in drug design , 2000 .

[35]  Alexander Golbraikh,et al.  Rational selection of training and test sets for the development of validated QSAR models , 2003, J. Comput. Aided Mol. Des..

[36]  Alexander Golbraikh,et al.  Predictive QSAR modeling based on diversity sampling of experimental datasets for the training and test set selection , 2002, J. Comput. Aided Mol. Des..

[37]  A. Cavalli,et al.  SAR of 9-amino-1,2,3,4-tetrahydroacridine-based acetylcholinesterase inhibitors: synthesis, enzyme inhibitory activity, QSAR, and structure-based CoMFA of tacrine analogues. , 2000, Journal of medicinal chemistry.

[38]  H. Kubinyi,et al.  Three-dimensional quantitative similarity-activity relationships (3D QSiAR) from SEAL similarity matrices. , 1998, Journal of medicinal chemistry.