Statistical learning approach for predicting specific pharmacodynamic, pharmacokinetic, or toxicological properties of pharmaceutical agents

Pharmaceutical agents have been developed and tested for possessing desirable pharmacodynamic, pharmacokinetic, and minimal level of toxicological properties. Computational methods have been explored for predicting these properties aimed at the discovery of promising leads and the elimination of unsuitable ones in early stages of drug development. Statistical learning methods have shown their potential for predicting these properties for structurally diverse sets of agents by using both conventional (quantitative structure–activity and structure–property relationships) and more recently explored (such as neural networks and support vector machines) statistical models. These methods have been used for predicting agents of a variety of pharmacodynamic (such as inhibitors or agonists of a therapeutic target), pharmacokinetic (such as P‐glycoprotein substrates, human intestine absorption, and blood–brain barrier penetrating capabilities), and toxicological (such as genotoxicity) properties. The strategies, current progresses, and the underlying difficulties and future prospects of the application of the recently explored statistical learning methods are discussed. Drug Dev. Res. 66:245–259, 2006. © 2006 Wiley‐Liss, Inc.

[1]  Jason Weston,et al.  Gene Selection for Cancer Classification using Support Vector Machines , 2002, Machine Learning.

[2]  Paola Gramatica,et al.  Validated QSAR Prediction of OH Tropospheric Degradation of VOCs: Splitting into Training-Test Sets and Consensus Modeling , 2004, J. Chem. Inf. Model..

[3]  Ron Kohavi,et al.  Wrappers for Feature Subset Selection , 1997, Artif. Intell..

[4]  A. J. Hopfinger,et al.  Predicting Blood–Brain Barrier Partitioning of Organic Molecules Using Membrane–Interaction QSAR Analysis , 2002, Pharmaceutical Research.

[5]  Sean B. Holden,et al.  Support Vector Machines for ADME Property Classification , 2003 .

[6]  Tatiana Nikolskaya,et al.  Early prediction of drug metabolism and toxicity: systems biology approach and modeling. , 2004, Drug discovery today.

[7]  C. B. Lucasius,et al.  Understanding and using genetic algorithms Part 1. Concepts, properties and context , 1993 .

[8]  Robert S. Pearlman,et al.  Metric Validation and the Receptor-Relevant Subspace Concept , 1999, J. Chem. Inf. Comput. Sci..

[9]  M Pirmohamed,et al.  Advances in molecular toxicology-towards understanding idiosyncratic drug toxicity. , 2000, Toxicology.

[10]  Corwin Hansch,et al.  QSAR and ADME. , 2004, Bioorganic & medicinal chemistry.

[11]  H. van de Waterbeemd,et al.  ADMET in silico modelling: towards prediction paradise? , 2003, Nature reviews. Drug discovery.

[12]  J. F. Wang,et al.  Prediction of P-Glycoprotein Substrates by a Support Vector Machine Approach , 2004, J. Chem. Inf. Model..

[13]  Dimitris K Agrafiotis,et al.  A method for quantifying and visualizing the diversity of QSAR models. , 2004, Journal of molecular graphics & modelling.

[14]  Junwei Zhang,et al.  Development of KiBank, a database supporting structure-based drug design , 2004, Comput. Biol. Chem..

[15]  M. Lader,et al.  Diazepam and N-desmethyldiazepam concentrations in saliva, plasma and CSF. , 1980, British journal of clinical pharmacology.

[16]  John H. Kalivas,et al.  Comparison of Forward Selection, Backward Elimination, and Generalized Simulated Annealing for Variable Selection , 1993 .

[17]  Alexander Tropsha,et al.  Quantitative structure-pharmacokinetic parameters relationships (QSPKR) analysis of antimicrobial agents in humans using simulated annealing k-nearest-neighbor and partial least-square analysis methods. , 2004, Journal of pharmaceutical sciences.

[18]  X. Chen,et al.  CLiBE: A Database of Computed Ligand Binding Energy for Ligand-receptor Complexes , 2002, Comput. Chem..

[19]  J P Doucet,et al.  QSAR and classification study of 1,4-dihydropyridine calcium channel antagonists based on least squares support vector machines. , 2005, Molecular pharmaceutics.

[20]  James A. Platts,et al.  Estimation of Molecular Linear Free Energy Relation Descriptors Using a Group Contribution Approach , 1999, J. Chem. Inf. Comput. Sci..

[21]  P. Jurs,et al.  Development of binary classification of structural chromosome aberrations for a diverse set of organic compounds from molecular structure. , 2003, Chemical research in toxicology.

[22]  Johann Gasteiger,et al.  The Coding of the Three-Dimensional Structure of Molecules by Molecular Transforms and Its Application to Structure-Spectra Correlations and Studies of Biological Activity , 1996, J. Chem. Inf. Comput. Sci..

[23]  Yu Zong Chen,et al.  Prediction of Cytochrome P450 3A4, 2D6, and 2C9 Inhibitors and Substrates by Using Support Vector Machines , 2005, J. Chem. Inf. Model..

[24]  A. Hopfinger A QSAR investigation of dihydrofolate reductase inhibition by Baker triazines based upon molecular shape analysis , 1980 .

[25]  M. Randic,et al.  Graph theoretical approach to local and overall aromaticity of benzenenoid hydrocarbons , 1975 .

[26]  H. Kubinyi QSAR and 3D QSAR in drug design Part 2: applications and problems , 1997 .

[27]  Joseph V. Turner,et al.  Bioavailability Prediction Based on Molecular Structure for a Diverse Series of Drugs , 2004, Pharmaceutical Research.

[28]  Bernard F. Buxton,et al.  Drug Design by Machine Learning: Support Vector Machines for Pharmaceutical Data Analysis , 2001, Comput. Chem..

[29]  Walters Wp,et al.  Feature selection in quantitative structure-activity relationships. , 2005 .

[30]  R. E. White,et al.  High-throughput screening in drug metabolism and pharmacokinetic support of drug discovery. , 2000, Annual review of pharmacology and toxicology.

[31]  S. Ekins,et al.  Present and future in vitro approaches for drug metabolism. , 2000, Journal of pharmacological and toxicological methods.

[32]  Tingjun Hou,et al.  ADME Evaluation in Drug Discovery. 3. Modeling Blood-Brain Barrier Partitioning Using Simple Molecular Descriptors , 2003, J. Chem. Inf. Comput. Sci..

[33]  Johann Gasteiger,et al.  Deriving the 3D structure of organic molecules from their infrared spectra , 1999 .

[34]  M. Randic,et al.  MOLECULAR PROFILES NOVEL GEOMETRY-DEPENDENT MOLECULAR DESCRIPTORS , 1995 .

[35]  Grover,et al.  Quantitative structure-property relationships in pharmaceutical research - Part 2. , 2000, Pharmaceutical science & technology today.

[36]  Cesare Furlanello,et al.  An accelerated procedure for recursive feature ranking on microarray data , 2003, Neural Networks.

[37]  Mario Lobell,et al.  In silico prediction of aqueous solubility, human plasma protein binding and volume of distribution of compounds from calculated pKa and AlogP98 values , 2004, Molecular Diversity.

[38]  M. Karelson,et al.  QSPR as a means of predicting and understanding chemical and physical properties in terms of structure , 1997 .

[39]  Xin Chen,et al.  Effect of Molecular Descriptor Feature Selection in Support Vector Machine Classification of Pharmacokinetic and Toxicological Properties of Chemical Agents , 2004, J. Chem. Inf. Model..

[40]  D. Manallack,et al.  Neural networks in drug discovery: Have they lived up to their promise? , 1999 .

[41]  S. Walker,et al.  Pharmaceutical innovation by the seven UK-owned pharmaceutical companies (1964-1985). , 1988, British journal of clinical pharmacology.

[42]  B. Roth,et al.  The Multiplicity of Serotonin Receptors: Uselessly Diverse Molecules or an Embarrassment of Riches? , 2000 .

[43]  Jorge Gálvez,et al.  Charge Indexes. New Topological Descriptors , 1994, J. Chem. Inf. Comput. Sci..

[44]  Thomas Hofmann,et al.  Predicting CNS Permeability of Drug Molecules: Comparison of Neural Network and Support Vector Machine Algorithms , 2002, J. Comput. Biol..

[45]  Donald F. Specht,et al.  Probabilistic neural networks , 1990, Neural Networks.

[46]  Zhi-Wei Cao,et al.  Effect of Selection of Molecular Descriptors on the Prediction of Blood-Brain Barrier Penetrating and Nonpenetrating Agents by Statistical Learning Methods , 2005, J. Chem. Inf. Model..

[47]  Roberto Todeschini,et al.  MS-WHIM, new 3D theoretical descriptors derived from molecular surface properties: A comparative 3D QSAR study in a series of steroids , 1997, J. Comput. Aided Mol. Des..

[48]  Y. Z. Chen,et al.  Quantitative Structure-Pharmacokinetic Relationships for drug distribution properties by using general regression neural network. , 2005, Journal of pharmaceutical sciences.

[49]  Gerta Rücker,et al.  Counts of all walks as atomic and molecular descriptors , 1993, J. Chem. Inf. Comput. Sci..

[50]  David T. Stanton,et al.  On the Physical Interpretation of QSAR Models , 2003, J. Chem. Inf. Comput. Sci..

[51]  J. Drews Drug discovery: a historical perspective. , 2000, Science.

[52]  H. Yu,et al.  Discovering compact and highly discriminative features or combinations of drug activities using support vector machines , 2003, Computational Systems Bioinformatics. CSB2003. Proceedings of the 2003 IEEE Bioinformatics Conference. CSB2003.

[53]  M Pastor,et al.  VolSurf: a new tool for the pharmacokinetic optimization of lead compounds. , 2000, European journal of pharmaceutical sciences : official journal of the European Federation for Pharmaceutical Sciences.

[54]  Z R Li,et al.  Prediction of genotoxicity of chemical compounds by statistical learning methods. , 2005, Chemical research in toxicology.

[55]  M. Bayliss,et al.  Combining high-throughput pharmacokinetic screens at the hits-to-leads stage of drug discovery. , 2000, Drug discovery today.

[56]  Roberto Todeschini,et al.  Structure/Response Correlations and Similarity/Diversity Analysis by GETAWAY Descriptors, 1. Theory of the Novel 3D Molecular Descriptors , 2002, J. Chem. Inf. Comput. Sci..

[57]  Zheng Rong Yang,et al.  Evaluation of Mutual Information and Genetic Programming for Feature Selection in QSAR , 2004, J. Chem. Inf. Model..

[58]  Bernhard Schölkopf,et al.  Feature selection and transduction for prediction of molecular bioactivity for drug design , 2003, Bioinform..

[59]  Ying Liu,et al.  A Comparative Study on Feature Selection Methods for Drug Discovery , 2004, J. Chem. Inf. Model..

[60]  J. Caldwell,et al.  An Introduction to Drug Disposition: The Basic Principles of Absorption, Distribution, Metabolism, and Excretion , 1995, Toxicologic pathology.