Comprehensive Network Map of ADME‐Tox Databases

In the last decade, many statistical‐based approaches have been developed to improve poor pharmacokinetics (PK) and to reduce toxicity of lead compounds, which are one of the main causes of high failure rate in drug development. Predictive QSAR models are not always very efficient due to the low number of available biological data and the differences in the experimental protocols. Fortunately, the number of available databases continues to grow every year. However, it remains a challenge to determine the source and the quality of the original data. The main goal is to identify the relevant databases required to generate the most robust predictive models. In this study, an interactive network of databases was proposed to easily find online data sources related to ADME‐Tox parameters data. In this map, relevant information regarding scope of application, data availability and data redundancy can be obtained for each data source. To illustrate the usage of data mining from the network, a dataset on plasma protein binding is selected based on various sources such as DrugBank, PubChem and ChEMBL databases. A total of 2,606 unique molecules with experimental values of PPB were extracted and can constitute a consistent dataset for QSAR modeling.

[1]  K. Giacomini,et al.  The UCSF‐FDA TransPortal: A Public Drug Transporter Database , 2012, Clinical pharmacology and therapeutics.

[2]  Igor V. Tetko,et al.  Online chemical modeling environment (OCHEM): web platform for data storage, model development and publishing of chemical information , 2011, J. Comput. Aided Mol. Des..

[3]  Gang Fu,et al.  PubChem Substance and Compound databases , 2015, Nucleic Acids Res..

[4]  Philip Wexler,et al.  Toxnet: A computerized collection of toxicological and environmental health information , 2000, Toxicology and industrial health.

[5]  Mick J. Ridley,et al.  Data governance in predictive toxicology: A review , 2011, J. Cheminformatics.

[6]  Alexander Tropsha,et al.  QSAR modeling of human serum protein binding with several modeling techniques utilizing structure-information representation. , 2006, Journal of medicinal chemistry.

[7]  H. Waterbeemd,et al.  Can the Internet help to meet the challenges in ADME and e-ADME? , 2002 .

[8]  Diana Montes-Grajales,et al.  EDCs DataBank: 3D-Structure database of endocrine disrupting chemicals. , 2015, Toxicology.

[9]  Henri Xhaard,et al.  Applying Linear and Non-Linear Methods for Parallel Prediction of Volume of Distribution and Fraction of Unbound Drug , 2013, PloS one.

[10]  Shane Weaver,et al.  The importance of the domain of applicability in QSAR modeling. , 2008, Journal of molecular graphics & modelling.

[11]  Kazuto Yamazaki,et al.  Computational prediction of the plasma protein-binding percent of diverse pharmaceutical compounds. , 2004, Journal of pharmaceutical sciences.

[12]  Pierre R. Bushel,et al.  Chemical effects in biological systems (CEBS) object model for toxicology data, SysTox-OM: design and application , 2006, Bioinform..

[13]  Philip E. Bourne,et al.  SuperTarget goes quantitative: update on drug–target interactions , 2011, Nucleic Acids Res..

[14]  R Daniel Benz,et al.  Landscape of current toxicity databases and database standards. , 2006, Current opinion in drug discovery & development.

[15]  David S. Wishart,et al.  DrugBank: a comprehensive resource for in silico drug discovery and exploration , 2005, Nucleic Acids Res..

[16]  Mario Lobell,et al.  In silico prediction of aqueous solubility, human plasma protein binding and volume of distribution of compounds from calculated pKa and AlogP98 values , 2004, Molecular Diversity.

[17]  Jie Shen,et al.  admetSAR: A Comprehensive Source and Free Tool for Assessment of Chemical ADMET Properties , 2012, J. Chem. Inf. Model..

[18]  Ann Richard,et al.  ACToR--Aggregated Computational Toxicology Resource. , 2008, Toxicology and applied pharmacology.

[19]  Andreas Bender,et al.  Metrabase: a cheminformatics and bioinformatics database for small molecule transporter data analysis and (Q)SAR modeling , 2015, Journal of Cheminformatics.

[20]  Lionel Colliandre,et al.  e-Drug3D: 3D structure collections dedicated to drug repurposing and fragment-based drug design , 2012, Bioinform..

[21]  Wei Liu,et al.  Predicting human plasma protein binding of drugs using plasma protein interaction QSAR analysis (PPI‐QSAR) , 2011, Biopharmaceutics & drug disposition.

[22]  Ulrike Schmidt,et al.  SuperToxic: a comprehensive database of toxic compounds , 2008, Nucleic Acids Res..

[23]  Igor V. Tetko,et al.  BIGCHEM: Challenges and Opportunities for Big Data Analysis in Chemistry , 2016, Molecular informatics.

[24]  Sarma A R P Jagarlapudi,et al.  Database systems for knowledge-based discovery. , 2009, Methods in molecular biology.

[25]  David S. Wishart,et al.  T3DB: a comprehensively annotated database of common toxins and their targets , 2009, Nucleic Acids Res..

[26]  Ajit Jadhav,et al.  Dealing with the Data Deluge: Handling the Multitude of Chemical Biology Data Sources , 2012, Current protocols in chemical biology.

[27]  M. Alpers Published online: , 2001 .

[28]  Junmei Wang,et al.  Genetic Algorithm-Optimized QSPR Models for Bioavailability, Protein Binding, and Urinary Excretion , 2006, J. Chem. Inf. Model..

[29]  Michael Y. Galperin,et al.  The 2016 database issue of Nucleic Acids Research and an updated molecular biology database collection , 2015, Nucleic Acids Res..

[30]  George Papadatos,et al.  The ChEMBL bioactivity database: an update , 2013, Nucleic Acids Res..

[31]  R. M. Owen,et al.  An analysis of the attrition of drug candidates from four major pharmaceutical companies , 2015, Nature Reviews Drug Discovery.

[32]  Hiroshi Yamada,et al.  Open TG-GATEs: a large-scale toxicogenomics database , 2014, Nucleic Acids Res..

[33]  Haibo Yu,et al.  hERGCentral: a large database to store, retrieve, and analyze compound-human Ether-à-go-go related gene channel interactions to facilitate cardiotoxicity assessment in drug development. , 2011, Assay and drug development technologies.

[34]  Robert Preissner,et al.  WITHDRAWN—a resource for withdrawn and discontinued drugs , 2015, Nucleic Acids Res..

[35]  M. Fielden,et al.  Development of a large-scale chemogenomics database to improve drug candidate selection and to understand mechanisms of chemical toxicity and action. , 2005, Journal of biotechnology.

[36]  Jacob Köhler,et al.  Addressing the problems with life-science databases for traditional uses and systems biology , 2006, Nature Reviews Genetics.

[37]  John Hodgson,et al.  ADMET—turning chemicals into drugs , 2001, Nature Biotechnology.

[38]  Saskia Preissner,et al.  The Transformer database: biotransformation of xenobiotics , 2013, Nucleic Acids Res..

[39]  Xin Chen,et al.  ADME-AP: a database of ADME associated proteins , 2002, Bioinform..

[40]  J. Dearden,et al.  QSAR modeling: where have you been? Where are you going to? , 2014, Journal of medicinal chemistry.

[41]  Dieter Lang,et al.  Predicting drug metabolism: experiment and/or computation? , 2015, Nature Reviews Drug Discovery.

[42]  Russ B. Altman,et al.  PharmGKB: the Pharmacogenetics Knowledge Base , 2002, Nucleic Acids Res..

[43]  Tingjun Hou,et al.  ADMET Evaluation in Drug Discovery. 11. PharmacoKinetics Knowledge Base (PKKB): A Comprehensive Database of Pharmacokinetic and Toxic Properties for Drugs , 2012, J. Chem. Inf. Model..

[44]  Henri Xhaard,et al.  IDAAPM: integrated database of ADMET and adverse effects of predictive modeling based on FDA approved drug data , 2016, Journal of Cheminformatics.

[45]  John P. Overington,et al.  Chemical databases: curation or integration by user-defined equivalence? , 2015, Drug discovery today. Technologies.

[46]  Kasper Daniel Hansen,et al.  The Bioconductor channel in F1000Research. , 2015, F1000Research.

[47]  Edward W. Lowe,et al.  Computational Methods in Drug Discovery , 2014, Pharmacological Reviews.

[48]  John P. Overington,et al.  ChEMBL: a large-scale bioactivity database for drug discovery , 2011, Nucleic Acids Res..

[49]  Shu-Shen Liu,et al.  The Use of Pseudo-Equilibrium Constant Affords Improved QSAR Models of Human Plasma Protein Binding , 2013, Pharmaceutical Research.