Cheminformatics: At the Crossroad of Eras

In this chapter, we discuss how the profusion of experimental chemogenomics data available in public repositories is transforming the field of cheminformatics. In particular, we describe (i) both theoretical and technical challenges related to the management, analysis, and visualization of large and diverse chemical datasets, (ii) the unique opportunities offered by Big Chemical Data for designing molecules with the desired properties and expanding the use of cheminformatics in novel areas of research, and (iii) some innovative approaches that are likely to shape the future of cheminformatics.

[1]  A. Tropsha,et al.  Quantitative nanostructure-activity relationship modeling. , 2010, ACS nano.

[2]  Alexander Golbraikh,et al.  Integrative chemical-biological read-across approach for chemical hazard classification. , 2013, Chemical research in toxicology.

[3]  Xiang-Qun Xie,et al.  Exploiting PubChem for virtual screening , 2010, Expert opinion on drug discovery.

[4]  C. Austin,et al.  Improving the Human Hazard Characterization of Chemicals: A Tox21 Update , 2013, Environmental health perspectives.

[5]  Matthias Rarey,et al.  Fast force field‐based optimization of protein–ligand complexes with graphics processor , 2012, J. Comput. Chem..

[6]  Jürgen Bajorath,et al.  A Perspective on Computational Chemogenomics , 2013, Molecular informatics.

[7]  David M. Reif,et al.  Profiling 976 ToxCast Chemicals across 331 Enzymatic and Receptor Signaling Assays , 2013, Chemical research in toxicology.

[8]  Alexander Tropsha,et al.  Using Graph Indices for the Analysis and Comparison of Chemical Datasets , 2013, Molecular informatics.

[9]  Alexandre Varnek,et al.  Estimation of the size of drug-like chemical space based on GDB-17 data , 2013, Journal of Computer-Aided Molecular Design.

[10]  Alexander Tropsha,et al.  Cheminformatics analysis of assertions mined from literature that describe drug-induced liver injury in different species. , 2010, Chemical research in toxicology.

[11]  Brian K. Shoichet,et al.  ZINC - A Free Database of Commercially Available Compounds for Virtual Screening , 2005, J. Chem. Inf. Model..

[12]  Antony J. Williams,et al.  Dispensing Processes Impact Apparent Biological Activity as Determined by Computational and Statistical Analyses , 2013, PloS one.

[13]  Alexandre Varnek,et al.  QSPR Modeling of the AmIII/EuIII Separation Factor: How Far Can we Predict ? , 2007 .

[14]  F. Prinz,et al.  Believe it or not: how much can we rely on published data on potential drug targets? , 2011, Nature Reviews Drug Discovery.

[15]  D Rognan,et al.  Towards the Next Generation of Computational Chemogenomics Tools , 2013, Molecular informatics.

[16]  Christopher G. Chute,et al.  Some experiences and opportunities for big data in translational research , 2013, Genetics in Medicine.

[17]  Bradley M. Hemminger,et al.  Mining connections between chemicals, proteins, and diseases extracted from Medline annotations , 2010, J. Biomed. Informatics.

[18]  William L. Jorgensen,et al.  QSAR/QSPR and Proprietary Data , 2006, Journal of Chemical Information and Modeling.

[19]  D. Coddington,et al.  The big deal about big data. , 2013, Healthcare financial management : journal of the Healthcare Financial Management Association.

[20]  Michael J. Keiser,et al.  Predicting new molecular targets for known drugs , 2009, Nature.

[21]  D. Young,et al.  Are the Chemical Structures in Your QSAR Correct , 2008 .

[22]  Alexander Tropsha,et al.  Exploring quantitative nanostructure-activity relationships (QNAR) modeling as a tool for predicting biological effects of manufactured nanoparticles. , 2011, Combinatorial chemistry & high throughput screening.

[23]  D D Allen,et al.  Nanoparticle Technology for Drug Delivery Across the Blood-Brain Barrier , 2002, Drug development and industrial pharmacy.

[24]  David M. Reif,et al.  Profiling of the Tox21 10K compound library for agonists and antagonists of the estrogen receptor alpha signaling pathway , 2014, Scientific Reports.

[25]  Alexander Tropsha,et al.  Trust, But Verify: On the Importance of Chemical Structure Curation in Cheminformatics and QSAR Modeling Research , 2010, J. Chem. Inf. Model..

[26]  Pawel L Urban,et al.  Nanoparticles: their potential toxicity, waste and environmental management. , 2009, Waste management.

[27]  Karen Florini,et al.  Getting It Right the First Time , 2006, Annals of the New York Academy of Sciences.

[28]  Jean-Louis Reymond,et al.  Visualization and Virtual Screening of the Chemical Universe Database GDB-17 , 2013, J. Chem. Inf. Model..

[29]  Richard A. L. Jones Nanotechnology, energy and markets. , 2009, Nature nanotechnology.

[30]  Damian Szklarczyk,et al.  STITCH 3: zooming in on protein–chemical interactions , 2011, Nucleic Acids Res..

[31]  Susumu Goto,et al.  KEGG for integration and interpretation of large-scale molecular data sets , 2011, Nucleic Acids Res..

[32]  Ken Donaldson,et al.  Nanotoxicology: new insights into nanotubes. , 2009, Nature nanotechnology.

[33]  A. Carnero,et al.  High throughput screening in drug discovery , 2006, Clinical & translational oncology : official publication of the Federation of Spanish Oncology Societies and of the National Cancer Institute of Mexico.

[34]  Richard S. Judson,et al.  Profiling the activity of environmental chemicals in prenatal developmental toxicity studies using the U.S. EPA's ToxRefDB. , 2009, Reproductive toxicology.

[35]  Ivan Rusyn,et al.  Predicting drug-induced hepatotoxicity using QSAR and toxicogenomics approaches. , 2011, Chemical research in toxicology.

[36]  Alexander Tropsha,et al.  Best Practices for QSAR Model Development, Validation, and Exploitation , 2010, Molecular informatics.

[37]  D. Fourches,et al.  Successful “In Silico” Design of New Efficient Uranyl Binders , 2007 .

[38]  Alexander Tropsha,et al.  Why Academic Drug Discovery Makes Sense , 2006, Science.

[39]  Igor Linkov,et al.  Nanotoxicology and nanomedicine: making hard decisions. , 2008, Nanomedicine : nanotechnology, biology, and medicine.