The Interweaving of Cheminformatics and HTS.

The aim of this chapter is to describe the stages of early drug discovery that can be assisted by techniques commonly used in the field of cheminformatics. In fact, cheminformatics tools can be applied all the way from the design of compound libraries and the analysis of HTS results, to the discovery of functional relationships between compounds and their targets.

[1]  Jürgen Bajorath,et al.  Development of a Fingerprint Reduction Approach for Bayesian Similarity Searching Based on Kullback-Leibler Divergence Analysis , 2009, J. Chem. Inf. Model..

[2]  Felix Deanda,et al.  Kinase-Targeted Library Design through the Application of the PharmPrint Methodology , 2008, J. Chem. Inf. Model..

[3]  Y. Martin Diverse viewpoints on computational aspects of molecular diversity. , 2001, Journal of combinatorial chemistry.

[4]  D. Pereira,et al.  Origin and evolution of high throughput screening , 2007, British journal of pharmacology.

[5]  S. Schreiber,et al.  Target-oriented and diversity-oriented organic synthesis in drug discovery. , 2000, Science.

[6]  Tudor I. Oprea,et al.  Chemical database preparation for compound acquisition or virtual screening. , 2006, Methods in molecular biology.

[7]  J. Burch,et al.  High-content imaging characterization of cell cycle therapeutics through in vitro and in vivo subpopulation analysis , 2008, Molecular Cancer Therapeutics.

[8]  Scott Boyer,et al.  Ligand-Based Approach to In Silico Pharmacology: Nuclear Receptor Profiling , 2006, J. Chem. Inf. Model..

[9]  Herbert Waldmann,et al.  Compound library development guided by protein structure similarity clustering and natural product structure. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[10]  Jürgen Bajorath,et al.  Bit Silencing in Fingerprints Enables the Derivation of Compound Class-Directed Similarity Metrics , 2008, J. Chem. Inf. Model..

[11]  Marc Bickle,et al.  Image Analysis in High Content Screening , 2009 .

[12]  Christoph Steinbeck,et al.  Classification and comparison of ligand-binding sites derived from grid-mapped knowledge-based potentials. , 2006, Journal of molecular graphics & modelling.

[13]  Dimitris K. Agrafiotis,et al.  Design and Prioritization of Plates for High-Throughput Screening , 2001, J. Chem. Inf. Comput. Sci..

[14]  J. Jenkins,et al.  A 3D similarity method for scaffold hopping from known drugs or natural ligands to new chemotypes. , 2004, Journal of medicinal chemistry.

[15]  G. Rishton Reactive compounds and in vitro false positives in HTS , 1997 .

[16]  Sara H Fitzgerald,et al.  Survey of the diversity space coverage of reported combinatorial libraries. , 2007, Journal of combinatorial chemistry.

[17]  Didier Rognan,et al.  Assessing the Scaffold Diversity of Screening Libraries , 2006, J. Chem. Inf. Model..

[18]  Tudor I. Oprea,et al.  Strategies for compound selection. , 2004, Current drug discovery technologies.

[19]  T. Insel,et al.  NIH Molecular Libraries Initiative , 2004, Science.

[20]  Aideen Long,et al.  Statistical methods for analysis of high-throughput RNA interference screens , 2009, Nature Methods.

[21]  Thomas Sander,et al.  Comparison of Ligand- and Structure-Based Virtual Screening on the DUD Data Set , 2009, J. Chem. Inf. Model..

[22]  Eric Maréchal Chemogenomics: a discipline at the crossroad of high throughput technologies, biomarker research, combinatorial chemistry, genomics, cheminformatics, bioinformatics and artificial intelligence. , 2008, Combinatorial chemistry & high throughput screening.

[23]  Edgar Jacoby,et al.  The 7 TM G‐Protein‐Coupled Receptor Target Family , 2006, ChemMedChem.

[24]  Anne Mai Wassermann,et al.  Searching for Target-Selective Compounds Using Different Combinations of Multiclass Support Vector Machine Ranking Methods, Kernel Functions, and Fingerprint Descriptors , 2009, J. Chem. Inf. Model..

[25]  A. Carnero,et al.  High throughput screening in drug discovery , 2006, Clinical & translational oncology : official publication of the Federation of Spanish Oncology Societies and of the National Cancer Institute of Mexico.

[26]  Bin Chen,et al.  Gaining Insight into Off-Target Mediated Effects of Drug Candidates with a Comprehensive Systems Chemical Biology Analysis , 2009, J. Chem. Inf. Model..

[27]  Daniel Rauh,et al.  An Unbiased Cell Morphology–Based Screen for New, Biologically Active Small Molecules , 2005, PLoS biology.

[28]  Charles Y. Tao,et al.  A Support Vector Machine Classifier for Recognizing Mitotic Subphases Using High-Content Screening Data , 2007, Journal of biomolecular screening.

[29]  Christopher P Austin,et al.  High-throughput screening assays for the identification of chemical probes. , 2007, Nature chemical biology.

[30]  Peter G Schultz,et al.  Reprogramming of murine fibroblasts to induced pluripotent stem cells with chemical complementation of Klf4 , 2009, Proceedings of the National Academy of Sciences.

[31]  James L. Melville,et al.  Novel lead structures for p38 MAP kinase via FieldScreen virtual screening. , 2009, Journal of medicinal chemistry.

[32]  J. Mesirov,et al.  Automated high-dimensional flow cytometric data analysis , 2009, Proceedings of the National Academy of Sciences.

[33]  Oliver Dürr,et al.  Robust Hit Identification by Quality Assurance and Multivariate Data Analysis of a High-Content, Cell-Based Assay , 2007, Journal of biomolecular screening.

[34]  William J. Welch,et al.  Uniform Coverage Designs for Molecule Selection , 2002, Technometrics.

[35]  Cynthia L Adams,et al.  Compound classification using image-based cellular phenotypes. , 2006, Methods in enzymology.

[36]  B. Shoichet,et al.  Identification and prediction of promiscuous aggregating inhibitors among known drugs. , 2003, Journal of medicinal chemistry.

[37]  Andreas Bender,et al.  "Bayes Affinity Fingerprints" Improve Retrieval Rates in Virtual Screening and Define Orthogonal Bioactivity Space: When Are Multitarget Drugs a Feasible Concept? , 2006, J. Chem. Inf. Model..

[38]  Zhen Li,et al.  A comparison of machine learning algorithms for chemical toxicity classification using a simulated multi-scale data model , 2008, BMC Bioinformatics.

[39]  Veerabahu Shanmugasundaram,et al.  Molecular similarity measures. , 2011, Methods in molecular biology.

[40]  Rajarshi Guha,et al.  Structure-Activity Landscape Index: Identifying and Quantifying Activity Cliffs , 2008, J. Chem. Inf. Model..

[41]  Eugen Lounkine,et al.  Improving the Search Performance of Extended Connectivity Fingerprints through Activity‐Oriented Feature Filtering and Application of a Bit‐Density‐Dependent Similarity Function , 2009, ChemMedChem.

[42]  T. Guo,et al.  Privileged structure-based combinatorial libraries targeting G protein-coupled receptors. , 2003, Assay and drug development technologies.

[43]  Mark Johnson,et al.  Chemotypic Coverage: A New Basis for Constructing Screening Sublibraries , 2009, J. Chem. Inf. Model..

[44]  A. Fliri,et al.  Biospectra analysis: model proteome characterizations for linking molecular structure and biological response. , 2005, Journal of medicinal chemistry.

[45]  C. Harris,et al.  How large does a compound screening collection need to be? , 2008, Combinatorial chemistry & high throughput screening.

[46]  G. Bemis,et al.  The properties of known drugs. 1. Molecular frameworks. , 1996, Journal of medicinal chemistry.

[47]  U Schopfer,et al.  Molecular diversity management strategies for building and enhancement of diverse and focused lead discovery compound screening collections. , 2004, Combinatorial chemistry & high throughput screening.

[48]  Peter Willett,et al.  Analysis of Data Fusion Methods in Virtual Screening: Similarity and Group Fusion , 2006, J. Chem. Inf. Model..

[49]  Suzanne K. Schreyer,et al.  Data Shaving: A Focused Screening Approach , 2004, J. Chem. Inf. Model..

[50]  J. Blake,et al.  Chemoinformatics - predicting the physicochemical properties of 'drug-like' molecules. , 2000, Current opinion in biotechnology.

[51]  H. Verheij,et al.  Leadlikeness and structural diversity of synthetic screening libraries , 2006, Molecular Diversity.

[52]  Yun He,et al.  Learning from the Data: Mining of Large High-Throughput Screening Databases , 2006, J. Chem. Inf. Model..

[53]  D. Kell,et al.  'Metabolite-likeness' as a criterion in the design and selection of pharmaceutical drug libraries. , 2009, Drug discovery today.

[54]  Ricardo Macarron,et al.  Critical review of the role of HTS in drug discovery. , 2006, Drug discovery today.

[55]  D. C. McShan Towards Inference of a Biochemical Ontology From a Metabolic Database , 2005, Comparative and functional genomics.

[56]  Stefan Wetzel,et al.  Bioactivity-guided mapping and navigation of chemical space. , 2009, Nature chemical biology.

[57]  William J. Welsh,et al.  New Predictive Models for Blood–Brain Barrier Permeability of Drug-like Molecules , 2008, Pharmaceutical Research.

[58]  Herbert Waldmann,et al.  Protein structure similarity clustering (PSSC) and natural product structure as inspiration sources for drug development and chemical genomics. , 2005, Current opinion in chemical biology.

[59]  Meir Glick,et al.  Enrichment of High-Throughput Screening Data with Increasing Levels of Noise Using Support Vector Machines, Recursive Partitioning, and Laplacian-Modified Naive Bayesian Classifiers , 2006, J. Chem. Inf. Model..

[60]  A. Fliri,et al.  Biological spectra analysis: Linking biological activity profiles to molecular structure. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[61]  Rajiv Sharma,et al.  Design and synthesis of protein superfamily-targeted chemical libraries for lead identification and optimization. , 2005, Current medicinal chemistry.

[62]  Gisbert Schneider,et al.  Scaffold diversity of natural products: inspiration for combinatorial library design. , 2008, Natural product reports.

[63]  J. Irwin,et al.  Benchmarking sets for molecular docking. , 2006, Journal of medicinal chemistry.

[64]  Robert Gentleman,et al.  flowCore: a Bioconductor package for high throughput flow cytometry , 2009, BMC Bioinformatics.

[65]  A. Bender,et al.  Modeling Promiscuity Based on in vitro Safety Pharmacology Profiling Data , 2007, ChemMedChem.

[66]  Chris Williams,et al.  Reverse fingerprinting, similarity searching by group fusion and fingerprint bit importance , 2006, Molecular Diversity.

[67]  Andreas Bender,et al.  Plate-Based Diversity Selection Based on Empirical HTS Data to Enhance the Number of Hits and Their Chemical Diversity , 2009, Journal of biomolecular screening.

[68]  S. Schreiber,et al.  Small molecules efficiently direct endodermal differentiation of mouse and human embryonic stem cells. , 2009, Cell stem cell.

[69]  B. Shoichet,et al.  A specific mechanism of nonspecific inhibition. , 2003, Journal of medicinal chemistry.

[70]  J. Lehár,et al.  Synergistic drug combinations improve therapeutic selectivity , 2009, Nature Biotechnology.

[71]  L Xue,et al.  Molecular descriptors in chemoinformatics, computational combinatorial chemistry, and virtual screening. , 2000, Combinatorial chemistry & high throughput screening.

[72]  Thomas Hesterkamp,et al.  A multiplexed approach to hit finding. , 2009, Current opinion in drug discovery & development.

[73]  G. Schneider,et al.  GPCR Targeted Library Design: Novel Dopamine D3 Receptor Ligands , 2007, ChemMedChem.

[74]  Kunal Roy,et al.  Docking and 3D QSAR studies of protoporphyrinogen oxidase inhibitor 3H-pyrazolo[3,4-d][1,2,3]triazin-4-one derivatives , 2010, Journal of molecular modeling.

[75]  Joe Zhongxiang Zhou,et al.  Structure-directed combinatorial library design. , 2008, Current opinion in chemical biology.

[76]  Alessandro Pedretti,et al.  Assessing drug-likeness--what are we missing? , 2008, Drug discovery today.

[77]  Jürgen Bajorath,et al.  Emerging Chemical Patterns: A New Methodology for Molecular Classification and Compound Selection , 2006, J. Chem. Inf. Model..

[78]  Heather S. Gruver,et al.  Data analysis of high-throughput screening results: application of multidomain clustering to the NCI anti-HIV data set. , 2002, Journal of medicinal chemistry.

[79]  G. Maggiora,et al.  Hit-directed nearest-neighbor searching. , 2005, Journal of medicinal chemistry.

[80]  N Fotouhi,et al.  Application and utilization of chemoinformatics tools in lead generation and optimization. , 2006, Combinatorial chemistry & high throughput screening.

[81]  Edgar Jacoby,et al.  Chemogenomics knowledge-based strategies in drug discovery. , 2003, Drug news & perspectives.

[82]  Lani F. Wu,et al.  Image-based multivariate profiling of drug responses from single cells , 2007, Nature Methods.

[83]  Jing Li,et al.  Novel Statistical Approach for Primary High-Throughput Screening Hit Selection , 2005, J. Chem. Inf. Model..

[84]  Sangsoo Kim,et al.  GS2PATH: A web-based integrated analysis tool for finding functional relationships using gene ontology and biochemical pathway data , 2007, Bioinformation.

[85]  Jürgen Bajorath,et al.  Simulation of sequential screening experiments using emerging chemical patterns. , 2008, Medicinal chemistry (Shariqah (United Arab Emirates)).

[86]  Christos A. Nicolaou,et al.  Analysis of Large Screening Data Sets via Adaptively Grown Phylogenetic-Like Trees , 2002, J. Chem. Inf. Comput. Sci..

[87]  Irini Akritopoulou-Zanze,et al.  Kinase-targeted libraries: the design and synthesis of novel, potent, and selective kinase inhibitors. , 2009, Drug discovery today.

[88]  Andreas Bender,et al.  Understanding False Positives in Reporter Gene Assays: in Silico Chemogenomics Approaches To Prioritize Cell-Based HTS Data , 2007, J. Chem. Inf. Model..

[89]  Kunal Roy,et al.  Docking and 3D-QSAR studies of acetohydroxy acid synthase inhibitor sulfonylurea derivatives , 2010, Journal of molecular modeling.

[90]  Herbert Schulz,et al.  A genome-scale RNAi screen for Oct4 modulators defines a role of the Paf1 complex for embryonic stem cell identity. , 2009, Cell stem cell.

[91]  Robert Smith,et al.  Closing the gap between centralized and decentralized compound management approaches. , 2008, Combinatorial chemistry & high throughput screening.

[92]  M. Milik,et al.  Mapping adverse drug reactions in chemical space. , 2009, Journal of medicinal chemistry.

[93]  P. Clemons,et al.  Chemogenomic data analysis: prediction of small-molecule targets and the advent of biological fingerprint. , 2007, Combinatorial chemistry & high throughput screening.

[94]  Eugen Lounkine,et al.  Relevance of Feature Combinations for Similarity Searching Using General or Activity Class-Directed Molecular Fingerprints , 2009, J. Chem. Inf. Model..

[95]  Mark Johnson,et al.  Using Molecular Equivalence Numbers To Visually Explore Structural Features that Distinguish Chemical Libraries , 2002, J. Chem. Inf. Comput. Sci..

[96]  Tudor I. Oprea,et al.  A crowdsourcing evaluation of the NIH chemical probes. , 2009, Nature chemical biology.

[97]  L Alex Gaither,et al.  Chemogenomics approaches to novel target discovery , 2007, Expert review of proteomics.

[98]  Mathias Wawer,et al.  Navigating structure-activity landscapes. , 2009, Drug discovery today.

[99]  Anne Kümmel,et al.  Integration of Multiple Readouts into the Z' Factor for Assay Quality Assessment , 2010, Journal of biomolecular screening.

[100]  Stefan Wetzel,et al.  The Scaffold Tree - Visualization of the Scaffold Universe by Hierarchical Scaffold Classification , 2007, J. Chem. Inf. Model..

[101]  Jürgen Bajorath,et al.  Recursive Median Partitioning for Virtual Screening of Large Databases , 2003, J. Chem. Inf. Comput. Sci..

[102]  John A. Tallarico,et al.  Integrating high-content screening and ligand-target prediction to identify mechanism of action. , 2008, Nature chemical biology.

[103]  Lorenz M Mayr,et al.  The Future of High-Throughput Screening , 2008, Journal of biomolecular screening.

[104]  Rajarshi Guha,et al.  Chemoinformatic Analysis of Combinatorial Libraries, Drugs, Natural Products, and Molecular Libraries Small Molecule Repository , 2009, J. Chem. Inf. Model..

[105]  Paul A Johnston,et al.  Requirements, features, and performance of high content screening platforms. , 2007, Methods in molecular biology.

[106]  Adrian H Elcock,et al.  Structure selection for protein kinase docking and virtual screening: homology models or crystal structures? , 2006, Current protein & peptide science.

[107]  Peter Ertl,et al.  Natural Product-likeness Score and Its Application for Prioritization of Compound Libraries , 2008, J. Chem. Inf. Model..

[108]  Camille G Wermuth,et al.  Selective optimization of side activities: the SOSA approach. , 2006, Drug discovery today.

[109]  Kai Huang,et al.  Feature reduction for improved recognition of subcellular location patterns in fluorescence microscope images , 2003, SPIE BiOS.

[110]  Adam Yasgar,et al.  Quantitative high-throughput screening: a titration-based approach that efficiently identifies biological activities in large chemical libraries. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[111]  S. Wetzel,et al.  Biology-inspired synthesis of compound libraries , 2008, Cellular and Molecular Life Sciences.

[112]  Caroline Engeloch,et al.  Statistical evaluation of a self-deconvoluting matrix strategy for high-throughput screening of the CXCR3 receptor. , 2005, Assay and drug development technologies.

[113]  L Martin Cloutier,et al.  Bayesian versus Frequentist statistical modeling: a debate for hit selection from HTS campaigns. , 2008, Drug discovery today.

[114]  Stephen Buxser,et al.  Use of mixture distributions to deconvolute the behavior of "hits" and controls in high-throughput screening data. , 2007, Analytical biochemistry.

[115]  Andreas Bender,et al.  “Plate Cherry Picking”: A Novel Semi-Sequential Screening Paradigm for Cheaper, Faster, Information-Rich Compound Selection , 2007, Journal of biomolecular screening.

[116]  Robert Gentleman,et al.  Knowledge based identification of essential signaling from genome-scale siRNA experiments , 2009, BMC Systems Biology.

[117]  Ting Chen,et al.  R-NN Curves: An Intuitive Approach to Outlier Detection Using a Distance Based Method , 2006, J. Chem. Inf. Model..

[118]  Ian A. Watson,et al.  Dissimilarity-based approaches to compound acquisition. , 2008, Current opinion in chemical biology.