TDR Targets 6: driving drug discovery for human pathogens through intensive chemogenomic data integration

Abstract The volume of biological, chemical and functional data deposited in the public domain is growing rapidly, thanks to next-generation sequencing and highly automated screening technologies. These datasets are invaluable resources for drug discovery, particularly for less-studied neglected-disease pathogens. Leveraging them requires smart and intensive data integration to guide computational inferences across diverse organisms. The TDR Targets chemogenomics resource integrates genomic data from human pathogens and model organisms with information on bioactive compounds and their annotated activities. This report highlights the latest updates to the data and functionality available in TDR Targets 6. Based on chemogenomic network models that link inhibitors and targets, the database now incorporates network-driven target prioritizations and novel visualizations of network subgraphs that display chemical- and target-similarity neighborhoods along with the associated target-compound bioactivity links. The available data can be browsed and queried through a new user interface that allows users to prioritize protein targets and chemical inhibitors. As such, TDR Targets now facilitates the investigation of drug repurposing against pathogen targets, which can potentially help identify candidate targets for bioactive compounds whose targets were previously unknown. TDR Targets is available at https://tdrtargets.org.
