Systems Chemical Genetics-Based Drug Discovery: Prioritizing Agents Targeting Multiple/Reliable Disease-Associated Genes as Drug Candidates

Genetic disease genes are considered a promising source of drug targets. Most diseases are caused by more than one pathogenic factor; thus, it is reasonable to consider that chemical agents targeting multiple disease genes are more likely to have desired activities. This is supported by a comprehensive analysis on the relationships between agent activity/druggability and target genetic characteristics. The therapeutic potential of agents increases steadily with increasing number of targeted disease genes, and can be further enhanced by strengthened genetic links between targets and diseases. By using the multi-label classification models for genetics-based drug activity prediction, we provide universal tools for prioritizing drug candidates. All of the documented data and the machine-learning prediction service are available at SCG-Drug (http://zhanglab.hzau.edu.cn/scgdrug).

[1]  David S. Wishart,et al.  DrugBank 4.0: shedding new light on drug metabolism , 2013, Nucleic Acids Res..

[2]  K. Becker,et al.  The Genetic Association Database , 2004, Nature Genetics.

[3]  Sampsa Hautaniemi,et al.  Fast Gene Ontology based clustering for microarray experiments , 2008, BioData Mining.

[4]  Alan R. Aronson,et al.  Effective mapping of biomedical text to the UMLS Metathesaurus: the MetaMap program , 2001, AMIA.

[5]  Andrey Rzhetsky,et al.  DiseaseConnect: a comprehensive web server for mechanism-based disease–disease connections , 2014, Nucleic Acids Res..

[6]  Feng Xu,et al.  Therapeutic target database update 2014: a resource for targeted therapeutics , 2013, Nucleic Acids Res..

[7]  D. Altshuler,et al.  Validating therapeutic targets through human genetics , 2013, Nature Reviews Drug Discovery.

[8]  A. McLysaght,et al.  Spatial Colocalization of Human Ohnolog Pairs Acts to Maintain Dosage-Balance , 2016, Molecular biology and evolution.

[9]  Evolutionary and genetic features of drug targets , 2018, Medicinal research reviews.

[10]  V. Soo,et al.  An ensemble rank learning approach for gene prioritization , 2013, 2013 35th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC).

[11]  Haiyuan Yu,et al.  Three-dimensional reconstruction of protein networks provides insight into human genetic disease , 2012, Nature Biotechnology.

[12]  Mulin Jun Li,et al.  Nature Genetics Advance Online Publication a N a Ly S I S the Support of Human Genetic Evidence for Approved Drug Indications , 2022 .

[13]  Paul A Clemons,et al.  The Connectivity Map: Using Gene-Expression Signatures to Connect Small Molecules, Genes, and Disease , 2006, Science.

[14]  Takashi Makino,et al.  Ohnologs in the human genome are dosage balanced and frequently associated with disease , 2010, Proceedings of the National Academy of Sciences.

[15]  Benjamin J. Raphael,et al.  Pan-Cancer Network Analysis Identifies Combinations of Rare Somatic Mutations across Pathways and Protein Complexes , 2014, Nature Genetics.

[16]  D. Blacker,et al.  Systematic meta-analyses of Alzheimer disease genetic association studies: the AlzGene database , 2007, Nature Genetics.

[17]  Stephen T. C. Wong,et al.  Toward better drug repositioning: prioritizing and integrating existing methods into efficient pipelines. , 2014, Drug discovery today.

[18]  E. Vassos,et al.  Ohnologs are overrepresented in pathogenic copy number mutations , 2013, Proceedings of the National Academy of Sciences.

[19]  P. Visscher,et al.  Integration of summary data from GWAS and eQTL studies predicts complex trait gene targets , 2016, Nature Genetics.

[20]  Zhi-Hua Zhou,et al.  ML-KNN: A lazy learning approach to multi-label learning , 2007, Pattern Recognit..

[21]  T. Makino,et al.  Inference of Causative Genes for Alzheimer’s Disease Due to Dosage Imbalance , 2017, Molecular biology and evolution.

[22]  Núria Queralt-Rosinach,et al.  DisGeNET: a discovery platform for the dynamical exploration of human diseases and their genes , 2015, Database J. Biol. Databases Curation.

[23]  L. Cardon,et al.  Use of genome-wide association studies for drug repositioning , 2012, Nature Biotechnology.

[24]  Hong-yu Zhang,et al.  Rational drug repositioning by medical genetics , 2013, Nature Biotechnology.

[25]  A. Hopkins Network pharmacology: the next paradigm in drug discovery. , 2008, Nature chemical biology.

[26]  A. Barabasi,et al.  Drug—target network , 2007, Nature Biotechnology.

[27]  Ted Pedersen,et al.  UMLS-Interface and UMLS-Similarity : Open Source Software for Measuring Paths and Semantic Similarity , 2009, AMIA.

[28]  Pjanic Milos,et al.  A gene-based association method for mapping traits using reference transcriptome data through genetically regulated expression (GReX) component, PrediXcan , 2017 .

[29]  John P A Ioannidis,et al.  Systematic meta-analyses and field synopsis of genetic association studies in schizophrenia: the SzGene database , 2008, Nature Genetics.

[30]  A Chemical-Genetic Criterion for Identifying Disease Biomarkers. , 2016, Trends in molecular medicine.

[31]  Weidong Tian,et al.  Explaining the disease phenotype of intergenic SNP through predicted long range regulation , 2016, Nucleic acids research.

[32]  Daniel S. Himmelstein,et al.  Understanding multicellular function and disease with human tissue-specific networks , 2015, Nature Genetics.

[33]  Marie-Pierre Dubé,et al.  Human monogenic disorders — a source of novel drug targets , 2006, Nature Reviews Genetics.

[34]  Yanqing Niu,et al.  Accurate Prediction of Immunogenic T-Cell Epitopes from Epitope Sequences Using the Genetic Algorithm-Based Ensemble Learning , 2015, PloS one.

[35]  Simon M Lin,et al.  Opportunities for drug repositioning from phenome-wide association studies , 2015, Nature Biotechnology.

[36]  Feng Liu,et al.  Predicting drug side effects by multi-label learning and ensemble learning , 2015, BMC Bioinformatics.

[37]  Jun S. Liu,et al.  Genetics of rheumatoid arthritis contributes to biology and drug discovery , 2013 .

[38]  Bin Li,et al.  Identification of transcription factors for drug-associated gene modules and biomedical implications , 2014, Bioinform..

[39]  Chuong B. Do,et al.  Comprehensive Research Synopsis and Systematic Meta-Analyses in Parkinson's Disease Genetics: The PDGene Database , 2012, PLoS genetics.

[40]  I. Longden,et al.  EMBOSS: the European Molecular Biology Open Software Suite. , 2000, Trends in genetics : TIG.

[41]  Nicholas C. Ide,et al.  The ClinicalTrials.gov results database--update and key issues. , 2011, The New England journal of medicine.

[42]  Hong-yu Zhang,et al.  Can medical genetics and evolutionary biology inspire drug target identification? , 2012, Trends in molecular medicine.

[43]  A. Barabasi,et al.  Network-based in silico drug efficacy screening , 2016, Nature Communications.

[44]  Deanna M. Church,et al.  ClinVar: public archive of relationships among sequence variation and human phenotype , 2013, Nucleic Acids Res..

[45]  Deng Pan,et al.  DGIdb 2.0: mining clinically relevant drug–gene interactions , 2015, Nucleic Acids Res..

[46]  Zili Zhang,et al.  Sample Subset Optimization Techniques for Imbalanced and Ensemble Learning Problems in Bioinformatics Applications , 2014, IEEE Transactions on Cybernetics.

[47]  Kaanan P. Shah,et al.  A gene-based association method for mapping traits using reference transcriptome data , 2015, Nature Genetics.