eDGAR: a database of Disease-Gene Associations with annotated Relationships among genes

Background: Genetic investigations, boosted by modern sequencing techniques, make it possible to dissect the genetic component of different phenotypic traits. These efforts result in the compilation of lists of genes related to diseases and show that an increasing number of diseases are associated with multiple genes. Investigating functional relations among genes associated with the same disease helps to highlight the molecular mechanisms of pathogenesis.

Results: We present eDGAR, a database collecting and organizing the data on gene/disease associations as derived from OMIM, Humsavar and ClinVar. For each disease-associated gene, eDGAR collects information on its annotation. Specifically, for lists of genes, eDGAR provides information on: i) interactions retrieved from PDB, BIOGRID and STRING; ii) co-occurrence in stable and functional structural complexes; iii) shared Gene Ontology annotations; iv) shared KEGG and REACTOME pathways; v) enriched functional annotations computed with NET-GE; vi) regulatory interactions derived from TRRUST; vii) localization on chromosomes and/or co-localisation in neighboring loci. The present release of eDGAR includes 2672 diseases, related to 3658 different genes, for a total of 5729 gene-disease associations. Of these genes, 71% are linked to 621 multigenic diseases, and eDGAR highlights their common GO terms, KEGG/REACTOME pathways, and physical and regulatory interactions. eDGAR includes a network-based enrichment method for detecting statistically significant functional terms associated with groups of genes.

Conclusions: eDGAR offers a resource to analyze disease-gene associations. In multigenic diseases, genes can share physical interactions and/or co-occur in the same functional processes. eDGAR is freely available at: edgar.biocomp.unibo.it
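As an illustration of the kind of analysis described above, the following minimal Python sketch groups gene-disease associations by disease, keeps the multigenic ones, and reports the annotation terms shared by all genes of a disease. It is not eDGAR's implementation or API: the association pairs, gene names, term identifiers and function names are all hypothetical placeholders chosen for the example.

```python
from collections import defaultdict

# Toy gene-disease associations (hypothetical placeholders, not eDGAR data).
associations = [
    ("disease_A", "GENE1"),
    ("disease_A", "GENE2"),
    ("disease_B", "GENE3"),
]

# Toy annotation terms per gene (placeholder identifiers, not real annotations).
annotations = {
    "GENE1": {"TERM:1", "TERM:2"},
    "GENE2": {"TERM:2", "TERM:3"},
    "GENE3": {"TERM:4"},
}


def genes_by_disease(pairs):
    """Group (disease, gene) pairs into a disease -> set of genes mapping."""
    grouped = defaultdict(set)
    for disease, gene in pairs:
        grouped[disease].add(gene)
    return dict(grouped)


def multigenic(grouped):
    """Keep only diseases associated with more than one gene."""
    return {d: genes for d, genes in grouped.items() if len(genes) > 1}


def shared_terms(genes, gene_annotations):
    """Return the terms annotated to every gene in the given collection."""
    term_sets = [gene_annotations.get(g, set()) for g in genes]
    return set.intersection(*term_sets) if term_sets else set()


if __name__ == "__main__":
    for disease, genes in multigenic(genes_by_disease(associations)).items():
        print(disease, sorted(genes), sorted(shared_terms(genes, annotations)))
```

A plain set intersection only finds terms common to all genes; the statistical enrichment that the abstract attributes to NET-GE is a different, network-based procedure and is not reproduced here.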
