HEDD: Human Enhancer Disease Database

Abstract

Enhancers are specialized genomic cis-regulatory elements that activate transcription of their target genes and play an important role in the pathogenesis of many complex human diseases. Although enhancers have recently been identified systematically across the human genome, comprehensive annotation databases that focus on their disease connections are still urgently needed. In response, we built the Human Enhancer Disease Database (HEDD) to facilitate studies of enhancers and their potential roles in complex human diseases. HEDD currently provides comprehensive genomic information for ∼2.8 million human enhancers identified by ENCODE, FANTOM5 and RoadMap, together with disease association scores based on enhancer–gene and gene–disease connections. It also provides Web-based analytical tools to visualize enhancer networks and to score enhancers given a set of selected genes in a specific gene network. HEDD is freely accessible at http://zdzlab.einstein.yu.edu/1/hedd.php.
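
The abstract does not specify how enhancer–gene and gene–disease connections are combined into a disease association score. As a minimal illustrative sketch only, assuming a simple sum of products over shared genes (the function name, the input layout and the weighting scheme are all hypothetical and are not taken from HEDD), such a score could be computed as follows:

```python
from collections import defaultdict


def enhancer_disease_scores(enhancer_gene, gene_disease):
    """Combine enhancer-gene and gene-disease links into enhancer-disease scores.

    Assumption (not from HEDD): the score for an (enhancer, disease) pair is the
    sum, over every gene linked to both, of
    enhancer-gene weight * gene-disease score.
    """
    scores = defaultdict(dict)
    for enhancer, targets in enhancer_gene.items():
        for gene, eg_weight in targets.items():
            # Genes without any disease annotation contribute nothing.
            for disease, gd_score in gene_disease.get(gene, {}).items():
                previous = scores[enhancer].get(disease, 0.0)
                scores[enhancer][disease] = previous + eg_weight * gd_score
    return dict(scores)


# Hypothetical toy inputs: identifiers and weights are made up for illustration.
enhancer_gene = {
    "enh1": {"GENE_A": 0.5, "GENE_B": 1.0},
    "enh2": {"GENE_C": 1.0},
}
gene_disease = {
    "GENE_A": {"disease_1": 0.4},
    "GENE_B": {"disease_1": 0.25, "disease_2": 0.5},
}

result = enhancer_disease_scores(enhancer_gene, gene_disease)
```

In this toy example, enh1 receives a score for both diseases through its two target genes, while enh2 receives none because its only target gene has no disease annotation.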
