EpiDenovo: a platform for linking regulatory de novo mutations to developmental epigenetics and diseases

Abstract De novo mutations (DNMs) have been shown to be a major cause of severe early-onset genetic disorders such as autism spectrum disorder and intellectual disability. Over one million DNMs have been identified in developmental disorders by next generation sequencing, but linking these DNMs to the genes that they impact remains a challenge, as the majority of them are embedded in non-coding regions. As most developmental diseases occur in the early stages of development or during childhood, it is crucial to clarify the details of epigenetic regulation in early development in order to interpret the mechanisms underlying developmental disorders. Here, we develop EpiDenovo, a database that is freely available at http://www.epidenovo.biols.ac.cn/, and which provides the associations between embryonic epigenomes and DNMs in developmental disorders, including several neuropsychiatric disorders and congenital heart disease. EpiDenovo provides an easy-to-use web interface allowing users rapidly to find the epigenetic signatures of DNMs and the expression patterns of the genes that they regulate during embryonic development. In summary, EpiDenovo is a useful resource for selecting candidate genes for further functional studies in embryonic development, and for investigating regulatory DNMs as well as other genetic variants causing or underlying developmental disorders.

[1]  C. Glass,et al.  The selection and function of cell type-specific enhancers , 2015, Nature Reviews Molecular Cell Biology.

[2]  Ji Wu,et al.  Reproductive Biology and Endocrinology Open Access Epigenetic Regulation in Mammalian Preimplantation Embryo Development , 2022 .

[3]  T. Mikkelsen,et al.  The NIH Roadmap Epigenomics Mapping Consortium , 2010, Nature Biotechnology.

[4]  S. Salzberg,et al.  StringTie enables improved reconstruction of a transcriptome from RNA-seq reads , 2015, Nature Biotechnology.

[5]  Davis J. McCarthy,et al.  Differential expression analysis of multifactor RNA-Seq experiments with respect to biological variation , 2012, Nucleic acids research.

[6]  C. Glass,et al.  Simple combinations of lineage-determining transcription factors prime cis-regulatory elements required for macrophage and B cell identities. , 2010, Molecular cell.

[7]  Alexander E. Kel,et al.  GTRD: a database of transcription factor binding sites identified by ChIP-seq experiments , 2016, Nucleic Acids Res..

[8]  Alexander Hoischen,et al.  New insights into the generation and role of de novo mutations in health and disease , 2016, Genome Biology.

[9]  Wei Wang,et al.  Predicting enhancer transcription and activity from chromatin modifications , 2013, Nucleic acids research.

[10]  Gonçalo R. Abecasis,et al.  The Sequence Alignment/Map format and SAMtools , 2009, Bioinform..

[11]  Guangchuang Yu,et al.  clusterProfiler: an R package for comparing biological themes among gene clusters. , 2012, Omics : a journal of integrative biology.

[12]  Jean-Philippe Vert,et al.  HiC-Pro: an optimized and flexible pipeline for Hi-C data processing , 2015, Genome Biology.

[13]  K. Kurimoto,et al.  Epigenetic reprogramming in mouse pre-implantation development and primordial germ cells , 2012, Development.

[14]  Sean R. Davis,et al.  NCBI GEO: archive for functional genomics data sets—update , 2012, Nucleic Acids Res..

[15]  Kenta Nakai,et al.  DBTMEE: a database of transcriptome in mouse early embryos , 2014, Nucleic Acids Res..

[16]  Zhong Sheng Sun,et al.  De novo mutations involved in post-transcriptional dysregulation contribute to six neuropsychiatric disorders , 2017 .

[17]  Eurie L. Hong,et al.  Annotation of functional variation in personal genomes using RegulomeDB , 2012, Genome research.

[18]  Li Teng,et al.  4DGenome: a comprehensive database of chromatin interactions , 2015, Bioinform..

[19]  Cheng Quan,et al.  3DSNP: a database for linking human noncoding SNPs to their three-dimensional interacting genes , 2016, Nucleic Acids Res..

[20]  Raphael A. Bernier,et al.  denovo-db: a compendium of human de novo variants , 2016, Nucleic Acids Res..

[21]  Daning Lu,et al.  Chromosome conformation elucidates regulatory relationships in developing human brain , 2016, Nature.

[22]  Caroline F. Wright,et al.  De novo mutations in regulatory elements cause neurodevelopmental disorders , 2017, bioRxiv.

[23]  S. Horvath,et al.  Genetic programs in human and mouse early embryos revealed by single-cell RNA sequencing , 2013, Nature.

[24]  Hemi Malkki,et al.  Neurodevelopmental disorders: Altered epigenetic regulation in early development associated with schizophrenia , 2016, Nature Reviews Neurology.

[25]  Qinjun Huang,et al.  PoplarGene: poplar gene network and resource for mining functional information for genes from woody plants , 2016, Scientific Reports.

[26]  Aaron R. Quinlan,et al.  Bioinformatics Applications Note Genome Analysis Bedtools: a Flexible Suite of Utilities for Comparing Genomic Features , 2022 .

[27]  Thomas R. Gingeras,et al.  STAR: ultrafast universal RNA-seq aligner , 2013, Bioinform..

[28]  Fidel Ramírez,et al.  deepTools: a flexible platform for exploring deep-sequencing data , 2014, Nucleic Acids Res..

[29]  Jie Wang,et al.  Factorbook.org: a Wiki-based database for transcription factor-binding data generated by the ENCODE consortium , 2012, Nucleic Acids Res..

[30]  Wei Xie,et al.  The landscape of accessible chromatin in mammalian preimplantation embryos , 2016, Nature.

[31]  Data production leads,et al.  An integrated encyclopedia of DNA elements in the human genome , 2012 .

[32]  Steve Horvath,et al.  WGCNA: an R package for weighted correlation network analysis , 2008, BMC Bioinformatics.

[33]  Marco-Antonio Mendoza-Parra,et al.  Antibody performance in ChIP-sequencing assays: From quality scores of public data sets to quantitative certification. , 2016, F1000Research.

[34]  David G Hendrickson,et al.  Differential analysis of gene regulation at transcript resolution with RNA-seq , 2012, Nature Biotechnology.

[35]  Xianfeng Li,et al.  RBP-Var: a database of functional variants involved in regulation mediated by RNA-binding proteins , 2015, Nucleic Acids Res..

[36]  Florian Halbritter,et al.  GeneProf data: a resource of curated, integrated and reusable high-throughput genomics experiments , 2013, Nucleic Acids Res..

[37]  João Pedro de Magalhães,et al.  Gene co-expression analysis for functional classification and gene–disease predictions , 2017, Briefings Bioinform..

[38]  Tao Liu,et al.  Cistrome Data Browser: a data portal for ChIP-Seq and chromatin accessibility data in human and mouse , 2016, Nucleic Acids Res..

[39]  Heinrich Sticht,et al.  De novo mutations in the genome organizer CTCF cause intellectual disability. , 2013, American journal of human genetics.

[40]  Mark D. Robinson,et al.  edgeR: a Bioconductor package for differential expression analysis of digital gene expression data , 2009, Bioinform..

[41]  Li Teng,et al.  4DGenome: a comprehensive database of chromatin interactions , 2015, Bioinform..

[42]  Ryan M. Layer,et al.  SpeedSeq: Ultra-fast personal genome analysis and interpretation , 2014, Nature Methods.

[43]  Jesse M. Engreitz,et al.  A Genetic Variant Associated with Five Vascular Diseases Is a Distal Regulator of Endothelin-1 Gene Expression , 2017, Cell.

[44]  Adam Burton,et al.  Chromatin dynamics in the regulation of cell fate allocation during early embryogenesis , 2014, Nature Reviews Molecular Cell Biology.

[45]  Hui Zhou,et al.  ChIPBase v2.0: decoding transcriptional regulatory networks of non-coding RNAs and protein-coding genes from ChIP-seq data , 2016, Nucleic Acids Res..

[46]  D. Geschwind,et al.  Histone Acetylome-wide Association Study of Autism Spectrum Disorder , 2016, Cell.

[47]  Yong Zhang,et al.  Identifying ChIP-seq enrichment using MACS , 2012, Nature Protocols.

[48]  Michel Botbol,et al.  Gene × Environment Interactions in Autism Spectrum Disorders: Role of Epigenetic Mechanisms , 2014, Front. Psychiatry.

[49]  Iuliana Ionita-Laza,et al.  De Novo Synonymous Mutations in Regulatory Elements Contribute to the Genetic Etiology of Autism and Schizophrenia , 2016, Neuron.

[50]  L. Mirny,et al.  Iterative Correction of Hi-C Data Reveals Hallmarks of Chromosome Organization , 2012, Nature Methods.

[51]  Cole Trapnell,et al.  Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation. , 2010, Nature biotechnology.

[52]  ENCODEConsortium,et al.  An Integrated Encyclopedia of DNA Elements in the Human Genome , 2012, Nature.

[53]  Mohammad M. Karimi,et al.  An ultra-low-input native ChIP-seq protocol for genome-wide profiling of rare cell populations , 2015, Nature Communications.