De novo mutations in regulatory elements cause neurodevelopmental disorders

Summary De novo mutations in hundreds of different genes collectively cause 25-42% of severe developmental disorders (DD). The cause in the remaining cases is largely unknown. The role of de novo mutations in regulatory elements affecting known DD associated genes or other genes is essentially unexplored. We identified de novo mutations in three classes of putative regulatory elements in almost 8,000 DD patients. Here we show that de novo mutations in highly conserved fetal-brain active elements are significantly and specifically enriched in neurodevelopmental disorders. We identified a significant two-fold enrichment of recurrently mutated elements. We estimate that, genome-wide, de novo mutations in fetaLbrain active elements are likely to be causal for 1-3% of patients without a diagnostic coding variant and that only a small fraction (<2%) of de novo mutations in these elements are pathogenic. Our findings represent a robust estimate of the contribution of de novo mutations in regulatory elements to this genetically heterogeneous set of disorders, and emphasise the importance of combining functional and evolutionary evidence to delineate regulatory causes of genetic disorders.

[1]  Jean YH Yang,et al.  Bioconductor: open software development for computational biology and bioinformatics , 2004, Genome Biology.

[2]  Shamil R Sunyaev,et al.  Most rare missense alleles are deleterious in humans: implications for complex disease and association studies. , 2007, American journal of human genetics.

[3]  C. Baker,et al.  Genome Sequencing of Autism-Affected Families Reveals Disruption of Putative Noncoding Regulatory DNA. , 2016, American journal of human genetics.

[4]  Bronwen L. Aken,et al.  GENCODE: The reference human genome annotation for The ENCODE Project , 2012, Genome research.

[5]  K. Pollard,et al.  Enhancer–promoter interactions are encoded by complex genomic signatures on looping chromatin , 2016, Nature Genetics.

[6]  Shane J. Neph,et al.  Systematic Localization of Common Disease-Associated Variation in Regulatory DNA , 2012, Science.

[7]  Robert Gentleman,et al.  Software for Computing and Annotating Genomic Ranges , 2013, PLoS Comput. Biol..

[8]  Deciphering Developmental Disorders Study,et al.  Prevalence and architecture of de novo mutations in developmental disorders , 2017, Nature.

[9]  Alejandro Sifrim,et al.  Genetic diagnosis of developmental disorders in the DDD study: a scalable analysis of genome-wide research data , 2015, The Lancet.

[10]  Michael Q. Zhang,et al.  Integrative analysis of 111 reference human epigenomes , 2015, Nature.

[11]  David I. Wilson,et al.  Long-range evolutionary constraints reveal cis-regulatory interactions on the human X chromosome , 2015, Nature Communications.

[12]  Daning Lu,et al.  Chromosome conformation elucidates regulatory relationships in developing human brain , 2016, Nature.

[13]  Nga Thi Thuy Nguyen,et al.  Genomicus update 2015: KaryoView and MatrixView provide a genome-wide perspective to multispecies comparative genomics , 2014, Nucleic Acids Res..

[14]  Ellen T. Gelfand,et al.  The Genotype-Tissue Expression (GTEx) project , 2013, Nature Genetics.

[15]  Inna Dubchak,et al.  VISTA Enhancer Browser—a database of tissue-specific human enhancers , 2006, Nucleic Acids Res..

[16]  D. Haussler,et al.  Evolutionarily conserved elements in vertebrate, insect, worm, and yeast genomes. , 2005, Genome research.

[17]  J. Shendure,et al.  A general framework for estimating the relative pathogenicity of human genetic variants , 2014, Nature Genetics.

[18]  Morad Ansari,et al.  Discovery of four recessive developmental disorders using probabilistic genotype and phenotype matching among 4,125 families , 2015, Nature Genetics.

[19]  A. Munnich,et al.  Disruption of a long distance regulatory region upstream of SOX9 in isolated disorders of sex development , 2011, Journal of Medical Genetics.

[20]  A. McCallion,et al.  Genomics of long-range regulatory elements. , 2010, Annual review of genomics and human genetics.

[21]  F. Collins,et al.  Potential etiologic and functional implications of genome-wide association loci for human diseases and traits , 2009, Proceedings of the National Academy of Sciences.

[22]  Giorgio Valentini,et al.  A Whole-Genome Analysis Framework for Effective Identification of Pathogenic Regulatory Variants in Mendelian Disease. , 2016, American journal of human genetics.

[23]  J. Dekker,et al.  The long-range interaction landscape of gene promoters , 2012, Nature.

[24]  Boris Lenhard,et al.  Arrays of ultraconserved non-coding regions span the loci of key developmental genes in vertebrate genomes , 2004, BMC Genomics.

[25]  K. Pollard,et al.  Detection of nonneutral substitution rates on mammalian phylogenies. , 2010, Genome research.

[26]  Soher Balkhy,et al.  Mutations in Human Accelerated Regions Disrupt Cognition and Social Behavior , 2016, Cell.

[27]  Xia Li,et al.  Regulation of a remote Shh forebrain enhancer by the Six3 homeoprotein , 2008, Nature Genetics.

[28]  Eric S. Lander,et al.  A polygenic burden of rare disruptive mutations in schizophrenia , 2014, Nature.

[29]  Tom R. Gaunt,et al.  The UK10K project identifies rare variants in health and disease , 2016 .

[30]  W. Wasserman,et al.  Identification of altered cis-regulatory elements in human disease. , 2015, Trends in genetics : TIG.

[31]  Arthur Wuster,et al.  DeNovoGear: de novo indel and point mutation discovery and phasing , 2013, Nature Methods.

[32]  David J. Arenillas,et al.  JASPAR 2016: a major expansion and update of the open-access database of transcription factor binding profiles , 2015, Nucleic Acids Res..

[33]  Stefan Mundlos,et al.  Looking beyond the genes: the role of non-coding variants in human disease. , 2016, Human molecular genetics.

[34]  Timothy L. Bailey,et al.  Motif Enrichment Analysis: a unified framework and an evaluation on ChIP data , 2010, BMC Bioinformatics.

[35]  Manolis Kellis,et al.  ChromHMM: automating chromatin-state discovery and characterization , 2012, Nature Methods.

[36]  A. Visel,et al.  Large-Scale Discovery of Enhancers from Human Heart Tissue , 2011, Nature Genetics.

[37]  B. Oostra,et al.  A long-range Shh enhancer regulates expression in the developing limb and fin and is associated with preaxial polydactyly. , 2003, Human molecular genetics.

[38]  Stephan J Sanders,et al.  A framework for the interpretation of de novo mutation in human disease , 2014, Nature Genetics.

[39]  James Y. Zou Analysis of protein-coding genetic variation in 60,706 humans , 2015, Nature.

[40]  C. Cotsapas,et al.  Integrative genetic and epigenetic analysis uncovers regulatory mechanisms of autoimmune disease , 2016, bioRxiv.

[41]  Tudor Groza,et al.  The Human Phenotype Ontology in 2017 , 2016, Nucleic Acids Res..

[42]  L. Lettice,et al.  Alterations to the remote control of Shh gene expression cause congenital abnormalities , 2013, Philosophical Transactions of the Royal Society B: Biological Sciences.

[43]  Anna Murray,et al.  Recessive mutations in a distal PTF1A enhancer cause isolated pancreatic agenesis , 2013, Nature Genetics.

[44]  Colin Campbell,et al.  An integrative approach to predicting the functional effects of non-coding and coding sequence variation , 2015, Bioinform..

[45]  Margaret B. Fish,et al.  Disruption of autoregulatory feedback by a mutation in a remote, ultraconserved PAX6 enhancer causes aniridia. , 2013, American journal of human genetics.