Genomic Selective Constraints in Murid Noncoding DNA

Recent work has suggested that there are many more selectively constrained, functional noncoding than coding sites in mammalian genomes. However, little is known about how selective constraint varies amongst different classes of noncoding DNA. We estimated the magnitude of selective constraint on a large dataset of mouse-rat gene orthologs and their surrounding noncoding DNA. Our analysis indicates that there are more than three times as many selectively constrained, nonrepetitive sites within noncoding DNA as in coding DNA in murids. The majority of these constrained noncoding sites appear to be located within intergenic regions, at distances greater than 5 kilobases from known genes. Our study also shows that in murids, intron length and mean intronic selective constraint are negatively correlated with intron ordinal number. Our results therefore suggest that functional intronic sites tend to accumulate toward the 5′ end of murid genes. Our analysis also reveals that mean number of selectively constrained noncoding sites varies substantially with the function of the adjacent gene. We find that, among others, developmental and neuronal genes are associated with the greatest numbers of putatively functional noncoding sites compared with genes involved in electron transport and a variety of metabolic processes. Combining our estimates of the total number of constrained coding and noncoding bases we calculate that over twice as many deleterious mutations have occurred in intergenic regions as in known genic sequence and that the total genomic deleterious point mutation rate is 0.91 per diploid genome, per generation. This estimated rate is over twice as large as a previous estimate in murids.

[1]  Aleksey Y Ogurtsov,et al.  Selection in favor of nucleotides G and C diversifies evolution rates and levels of polymorphism at mammalian synonymous sites. , 2006, Journal of theoretical biology.

[2]  S. Boissinot,et al.  Fitness cost of LINE-1 (L1) activity in humans. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[3]  D. Haussler,et al.  A distal enhancer and an ultraconserved exon are derived from a novel retroposon , 2006, Nature.

[4]  A. Vinogradov "Genome design" model: evidence from conserved intronic sequence in human-mouse comparison. , 2006, Genome research.

[5]  E. Lander,et al.  A large family of ancient repeat elements in the human genome is under strong selection. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[6]  L. Hurst,et al.  Evidence for purifying selection against synonymous mutations in mammalian exonic splicing enhancers. , 2006, Molecular biology and evolution.

[7]  Shamil Sunyaev,et al.  Evolutionary constraints in conserved nongenic sequences of mammals. , 2005, Genome research.

[8]  N. Bresolin,et al.  Fixation of conserved sequences shapes human intron size and influences transposon-insertion dynamics. , 2005, Trends in genetics : TIG.

[9]  Jean L. Chang,et al.  Initial sequence of the chimpanzee genome and comparison with the human genome , 2005, Nature.

[10]  L. Hurst,et al.  Evidence for selection on synonymous mutations affecting stability of mRNA secondary structure in mammals , 2005, Genome Biology.

[11]  D. Haussler,et al.  Evolutionarily conserved elements in vertebrate, insect, worm, and yeast genomes. , 2005, Genome research.

[12]  Daniel J. Gaffney,et al.  The scale of mutational variation in the murid genome. , 2005, Genome research.

[13]  L. Hurst,et al.  Gametophytic Selection in Arabidopsis thaliana Supports the Selective Model of Intron Length Reduction , 2005, PLoS genetics.

[14]  S. Batzoglou,et al.  Distribution and intensity of constraint in mammalian genomic sequence. , 2005, Genome research.

[15]  Hans Ellegren,et al.  Male-driven biased gene conversion governs the evolution of base composition in human alu repeats. , 2005, Molecular biology and evolution.

[16]  Laurent Duret,et al.  Homology-dependent methylation in primate repetitive DNA. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[17]  A. Reymond,et al.  Conserved non-genic sequences — an unexpected feature of mammalian genomes , 2005, Nature Reviews Genetics.

[18]  M. Lercher,et al.  Explorer Evidence for Widespread Degradation of Gene Control Regions in Hominid Genomes , 2015 .

[19]  Hiroaki Kitano,et al.  The PANTHER database of protein families, subfamilies, functions and pathways , 2004, Nucleic Acids Res..

[20]  Damian Smedley,et al.  Ensembl 2005 , 2004, Nucleic Acids Res..

[21]  Dirk Holste,et al.  Single Nucleotide Polymorphism–Based Validation of Exonic Splicing Enhancers , 2004, PLoS biology.

[22]  L. Hurst,et al.  Similar rates but different modes of sequence evolution in introns and at exonic silent sites in rodents: evidence for selectively driven codon usage. , 2004, Molecular biology and evolution.

[23]  D. Haussler,et al.  Ultraconserved Elements in the Human Genome , 2004, Science.

[24]  A. Vinogradov Compactness of human housekeeping genes: selection for economy or genomic design? , 2004, Trends in genetics : TIG.

[25]  Lisa M. D'Souza,et al.  Genome sequence of the Brown Norway rat yields insights into mammalian evolution , 2004, Nature.

[26]  S. Batzoglou,et al.  Characterization of evolutionary rates and constraints in three Mammalian genomes. , 2004, Genome research.

[27]  S. Carroll,et al.  The regulatory content of intergenic DNA shapes genome architecture , 2004, Genome Biology.

[28]  D. Haussler,et al.  Article Identification and Characterization of Multi-Species Conserved Sequences , 2022 .

[29]  Alexandre Reymond,et al.  Evolutionary Discrimination of Mammalian Conserved Non-Genic Sequences (CNGs) , 2003, Science.

[30]  Daniel J. Gaffney,et al.  Functional constraints and frequency of deleterious mutations in noncoding DNA of rodents , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[31]  Araxi O. Urrutia,et al.  The signature of selection mediated by expression on human genes. , 2003, Genome research.

[32]  Nancy F. Hansen,et al.  Comparative analyses of multi-species sequences from targeted genomic regions , 2003, Nature.

[33]  E. Levanon,et al.  Human housekeeping genes are compact. , 2003, Trends in genetics : TIG.

[34]  R. Sorek,et al.  Intronic sequences flanking alternatively spliced exons are conserved between human and mouse. , 2003, Genome research.

[35]  Erez Y. Levanon,et al.  Widespread occurrence of antisense transcription in the human genome , 2003, Nature Biotechnology.

[36]  C. V. Jongeneel,et al.  Numerous potentially functional but non-genic conserved sequences on human chromosome 21 , 2002, Nature.

[37]  S. P. Fodor,et al.  Evolutionarily conserved sequences on human chromosome 21. , 2001, Genome research.

[38]  P. Keightley,et al.  Deleterious mutations and the evolution of sex. , 2000, Science.

[39]  T. Jukes,et al.  The neutral theory of molecular evolution. , 2000, Genetics.

[40]  M. Batzer,et al.  Alu repeats and human disease. , 1999, Molecular genetics and metabolism.

[41]  Peter D. Keightley,et al.  High genomic deleterious mutation rates in hominids , 1999, Nature.

[42]  R. Britten,et al.  Mobile elements inserted in the distant past have taken on important functions. , 1997, Gene.

[43]  Thomas L. Madden,et al.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. , 1997, Nucleic acids research.

[44]  S. Boissinot,et al.  Mutation Pattern Variation Among Regions of the Primate Genome , 1997, Journal of Molecular Evolution.

[45]  P. Hanawalt Transcription-coupled repair and human disease. , 1994, Science.

[46]  J. Thompson,et al.  CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. , 1994, Nucleic acids research.

[47]  M. Nei,et al.  Estimation of the number of nucleotide substitutions in the control region of mitochondrial DNA in humans and chimpanzees. , 1993, Molecular biology and evolution.

[48]  Wen-Hsiung Li,et al.  Nonrandomness of point mutation as reflected in nucleotide substitutions in pseudogenes and its evolutionary implications , 2005, Journal of Molecular Evolution.

[49]  Rat Genome Sequencing Project Consortium Genome sequence of the Brown Norway rat yields insights into mammalian evolution , 2004 .

[50]  K. Holsinger The neutral theory of molecular evolution , 2004 .

[51]  Nicholas L. Bray,et al.  AVID: A global alignment program. , 2003, Genome research.

[52]  Mouse Genome Sequencing Consortium Initial sequencing and comparative analysis of the mouse genome , 2002, Nature.

[53]  International Human Genome Sequencing Consortium Initial sequencing and analysis of the human genome , 2001, Nature.

[54]  J. Crow,et al.  A molecular approach to estimating the human deleterious mutation rate , 1993, Human mutation.

[55]  T. Nagylaki Evolution of a finite population under gene conversion. , 1983, Proceedings of the National Academy of Sciences of the United States of America.