Small fitness effect of mutations in highly conserved non-coding regions.

Comparison of human and mouse genomes has revealed that many non-coding regions have levels of sequence conservation similar to protein-coding genes. These regions have attracted a lot of attention as potentially functional genomic sequences. However, little is known about the effect mutations in these conserved non-coding regions have on fitness and how many of them are present in the human genome as deleterious polymorphisms. To gain insight into the selective constraints imposed on conserved non-coding and protein-coding regions, we compared substitution rates in primate and rodent lineages and analyzed the density and allele frequencies of human polymorphism. Genomic regions conserved between primate and rodent groups show higher relative conservation within rodents than within primates. Thus, our analysis indicates a genome-wide relaxation of selective constraint in the primate lineage, which most likely resulted from a smaller effective population size. We found that this relaxation is much more profound in conserved non-coding regions than in protein-coding regions, and that mutations at a large proportion of sites in conserved non-coding regions are associated with very small fitness effect. Data on human polymorphism are also consistent with very weak selection in conserved non-coding regions. This staggering enrichment in sites at the borderline of neutrality can be explained by assuming an important role for synergistic epistasis in the evolution of non-coding regions. Our results suggest that most individual mutations in conserved non-coding regions are only slightly deleterious but are numerous and may have a significant cumulative impact on fitness.

[1]  M. Lercher,et al.  Explorer Evidence for Widespread Degradation of Gene Control Regions in Hominid Genomes , 2015 .

[2]  C. Molony,et al.  Genetic analysis of genome-wide variation in human gene expression , 2004, Nature.

[3]  Andrew G. Clark,et al.  Evolutionary changes in cis and trans gene regulation , 2004, Nature.

[4]  M. Nóbrega,et al.  Comparative genomics at the vertebrate extremes , 2004, Nature Reviews Genetics.

[5]  D. Haussler,et al.  Ultraconserved Elements in the Human Genome , 2004, Science.

[6]  E. Birney,et al.  Comparison of human chromosome 21 conserved nongenic sequences (CNGs) with the mouse and dog genomes shows that their selective constraint is independent of their genic environment. , 2004, Genome research.

[7]  Lisa M. D'Souza,et al.  Genome sequence of the Brown Norway rat yields insights into mammalian evolution , 2004, Nature.

[8]  D. Haussler,et al.  Aligning multiple genomic sequences with the threaded blockset aligner. , 2004, Genome research.

[9]  Johannes Berg,et al.  Adaptive evolution of transcription factor binding sites , 2003, BMC Evolutionary Biology.

[10]  David Haussler,et al.  Into the heart of darkness: large-scale clustering of human non-coding DNA , 2004, ISMB/ECCB.

[11]  Daniel J. Gaffney,et al.  Functional constraints and frequency of deleterious mutations in noncoding DNA of rodents , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[12]  T. Hudson Wanted: regulatory SNPs , 2003, Nature Genetics.

[13]  D. Botstein,et al.  Discovering genotypes underlying human phenotypes: past successes for mendelian disease, future approaches for complex disease , 2003, Nature Genetics.

[14]  M. Lässig,et al.  On the evolution of gene regulation , 2003 .

[15]  Maria Jesus Martin,et al.  The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003 , 2003, Nucleic Acids Res..

[16]  Terrence S. Furey,et al.  The UCSC Genome Browser Database , 2003, Nucleic Acids Res..

[17]  D. Haussler,et al.  Human-mouse alignments with BLASTZ. , 2003, Genome research.

[18]  Colin N. Dewey,et al.  Initial sequencing and comparative analysis of the mouse genome. , 2002 .

[19]  Daniel J. Gaffney,et al.  Quantifying the slightly deleterious mutation model of molecular evolution. , 2002, Molecular biology and evolution.

[20]  David Applebaum,et al.  Stochastic processes : an introduction , 2002, The Mathematical Gazette.

[21]  E. Lander,et al.  Detection of regulatory variation in mouse genes , 2002, Nature Genetics.

[22]  P. Bork,et al.  Human non-synonymous SNPs: server and survey. , 2002, Nucleic acids research.

[23]  Alexey S Kondrashov,et al.  Classification of common conserved sequences in mammalian intergenic regions. , 2002, Human molecular genetics.

[24]  Yusuke Nakamura,et al.  JSNP: a database of common gene variations in the Japanese population , 2002, Nucleic Acids Res..

[25]  Mouse Genome Sequencing Consortium Initial sequencing and comparative analysis of the mouse genome , 2002, Nature.

[26]  Justin C. Fay,et al.  Positive and negative selection on the human genome. , 2001, Genetics.

[27]  Warren C. Lathe,et al.  Prediction of deleterious human alleles. , 2001, Human molecular genetics.

[28]  Timothy B. Stockwell,et al.  The Sequence of the Human Genome , 2001, Science.

[29]  J. V. Moran,et al.  Initial sequencing and analysis of the human genome. , 2001, Nature.

[30]  Donna R. Maglott,et al.  RefSeq and LocusLink: NCBI gene-centered resources , 2001, Nucleic Acids Res..

[31]  Elizabeth M. Smigielski,et al.  dbSNP: the NCBI database of genetic variation , 2001, Nucleic Acids Res..

[32]  International Human Genome Sequencing Consortium Initial sequencing and analysis of the human genome , 2001, Nature.

[33]  R. Guigó,et al.  GeneID in Drosophila. , 2000, Genome research.

[34]  A. D. Peters,et al.  Testing for epistasis between deleterious mutations. , 1998, Genetics.

[35]  M. Nachman Patterns of DNA variability at X-linked loci in Mus domesticus. , 1997, Genetics.

[36]  S. Karlin,et al.  Prediction of complete gene structures in human genomic DNA. , 1997, Journal of molecular biology.

[37]  H. Akashi,et al.  Molecular evolution between Drosophila melanogaster and D. simulans: reduced codon bias, faster rates of amino acid substitution, and larger proteins in D. melanogaster. , 1996, Genetics.

[38]  M. Nauta Population genetics, molecular evolution, and the neutral theory. Selected papers , 1996 .

[39]  J. Klein,et al.  Divergence time and population size in the lineage leading to modern humans. , 1995, Theoretical population biology.

[40]  木村 資生,et al.  Population genetics, molecular evolution, and the neutral theory : selected papers , 1994 .

[41]  M. Nei,et al.  Estimation of the number of nucleotide substitutions in the control region of mitochondrial DNA in humans and chimpanzees. , 1993, Molecular biology and evolution.

[42]  D. Hartl,et al.  Population genetics of polymorphism and divergence. , 1992, Genetics.

[43]  D. Labie,et al.  Molecular Evolution , 1991, Nature.

[44]  M. Kreitman,et al.  Adaptive protein evolution at the Adh locus in Drosophila , 1991, Nature.

[45]  Mandy J. Haldane,et al.  A Mathematical Theory of Natural and Artificial Selection, Part V: Selection and Mutation , 1927, Mathematical Proceedings of the Cambridge Philosophical Society.

[46]  J. B. S. Haldane,et al.  A mathematical theory of natural and artificial selection—I , 1927, Mathematical Proceedings of the Cambridge Philosophical Society.