Forces Shaping the Fastest Evolving Regions in the Human Genome

Comparative genomics allow us to search the human genome for segments that were extensively changed in the last ~5 million years since divergence from our common ancestor with chimpanzee, but are highly conserved in other species and thus are likely to be functional. We found 202 genomic elements that are highly conserved in vertebrates but show evidence of significantly accelerated substitution rates in human. These are mostly in non-coding DNA, often near genes associated with transcription and DNA binding. Resequencing confirmed that the five most accelerated elements are dramatically changed in human but not in other primates, with seven times more substitutions in human than in chimp. The accelerated elements, and in particular the top five, show a strong bias for adenine and thymine to guanine and cytosine nucleotide changes and are disproportionately located in high recombination and high guanine and cytosine content environments near telomeres, suggesting either biased gene conversion or isochore selection. In addition, there is some evidence of directional selection in the regions containing the two most accelerated regions. A combination of evolutionary forces has contributed to accelerated evolution of the fastest evolving elements in the human genome.

[1]  David Haussler,et al.  Identification and Classification of Conserved RNA Secondary Structures in the Human Genome , 2006, PLoS Comput. Biol..

[2]  P. Donnelly,et al.  The Fine-Scale Structure of Recombination Rate Variation in the Human Genome , 2004, Science.

[3]  T. Nagylaki Evolution of a large population under gene conversion. , 1983, Proceedings of the National Academy of Sciences of the United States of America.

[4]  A. Reymond,et al.  Conserved non-genic sequences — an unexpected feature of mammalian genomes , 2005, Nature Reviews Genetics.

[5]  Carlos D Bustamante,et al.  Ascertainment bias in studies of human genome-wide polymorphism. , 2005, Genome research.

[6]  Mouse Genome Sequencing Consortium Initial sequencing and comparative analysis of the mouse genome , 2002, Nature.

[7]  R. Hudson Gene genealogies and the coalescent process. , 1990 .

[8]  M. Nachman,et al.  Estimate of the mutation rate per nucleotide in humans. , 2000, Genetics.

[9]  Timothy B Sackton,et al.  A Scan for Positively Selected Genes in the Genomes of Humans and Chimpanzees , 2005, PLoS biology.

[10]  I. Ovcharenko,et al.  Human-zebrafish non-coding conserved elements act in vivo to regulate transcription , 2005, Nucleic acids research.

[11]  A. Varki,et al.  A chimpanzee genome project is a biomedical imperative. , 2000, Genome research.

[12]  D. Haussler,et al.  An RNA gene expressed during cortical development evolved rapidly in humans , 2006, Nature.

[13]  Geoffrey B. Nilsen,et al.  Whole-Genome Patterns of Common DNA Variation in Three Human Populations , 2005, Science.

[14]  Laurence D. Hurst,et al.  The evolution of isochores , 2001, Nature Reviews Genetics.

[15]  R. Paget The Origin of Speech , 1927, Nature.

[16]  Constance Holden The Origin of Speech , 2004, Science.

[17]  J. Wall A comparison of estimators of the population recombination rate. , 2000, Molecular biology and evolution.

[18]  L. Duret,et al.  Recombination drives the evolution of GC-content in the human genome. , 2004, Molecular biology and evolution.

[19]  M. Olivier A haplotype map of the human genome , 2003, Nature.

[20]  F. Tajima,et al.  Simple methods for testing the molecular evolutionary clock hypothesis. , 1993, Genetics.

[21]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[22]  M. Olivier A haplotype map of the human genome. , 2003, Nature.

[23]  R. Hudson,et al.  A test of neutral molecular evolution based on nucleotide data. , 1987, Genetics.

[24]  Peter Donnelly,et al.  The Influence of Recombination on Human Genetic Diversity , 2006, PLoS genetics.

[25]  D. Haussler,et al.  A distal enhancer and an ultraconserved exon are derived from a novel retroposon , 2006, Nature.

[26]  W. Li,et al.  Evidence for higher rates of nucleotide substitution in rodents than in man. , 1985, Proceedings of the National Academy of Sciences of the United States of America.

[27]  Carl W. Miller,et al.  Mutations in the mitotic check point gene, MAD1L1, in human cancers , 2001, Oncogene.

[28]  Terrence S. Furey,et al.  The UCSC Genome Browser Database , 2003, Nucleic Acids Res..

[29]  David Haussler,et al.  New Methods for Detecting Lineage-Specific Selection , 2006, RECOMB.

[30]  S. Tavaré Some probabilistic and statistical problems in the analysis of DNA sequences , 1986 .

[31]  Klaudia Walter,et al.  Open access, freely available online PLoS BIOLOGY Highly Conserved Non-Coding Sequences Are Associated with Vertebrate Development , 2022 .

[32]  M. Nachman,et al.  Single nucleotide polymorphisms and recombination rate in humans. , 2001, Trends in genetics : TIG.

[33]  J. Felsenstein Evolutionary trees from DNA sequences: A maximum likelihood approach , 2005, Journal of Molecular Evolution.

[34]  Charles H. Langley,et al.  An examination of the constancy of the rate of molecular evolution , 2005, Journal of Molecular Evolution.

[35]  J. Wall,et al.  Gene conversion and different population histories may explain the contrast between polymorphism and linkage disequilibrium levels. , 2001, American journal of human genetics.

[36]  J. M. Smith,et al.  The hitch-hiking effect of a favourable gene. , 1974, Genetical research.

[37]  D. Haussler,et al.  Evolutionarily conserved elements in vertebrate, insect, worm, and yeast genomes. , 2005, Genome research.

[38]  W. J. Kent,et al.  BLAT--the BLAST-like alignment tool. , 2002, Genome research.

[39]  Christine M. Malcom,et al.  Accelerated Evolution of Nervous System Genes in the Origin of Homo sapiens , 2004, Cell.

[40]  T. Nagylaki Evolution of a finite population under gene conversion. , 1983, Proceedings of the National Academy of Sciences of the United States of America.

[41]  P. Donnelly,et al.  A Fine-Scale Map of Recombination Rates and Hotspots Across the Human Genome , 2005, Science.

[42]  M. King,et al.  Evolution at two levels in humans and chimpanzees. , 1975, Science.

[43]  Sudhir Kumar,et al.  Vertebrate Genomes Compared , 2002, Science.

[44]  Matthew W. Hahn,et al.  Ancient and Recent Positive Selection Transformed Opioid cis-Regulation in Humans , 2005, PLoS biology.

[45]  J H Gillespie,et al.  Lineage effects and the index of dispersion of molecular evolution. , 1989, Molecular biology and evolution.

[46]  R. Nielsen,et al.  Detecting Selection in Noncoding Regions of Nucleotide Sequences , 2004, Genetics.

[47]  L. Pauling,et al.  Evolutionary Divergence and Convergence in Proteins , 1965 .

[48]  Sudhir Kumar,et al.  Genomics. Vertebrate genomes compared. , 2002, Science.

[49]  R. Tjian,et al.  Transcription regulation and animal diversity , 2003, Nature.

[50]  Alan Ashworth,et al.  Evolutionary rate of a gene affected by chromosomal position , 1999, Current Biology.

[51]  D. Haussler,et al.  Evolution's cauldron: Duplication, deletion, and rearrangement in the mouse and human genomes , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[52]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.

[53]  M V Olson,et al.  When less is more: gene loss as an engine of evolutionary change. , 1999, American journal of human genetics.

[54]  Jean L. Chang,et al.  Initial sequence of the chimpanzee genome and comparison with the human genome , 2005, Nature.

[55]  M. Kreitman,et al.  Methods to detect selection in populations with applications to the human. , 2000, Annual review of genomics and human genetics.

[56]  Barbara J. Trask,et al.  Human subtelomeres are hot spots of interchromosomal recombination and segmental duplication , 2005, Nature.

[57]  N L Kaplan,et al.  The "hitchhiking effect" revisited. , 1989, Genetics.

[58]  A. von Haeseler,et al.  Inference of population history using a likelihood approach. , 1998, Genetics.

[59]  G Bernardi,et al.  The compositional evolution of vertebrate genomes. , 2000, Gene.

[60]  Paul T. Groth,et al.  The ENCODE (ENCyclopedia Of DNA Elements) Project , 2004, Science.

[61]  M. Nóbrega,et al.  Scanning Human Gene Deserts for Long-Range Enhancers , 2003, Science.

[62]  Giorgio Bernardi,et al.  Structural and evolutionary genomics : natural selection in genome evolution , 2004 .

[63]  D. Haussler,et al.  Aligning multiple genomic sequences with the threaded blockset aligner. , 2004, Genome research.

[64]  Matthew Stephens,et al.  Absence of the TAP2 Human Recombination Hotspot in Chimpanzees , 2004, PLoS biology.

[65]  L. Duret,et al.  Vanishing GC-rich isochores in mammalian genomes. , 2002, Genetics.

[66]  D. Haussler,et al.  Ultraconserved Elements in the Human Genome , 2004, Science.

[67]  D. Haussler,et al.  Phylogenetic estimation of context-dependent substitution rates by maximum likelihood. , 2003, Molecular biology and evolution.

[68]  S. Liu-Cordero,et al.  The discovery of single-nucleotide polymorphisms--and inferences about human demographic history. , 2001, American journal of human genetics.

[69]  L. Brooks,et al.  A DNA polymorphism discovery resource for research on human genetic variation. , 1998, Genome research.

[70]  A. Helwak,et al.  High Guanine and Cytosine Content Increases mRNA Levels in Mammalian Cells , 2006, PLoS biology.

[71]  Lisa M. D'Souza,et al.  Genome sequence of the Brown Norway rat yields insights into mammalian evolution , 2004, Nature.

[72]  A. Monaco,et al.  Molecular evolution of FOXP2, a gene involved in speech and language , 2002, Nature.

[73]  B. Shafer,et al.  DNA synthesis errors associated with double-strand-break repair. , 1995, Genetics.

[74]  M. Hayden,et al.  Identification of a novel gene (HSN2) causing hereditary sensory and autonomic neuropathy type II through the Study of Canadian Genetic Isolates. , 2004, American journal of human genetics.

[75]  David Haussler,et al.  Comparative recombination rates in the rat, mouse, and human genomes. , 2004, Genome research.