Covariation in frequencies of substitution, deletion, transposition, and recombination during eutherian evolution.

Six measures of evolutionary change in the human genome were studied, three derived from the aligned human and mouse genomes in conjunction with the Mouse Genome Sequencing Consortium, consisting of (1) nucleotide substitution per fourfold degenerate site in coding regions, (2) nucleotide substitution per site in relics of transposable elements active only before the human-mouse speciation, and (3) the nonaligning fraction of human DNA that is nonrepetitive or in ancestral repeats; and three derived from human genome data alone, consisting of (4) SNP density, (5) frequency of insertion of transposable elements, and (6) rate of recombination. Features 1 and 2 are measures of nucleotide substitutions at two classes of "neutral" sites, whereas 4 is a measure of recent mutations. Feature 3 is a measure dominated by deletions in mouse, whereas 5 represents insertions in human. It was found that all six vary significantly in megabase-sized regions genome-wide, and many vary together. This indicates that some regions of a genome change slowly by all processes that alter DNA, and others change faster. Regional variation in all processes is correlated with, but not completely accounted for, by GC content in human and the difference between GC content in human and mouse.

[1]  D R Bentley,et al.  Long-range comparison of human and mouse SCL loci: localized regions of sensitivity to restriction endonucleases correspond precisely with peaks of conserved noncoding sequences. , 2001, Genome research.

[2]  R. Gibbs,et al.  Comparative sequence analysis of a gene-rich cluster at human chromosome 12p13 and its syntenic region in mouse chromosome 6. , 1998, Genome research.

[3]  Laurence D. Hurst,et al.  Sensitivity of Patterns of Molecular Evolution to Alterations in Methodology: A Critique of Hughes and Yeager , 1998, Journal of Molecular Evolution.

[4]  M. Seldin,et al.  Human/mouse homology relationships. , 1996, Genomics.

[6]  Jeffrey W Touchman,et al.  Generation and comparative analysis of approximately 3.3 Mb of mouse genomic sequence orthologous to the region of human chromosome 7q11.23 implicated in Williams syndrome. , 2002, Genome research.

[7]  Jia Li,et al.  Significance Of inter-species matches when evolutionary rate varies , 2002, RECOMB '02.

[8]  J. M. Smith,et al.  The hitch-hiking effect of a favourable gene. , 1974, Genetical research.

[9]  Mouse Genome Sequencing Consortium Initial sequencing and comparative analysis of the mouse genome , 2002, Nature.

[10]  C. Hutchison,et al.  Nucleotide sequence of the BALB/c mouse β-globin complex , 1989 .

[11]  A. Carrano,et al.  Genomic sequence comparison of the human and mouse XRCC1 DNA repair gene regions. , 1995, Genomics.

[12]  L. Hurst,et al.  Is the synonymous substitution rate in mammals gene-specific? , 2002, Molecular biology and evolution.

[13]  Wen-Hsiung Li,et al.  Mutation rates differ among regions of the mammalian genome , 1989, Nature.

[14]  Hans Ellegren,et al.  Deterministic mutation rate variation in the human genome. , 2002, Genome research.

[15]  G Bernardi,et al.  Misunderstandings about isochores. Part 1. , 2001, Gene.

[16]  M. Nei Molecular Evolutionary Genetics , 1987 .

[17]  S. Boissinot,et al.  Mutation Pattern Variation Among Regions of the Primate Genome , 1997, Journal of Molecular Evolution.

[18]  M. Daly,et al.  A map of human genome sequence variation containing 1.42 million single nucleotide polymorphisms , 2001, Nature.

[19]  Minoru Kanehisa,et al.  The size differences among mammalian introns are due to the accumulation of small deletions , 1996, FEBS letters.

[20]  H. Akashi,et al.  A test of translational selection at 'silent' sites in the human genome: base composition comparisons in alternatively spliced genes. , 2000, Gene.

[21]  Jia Li,et al.  Significance of Interspecies Matches when Evolutionary Rate Varies , 2003, J. Comput. Biol..

[22]  K. J. Fryxell,et al.  Cytosine deamination plays a primary role in the evolution of mammalian isochores. , 2000, Molecular biology and evolution.

[23]  International Human Genome Sequencing Consortium Initial sequencing and analysis of the human genome , 2001, Nature.

[24]  M. Boguski,et al.  Synonymous and Nonsynonymous Substitution Distances Are Correlated in Mouse and Rat Genes , 1998, Journal of Molecular Evolution.

[25]  P. Sharp,et al.  Chromosomal location effects on gene sequence evolution in mammals , 1999, Current Biology.

[26]  A. Clark,et al.  Local rates of recombination are positively correlated with GC content in the human genome. , 2001, Molecular biology and evolution.

[27]  S Schwartz,et al.  Comparative analysis of the gene-dense ACHE/TFR2 region on human chromosome 7q22 with the orthologous region on mouse chromosome 5. , 2001, Nucleic acids research.

[28]  W. Li,et al.  Genomic divergence between human and chimpanzee estimated from large-scale alignments of genomic sequences. , 2001, The Journal of heredity.

[29]  X. Gu,et al.  A model for the correlation of mutation rate with GC content and the origin of GC-rich isochores , 1994, Journal of Molecular Evolution.

[30]  Carsten Schwarz,et al.  Genomewide comparison of DNA sequences between humans and chimpanzees. , 2002, American journal of human genetics.

[31]  J. Archibald,et al.  Late Cretaceous relatives of rabbits, rodents, and other extant eutherian mammals , 2001, Nature.

[32]  B. Charlesworth The effect of background selection against deleterious mutations on weakly selected, linked variants. , 1994, Genetical research.

[33]  D. Mindell Fundamentals of molecular evolution , 1991 .

[34]  D. Gudbjartsson,et al.  A high-resolution recombination map of the human genome , 2002, Nature Genetics.

[35]  V. B. Yap,et al.  Association between divergence and interspersed repeats in mammalian noncoding genomic DNA , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[36]  R. Gibbs,et al.  Large-scale comparative sequence analysis of the human and murine Bruton's tyrosine kinase loci reveals conserved regulatory domains. , 1997, Genome research.

[37]  W. Miller,et al.  Distinguishing regulatory DNA from neutral sites. , 2003, Genome research.

[38]  D. Haussler,et al.  Human-mouse alignments with BLASTZ. , 2003, Genome research.

[39]  L. Hood,et al.  Striking sequence similarity over almost 100 kilobases of human and mouse T–cell receptor DNA , 1994, Nature Genetics.

[40]  A. Hughes,et al.  Natural selection at major histocompatibility complex loci of vertebrates. , 1998, Annual review of genetics.

[41]  B. Koop,et al.  Human and rodent DNA sequence comparisons: a mosaic model of genomic evolution. , 1995, Trends in genetics : TIG.

[42]  J. Castresana Estimation of genetic distances from human and mouse introns , 2002, Genome Biology.

[43]  W. Miller,et al.  Sequence conservation at human and mouse orthologous common fragile regions, FRA3B/FHIT and Fra14A2/Fhit , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[44]  K. H. Wolfe,et al.  Mammalian DNA replication: mutation biases and the mutation rate. , 1991, Journal of theoretical biology.

[45]  P. Lio’,et al.  Models of molecular evolution and phylogeny. , 1998, Genome research.

[46]  J. Castresana Genes on human chromosome 19 show extreme divergence from the mouse orthologs and a high GC content. , 2002, Nucleic acids research.

[47]  Z. Yang,et al.  Estimating synonymous and nonsynonymous substitution rates under realistic evolutionary models. , 2000, Molecular biology and evolution.

[48]  Wen-Hsiung Li,et al.  Fundamentals of molecular evolution , 1990 .

[49]  N. Goldman,et al.  A codon-based model of nucleotide substitution for protein-coding DNA sequences. , 1994, Molecular biology and evolution.

[50]  Donna R. Maglott,et al.  RefSeq and LocusLink: NCBI gene-centered resources , 2001, Nucleic Acids Res..

[51]  L. Pennacchio,et al.  Genomic strategies to identify mammalian regulatory sequences , 2001, Nature Reviews Genetics.

[52]  W. J. Kent,et al.  BLAT--the BLAST-like alignment tool. , 2002, Genome research.

[53]  D. Graur Amino acid composition and the evolutionary rates of protein-coding genes , 2005, Journal of Molecular Evolution.

[54]  Ziheng Yang,et al.  PAML: a program package for phylogenetic analysis by maximum likelihood , 1997, Comput. Appl. Biosci..

[55]  P. Lio’,et al.  Molecular phylogenetics: state-of-the-art methods for looking into the past. , 2001, Trends in genetics : TIG.

[56]  D. Graur,et al.  Nucleic acid composition, codon usage, and the rate of synonymous substitution in protein-coding genes , 1989, Journal of Molecular Evolution.

[57]  Kenneth H. Wolfe,et al.  Mammalian gene evolution: Nucleotide sequence divergence between mouse and rat , 1993, Journal of Molecular Evolution.

[58]  C. Aquadro,et al.  Levels of naturally occurring DNA polymorphism correlate with recombination rates in D. melanogaster , 1992, Nature.

[59]  A V Carrano,et al.  Sequence analysis of the ERCC2 gene regions in human, mouse, and hamster reveals three linked genes. , 1996, Genomics.

[60]  T. Jukes,et al.  The neutral theory of molecular evolution. , 2000, Genetics.

[61]  L. Hurst,et al.  The proteins of linked genes evolve at similar rates , 2000, Nature.

[62]  W Miller,et al.  Comparative sequence analysis of the mouse and human Lgn1/SMA interval. , 1999, Genomics.

[63]  G Bernardi,et al.  Isochores and the evolutionary genomics of vertebrates. , 2000, Gene.

[64]  A. Ogurtsov,et al.  Selective constraint in intergenic regions of human and mouse genomes. , 2001, Trends in genetics : TIG.

[65]  G. Bernardi,et al.  Compositional constraints and genome evolution , 2005, Journal of Molecular Evolution.

[66]  R. Hardison,et al.  Complete nucleotide sequence of the rabbit β-like globin gene cluster: Analysis of intergenic sequences and comparison with the human β-like globin gene cluster , 1989 .

[67]  W. Miller,et al.  Long human-mouse sequence alignments reveal novel regulatory elements: a reason to sequence the mouse genome. , 1997, Genome research.

[68]  Lon R. Cardon,et al.  A first-generation linkage disequilibrium map of human chromosome 22 , 2002, Nature.

[69]  Martin J Lercher,et al.  Human SNP variability and mutation rate are higher in regions of high recombination. , 2002, Trends in genetics : TIG.

[70]  S. Tavaré Some probabilistic and statistical problems in the analysis of DNA sequences , 1986 .

[71]  Webb Miller,et al.  Generation and Comparative Analysis of ∼3.3 Mb of Mouse Genomic Sequence Orthologous to the Region of Human Chromosome 7q11.23 Implicated in Williams Syndrome , 2002 .

[72]  W Miller,et al.  Comparative genomic sequence analysis of the human and mouse cystic fibrosis transmembrane conductance regulator genes. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[73]  J. Crow,et al.  A molecular approach to estimating the human deleterious mutation rate , 1993, Human mutation.

[74]  L. Hurst,et al.  Local similarity in evolutionary rates extends over whole chromosomes in human-rodent and mouse-rat comparisons: implications for understanding the mechanistic basis of the male mutation bias. , 2001, Molecular biology and evolution.

[75]  Laurence D. Hurst,et al.  The evolution of isochores , 2001, Nature Reviews Genetics.

[76]  M. Boguski,et al.  Comparative analysis of 1196 orthologous mouse and human full-length mRNA and protein sequences. , 1996, Genome research.

[77]  Tom H. Pringle,et al.  The human genome browser at UCSC. , 2002, Genome research.

[78]  M. Boguski,et al.  Evolutionary parameters of the transcribed mammalian genome: an analysis of 2,820 orthologous rodent and human sequences. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[79]  Sudhir Kumar,et al.  Mutation rates in mammalian genomes , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[80]  R. Hardison Conserved noncoding sequences are reliable guides to regulatory elements. , 2000, Trends in genetics : TIG.

[81]  Ziheng Yang Estimating the pattern of nucleotide substitution , 1994, Journal of Molecular Evolution.

[82]  M. Stanhope,et al.  Rodent phylogeny and a timescale for the evolution of Glires: evidence from an extensive taxon sampling using three nuclear genes. , 2002, Molecular biology and evolution.

[83]  C. Luo,et al.  A new method for estimating synonymous and nonsynonymous rates of nucleotide substitution considering the relative likelihood of nucleotide and codon changes. , 1985, Molecular biology and evolution.

[84]  G. Bernardi,et al.  The isochore organization of the human genome and its evolutionary history--a review. , 1993, Gene.

[85]  N L Kaplan,et al.  Deleterious background selection with recombination. , 1995, Genetics.

[86]  L. Hurst,et al.  Covariation of GC content and the silent site substitution rate in rodents: implications for methodology and for the evolution of isochores. , 2000, Gene.

[87]  S Schwartz,et al.  Sequence and comparative analysis of the rabbit alpha-like globin gene cluster reveals a rapid mode of evolution in a G + C-rich region of mammalian genomes. , 1991, Journal of molecular biology.

[88]  C. Liew,et al.  Concerted evolution of mammalian cardiac myosin heavy chain genes , 2004, Journal of Molecular Evolution.

[89]  G. Bernardi,et al.  The human genome: organization and evolutionary history. , 1995, Annual review of genetics.