Evolution's cauldron: Duplication, deletion, and rearrangement in the mouse and human genomes

This study examines genomic duplications, deletions, and rearrangements that have occurred at scales ranging from a single base to complete chromosomes by comparing the mouse and human genomes. From whole-genome sequence alignments, 344 large (>100-kb) blocks of conserved synteny are evident, but these are further fragmented by smaller-scale evolutionary events. Excluding transposon insertions, on average in each megabase of genomic alignment we observe two inversions, 17 duplications (five tandem or nearly tandem), seven transpositions, and 200 deletions of 100 bases or more. These events include 160 inversions and 75 duplications or transpositions longer than 100 kb. The frequencies of the smaller events are not substantially higher in finished portions of the assembly. Many of the smaller transpositions are processed pseudogenes; we define a “syntenic” subset of the alignments that excludes these and other small-scale transpositions. These alignments provide evidence that ≈2% of the genes in the human/mouse common ancestor have been deleted or partially deleted in the mouse. There also appears to be slightly less nontransposon-induced genome duplication in the mouse than in the human lineage. Although some of the events we detect may be due to misassemblies or missing data in the current genome sequence, or to the limitations of our methods, most are likely to represent genuine evolutionary events. To make these observations, we developed new alignment techniques that handle large gaps robustly and discriminate between orthologous and paralogous alignments.
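The abstract does not describe how the new alignment techniques work, so the following is only a minimal sketch of one generic way to join local alignment blocks across large gaps: chaining colinear blocks with a gap penalty that grows slowly with gap length. It is not the authors' method. The `Block` fields, the `gap_cost` function and its scale factor, and the example coordinates are all hypothetical choices made for illustration.

```python
import math
from typing import List, NamedTuple


class Block(NamedTuple):
    """A gapless local alignment (hypothetical layout, half-open coordinates)."""
    t_start: int   # start in the target sequence (e.g. human)
    t_end: int     # end in the target sequence
    q_start: int   # start in the query sequence (e.g. mouse)
    q_end: int     # end in the query sequence
    score: float   # alignment score of the block


def gap_cost(dt: int, dq: int) -> float:
    # Assumed penalty: logarithmic in the total gap size, so that very long
    # gaps remain affordable. The factor 2.0 is arbitrary.
    return 2.0 * math.log2(1 + dt + dq)


def best_chain(blocks: List[Block]) -> List[Block]:
    """Return the highest-scoring chain of blocks ordered in both sequences."""
    if not blocks:
        return []
    order = sorted(blocks, key=lambda b: (b.t_start, b.q_start))
    best = [b.score for b in order]   # best chain score ending at each block
    prev = [-1] * len(order)          # back-pointers for traceback
    for i, b in enumerate(order):
        for j in range(i):
            a = order[j]
            # a may precede b only if it ends before b starts in both sequences
            if a.t_end <= b.t_start and a.q_end <= b.q_start:
                cand = best[j] + b.score - gap_cost(b.t_start - a.t_end,
                                                    b.q_start - a.q_end)
                if cand > best[i]:
                    best[i], prev[i] = cand, j
    i = max(range(len(order)), key=best.__getitem__)
    chain = []
    while i != -1:
        chain.append(order[i])
        i = prev[i]
    return chain[::-1]


a = Block(0, 100, 0, 100, 100.0)
b = Block(50_100, 50_200, 50_100, 50_200, 100.0)   # colinear with a, 50-kb gap
c = Block(200, 300, 90_000, 90_100, 90.0)          # out of order relative to b
chain = best_chain([b, c, a])
```

In this toy input, `a` and `b` are joined despite the 50-kb gap on each sequence, while `c` is left out of the chain because it cannot precede `b` in the query. The quadratic loop is chosen for readability; whole-genome data would need a faster search structure.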
