Genome sequence comparison and scenarios for gene rearrangements: a test case.

As large portions of related genomes are being sequenced, methods for comparing complete or nearly complete genomes, as opposed to comparing individual genes, are becoming progressively more important. A major, widespread phenomenon in genome evolution is the rearrangement of genes and gene blocks. There is, however, no consistent method for genome sequence comparison combined with the reconstruction of the evolutionary history of highly rearranged genomes. We developed a schema for genome sequence comparison that includes three successive steps: (i) comparison of all proteins encoded in different genomes and generation of genomic similarity plots; (ii) construction of an alphabet of conserved genes and gene blocks; and (iii) generation of most parsimonious genome rearrangement scenarios. The approach is illustrated by a comparison of the herpesvirus genomes that constitute the largest set of relatively long, complete genome sequences available to date. Herpesviruses have from 70 to about 200 genes; comparison of the amino acid sequences encoded in these genes results in an alphabet of about 30 conserved genes comprising 7 conserved blocks that are rearranged in the genomes of different herpesviruses. Algorithms to analyze rearrangements of multiple genomes were developed and applied to the derivation of most parsimonious scenarios of herpesvirus evolution under different evolutionary models. The developed approaches to genome comparison will be applicable to the comparative analysis of bacterial and eukaryotic genomes as soon as their sequences become available.

[1]  Pavel A. Pevzner,et al.  Transforming cabbage into turnip: polynomial algorithm for sorting signed permutations by reversals , 1995, JACM.

[2]  C. Sander,et al.  From genome sequences to protein function , 1994 .

[3]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[4]  D. Sankoff,et al.  Gene order comparisons for phylogenetic inference: evolution of the mitochondrial genome. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[5]  J. Nadeau,et al.  Lengths of chromosomal segments conserved since divergence of man and mouse. , 1984, Proceedings of the National Academy of Sciences of the United States of America.

[6]  M. Nei,et al.  The neighbor-joining method , 1987 .

[7]  U. Gompels,et al.  Murine herpesvirus 68 is genetically related to the gammaherpesviruses Epstein-Barr virus and herpesvirus saimiri. , 1990, The Journal of general virology.

[8]  J. Kapsenberg [The varicella-zoster virus]. , 1967, Nederlands tijdschrift voor geneeskunde.

[9]  David Sankoff,et al.  Exact and Approximation Algorithms for the Inversion Distance Between Two Chromosomes , 1993, CPM.

[10]  D. McGeoch,et al.  The genomes of the human herpesviruses: contents, relationships, and evolution. , 1989, Annual review of microbiology.

[11]  A. Davison,et al.  Channel catfish virus: a new type of herpesvirus. , 1992, Virology.

[12]  David Sankoff,et al.  Efficient Bounds for Oriented Chromosome Inversion Distance , 1994, CPM.

[13]  W. Ewens,et al.  The chromosome inversion problem , 1982 .

[14]  B. Barrell,et al.  Analysis of the protein-coding content of the sequence of human cytomegalovirus strain AD169. , 1990, Current topics in microbiology and immunology.

[15]  Albrecht,et al.  Primary structure of the herpesvirus saimiri genome , 1992, Journal of virology.

[16]  P. Pevzner,et al.  Sorting by Reversals: Genome Rearrangements in Plant Organelles and Evolutionary History of X Chromosome , 1995 .

[17]  T. Dobzhansky,et al.  Inversions in the Third Chromosome of Wild Races of Drosophila Pseudoobscura, and Their Use in the Study of the History of the Species. , 1936, Proceedings of the National Academy of Sciences of the United States of America.

[18]  S. Altschul,et al.  Issues in searching molecular sequence databases , 1994, Nature Genetics.

[19]  D. Couvet,et al.  A model simulating the dynamics of plant mitochondrial genomes. , 1993, Genetics.

[20]  W. Hammerschmidt,et al.  Specificity of cleavage in replicative-form DNA of bovine herpesvirus 1 , 1988, Journal of virology.

[21]  A J Davison,et al.  The complete DNA sequence of varicella-zoster virus. , 1986, The Journal of general virology.

[22]  D Sankoff,et al.  Analytical approaches to genomic evolution. , 1993, Biochimie.

[23]  C. Fraefel,et al.  Immediate-early transcription over covalently joined genome ends of bovine herpesvirus 1: the circ gene , 1993, Journal of virology.

[24]  T. Dobzhansky,et al.  Inversions in the Chromosomes of Drosophila Pseudoobscura. , 1938, Genetics.

[25]  D. McGeoch,et al.  Emerging functions of alphaherpesvirus genes , 1993 .

[26]  Jeffrey D. Palmer,et al.  Mitochondrial DNA in Plant Systematics: Applications and Limitations , 1992 .

[27]  Vineet Bafna,et al.  Genome Rearrangements and Sorting by Reversals , 1996, SIAM J. Comput..

[28]  Robert Altenloh From a Novel , 1953 .

[29]  Vineet Bafna,et al.  Genome rearrangements and sorting by reversals , 1993, Proceedings of 1993 IEEE 34th Annual Foundations of Computer Science.