Exploring Genome Rearrangements using Virtual Hybridization

Genomes evolve with both mutations and large scale events, such as inversions, translocations, duplications and losses, that modify the structure of a set of chromosomes. In order to study these types of large-scale events, the first task is to select, in different genomes, sub-sequences that are considered “equivalent”. Many approaches have been used to identify equivalent sequences, either based on biological experiments, gene annotations, or sequence alignments. These techniques suffer from a variety of drawbacks that often result in the impossibility, for independent researchers, to reproduce the datasets used in the studies, or to adapt them to newly sequenced genomes. In this paper, we show that carefully selected small probes can be efficiently used to construct datasets. Once a set of probes is identified ‐ and published ‐, datasets for whole genome comparisons can be produced, and reproduced, with elementary algorithms; decisions about what is considered an occurrence of a probe in a genome can be criticized and reevaluated; and the structure of a newly sequenced genome can be obtained rapidly, without the need of gene annotations or intensive computations.

[1]  Jijun Tang,et al.  Inferring Ancestral Chloroplast Genomes with Inverted Repea , 2006, BIOCOMP.

[2]  T. Dobzhansky,et al.  Inversions in the Chromosomes of Drosophila Pseudoobscura. , 1938, Genetics.

[3]  J. D. Parsons,et al.  Improved tools for DNA comparison and clustering , 1995, Comput. Appl. Biosci..

[4]  A. J. Bendich,et al.  Circular Chloroplast Chromosomes: The Grand Illusion , 2004, The Plant Cell Online.

[5]  P. Pevzner,et al.  Reconstructing the genomic architecture of ancestral mammals: lessons from human, mouse, and rat genomes. , 2004, Genome research.

[6]  J. Risler,et al.  Identification of genomic features using microsyntenies of domains: domain teams. , 2005, Genome research.

[7]  Annie Chateau,et al.  Inferring Positional Homologs with Common Intervals of Sequences , 2006, Comparative Genomics.

[8]  J. Boore Animal mitochondrial genomes. , 1999, Nucleic acids research.

[9]  D. Ward,et al.  Karyotyping human chromosomes by combinatorial multi-fluor FISH , 1996, Nature Genetics.

[10]  Jijun Tang,et al.  Inferring ancestral chloroplast genomes with inverted repeat , 2006 .

[11]  M S Waterman,et al.  Identification of common molecular subsequences. , 1981, Journal of molecular biology.

[12]  W. Martin,et al.  Chloroplast genome phylogenetics: why we need independent approaches to plant molecular evolution. , 2005, Trends in plant science.

[13]  Mei Li,et al.  MultiPipMaker and supporting tools: alignments and analysis of multiple genomic DNA sequences , 2003, Nucleic Acids Res..

[14]  K. H. Wolfe,et al.  Ebb and flow of the chloroplast inverted repeat , 1996, Molecular and General Genetics MGG.

[15]  Jeffrey D. Palmer,et al.  A Chloroplast DNA Phylogeny of the Solanaceae: Subfamilial Relationships and Character Evolution , 1992 .

[16]  Junichi Obokata,et al.  The Rice Nuclear Genome Continuously Integrates, Shuffles, and Eliminates the Chloroplast Genome to Cause Chloroplast–Nuclear DNA Fluxw⃞ , 2005, The Plant Cell Online.

[17]  R. Olmstead,et al.  The first complete chloroplast genome sequence of a lycophyte, Huperzia lucidula (Lycopodiaceae). , 2005, Gene.