Sorting permutations by tanspositions

Vineet Bafnat Sequence comparison in computational molecular biology is a powerful tool for deriving evolutionary and functional relationships between genes. However, classical alignment algorithms handle only local mutations (i.e. insertions, deletions and substitutions of nucleotides) and ignore global rearrangements (i.e. inversions and transpositions of long fragments). As a result, the applications of sequence alignment to analyze highly rearranged genomes (i.e. herpes viruses or plant mitochondrial DNA) are rather limited. The paper addresses the problem of genome comparison versus classical gene comparison and presents algorithms to analyze rearrangements in genomes evolving by transpositions. In the simplest form the problem corresponds to sorting bu tmnspositions, i.e. sorting of an array using transpositions of arbitrary fragments. We derive lower bounds on tmnsposition distance between permutations and present approximation algorithms for sorting by transpositions. The algorithms also imply a non-trivial upper bound on the transposition diameter of the symmetric group. Finally, we formulate two biological problems in genome rearrangements and describe the first algorithmic steps towards their solution.

[1]  Vineet Bafna,et al.  Genome rearrangements and sorting by reversals , 1993, Proceedings of 1993 IEEE 34th Annual Foundations of Computer Science.

[2]  E. Lander,et al.  A genetic linkage map of the mouse: current applications and future prospects. , 1993, Science.

[3]  Dr. Susumu Ohno Sex Chromosomes and Sex-Linked Genes , 1967, Monographs on Endocrinology.

[4]  Mark Jerrum,et al.  The Complexity of Finding Minimum-Length Generator Sequences , 1985, Theor. Comput. Sci..

[5]  Martin Aigner,et al.  Sorting by insertion of leading elements , 1987, J. Comb. Theory, Ser. A.

[6]  Ivan Hal Sudborough,et al.  On Sorting by Prefix Reversals and the Diameter of Pancake Networks , 1992, Heinz Nixdorf Symposium.

[7]  J. Nadeau,et al.  Lengths of chromosomal segments conserved since divergence of man and mouse. , 1984, Proceedings of the National Academy of Sciences of the United States of America.

[8]  Lyons Mf The William Allan memorial award address: X-chromosome inactivation and the location and expression of X-linked genes. , 1988 .

[9]  N. Franklin Conservation of genome form but not sequence in the transcription antitermination determinants of bacteriophages λ, φ21 and P22☆ , 1985 .

[10]  Christos H. Papadimitriou,et al.  Bounds for sorting by prefix reversal , 1979, Discret. Math..

[11]  M T Davisson,et al.  X-linked genetic homologies between mouse and man. , 1987, Genomics.

[12]  David Sankoff,et al.  Efficient Bounds for Oriented Chromosome Inversion Distance , 1994, CPM.

[13]  P. Pevzner,et al.  Sorting by Reversals: Genome Rearrangements in Plant Organelles and Evolutionary History of X Chromosome , 1995 .

[14]  S Karlin,et al.  Molecular evolution of herpesviruses: genomic and protein sequence comparisons , 1994, Journal of virology.

[15]  D. Sankoff,et al.  Gene order comparisons for phylogenetic inference: evolution of the mitochondrial genome. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[16]  Oded Goldreich,et al.  The Minimum-Length Generator Sequence Problem is NP-Hard , 1981, J. Algorithms.