On the inversion-indel distance

BackgroundThe inversion distance, that is the distance between two unichromosomal genomes with the same content allowing only inversions of DNA segments, can be computed thanks to a pioneering approach of Hannenhalli and Pevzner in 1995. In 2000, El-Mabrouk extended the inversion model to allow the comparison of unichromosomal genomes with unequal contents, thus insertions and deletions of DNA segments besides inversions. However, an exact algorithm was presented only for the case in which we have insertions alone and no deletion (or vice versa), while a heuristic was provided for the symmetric case, that allows both insertions and deletions and is called the inversion-indel distance. In 2005, Yancopoulos, Attie and Friedberg started a new branch of research by introducing the generic double cut and join (DCJ) operation, that can represent several genome rearrangements (including inversions). Among others, the DCJ model gave rise to two important results. First, it has been shown that the inversion distance can be computed in a simpler way with the help of the DCJ operation. Second, the DCJ operation originated the DCJ-indel distance, that allows the comparison of genomes with unequal contents, considering DCJ, insertions and deletions, and can be computed in linear time.ResultsIn the present work we put these two results together to solve an open problem, showing that, when the graph that represents the relation between the two compared genomes has no bad components, the inversion-indel distance is equal to the DCJ-indel distance. We also give a lower and an upper bound for the inversion-indel distance in the presence of bad components.

[1]  Jens Stoye,et al.  A new linear time algorithm to compute the genomic distance via the double cut and join distance , 2009, Theor. Comput. Sci..

[2]  Richard Friedberg,et al.  Efficient sorting of genomic permutations by translocation, inversion and block interchange , 2005, Bioinform..

[3]  Jens Stoye,et al.  Double Cut and Join with Insertions and Deletions , 2011, J. Comput. Biol..

[4]  João Meidanis,et al.  Introduction to computational molecular biology , 1997 .

[5]  Olivier Gascuel,et al.  Mathematics of Evolution and Phylogeny , 2005 .

[6]  Jens Stoye,et al.  Restricted DCJ Model: Rearrangement Problems with Chromosome Reincorporation , 2011, J. Comput. Biol..

[7]  Jens Stoye,et al.  The Solution Space of Sorting by DCJ , 2010, J. Comput. Biol..

[8]  BMC Bioinformatics , 2005 .

[9]  Marília D. V. Braga,et al.  An Overview of Genomic Distances Modeled with Indels , 2013, CiE.

[10]  Pavel A. Pevzner,et al.  Transforming cabbage into turnip: polynomial algorithm for sorting signed permutations by reversals , 1995, JACM.

[11]  Vineet Bafna,et al.  Genome rearrangements and sorting by reversals , 1993, Proceedings of 1993 IEEE 34th Annual Foundations of Computer Science.

[12]  Richard Friedberg,et al.  DCJ Path Formulation for Genome Transformations which Include Insertions, Deletions, and Duplications , 2009, J. Comput. Biol..

[13]  Simone Dantas,et al.  DCJ-indel and DCJ-substitution distances with distinct operation costs , 2013, Algorithms for Molecular Biology.

[14]  Simone Dantas,et al.  Restricted DCJ-indel model: sorting linear genomes with DCJ and indels , 2012, BMC Bioinformatics.

[15]  References , 1971 .

[16]  Pavel A. Pevzner,et al.  Transforming men into mice (polynomial algorithm for genomic distance problem) , 1995, Proceedings of IEEE 36th Annual Foundations of Computer Science.

[17]  Richard Friedberg,et al.  Genome rearrangement by the double cut and join operation. , 2008, Methods in molecular biology.

[18]  Phillip E. C. Compeau DCJ-Indel sorting revisited , 2012, Algorithms for Molecular Biology.

[19]  Jens Stoye,et al.  The inversion distance problem , 2007, Mathematics of Evolution and Phylogeny.

[20]  Jens Stoye,et al.  A Unifying View of Genome Rearrangements , 2006, WABI.

[21]  Maria Emilia M. T. Walter,et al.  Reversal Distance of Signed Circular Chromosomes , 2000 .