The Solution Space of Sorting by DCJ

In genome rearrangements, the double cut and join (DCJ) operation, introduced by Yancopoulos et al. in 2005, allows one to represent most rearrangement events that could happen in multichromosomal genomes, such as inversions, translocations, fusions, and fissions. No restriction on the genome structure considering linear and circular chromosomes is imposed. An advantage of this general model is that it leads to considerable algorithmic simplifications compared to other genome rearrangement models. Recently, several works concerning the DCJ operation have been published, and in particular, an algorithm was proposed to find an optimal DCJ sequence for sorting one genome into another one. Here we study the solution space of this problem and give an easy-to-compute formula that corresponds to the exact number of optimal DCJ sorting sequences for a particular subset of instances of the problem. We also give an algorithm to count the number of optimal sorting sequences for any instance of the problem. Another interesting result is the demonstration of the possibility of obtaining one optimal sorting sequence by properly replacing any pair of consecutive operations in another optimal sequence. As a consequence, any optimal sorting sequence can be obtained from one other by applying such replacements successively, but the problem of finding the shortest number of replacements between two sorting sequences is still open.

[1]  Marie-France Sagot,et al.  Exploring the Solution Space of Sorting by Reversals, with Experiments and an Application to Evolution , 2008, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[2]  Pavel A. Pevzner,et al.  Transforming men into mice (polynomial algorithm for genomic distance problem) , 1995, Proceedings of IEEE 36th Annual Foundations of Computer Science.

[3]  Jens Stoye,et al.  A Unifying View of Genome Rearrangements , 2006, WABI.

[4]  Jens Stoye,et al.  Counting All DCJ Sorting Scenarios , 2009, RECOMB-CG.

[5]  Richard Friedberg,et al.  Efficient sorting of genomic permutations by translocation, inversion and block interchange , 2005, Bioinform..

[6]  Anne Bergeron,et al.  Parking Functions, Labeled Trees and DCJ Sorting Scenarios , 2009, RECOMB-CG.

[7]  David Sankoff,et al.  Edit Distance for Genome Comparison Based on Non-local Operations * 1 Role of Rearrangements in Evolution , .

[8]  Tzvika Hartman,et al.  On the Properties of Sequences of Reversals that Sort a Signed Permutation , 2002 .

[9]  P. Pevzner,et al.  Breakpoint graphs and ancestral genome reconstructions. , 2009, Genome research.

[10]  Guillaume Fertin,et al.  Combinatorics of Genome Rearrangements , 2009, Computational molecular biology.

[11]  Marie-France Sagot,et al.  Exploring the Solution Space of Sorting by Reversals, with Experiments and an Application to Evolution , 2008, TCBB.

[12]  David Sankoff,et al.  Multichromosomal median and halving problems under different genomic distances , 2009, BMC Bioinformatics.

[13]  Adam C. Siepel An Algorithm to Enumerate Sorting Reversals for Signed Permutations , 2003, J. Comput. Biol..