Gossip is synteny: incomplete gossip and an exact algorithm for syntenic distance

The <i>syntenic distance</i> between two genomes is given by the minimum number of fusions, fissions, and translocations required to transform one into the other, ignoring the order of genes within chromosomes. The problem of computing this distance is NP-complete. In this paper, we give an <i>&Ogr;</i>(2<sup><i>&Ogr;</i>(<i>n</i> log <i>n</i>)</sup>) algorithm to exactly compute the syntenic distance between two genomes that contain at most <i>n</i> chromosomes. Our algorithm requires <i>&Ogr;</i>(2<sup><i>&Ogr;</i>(<i>d</i> log <i>d</i>)</sup>) time when this distance is <i>d</i>, improving the <i>&Ogr;</i>(2<sup><i>&Ogr;</i>(<i>d</i><sup>2</sup>)</sup>) running time of the beat previous exact algorithm. Our result is based upon a tight connection between syntenic distance and a novel generalization of the classical <i>gossip problem</i>. We define the <i>incomplete gossip problem</i>, in which there are <i>n</i> gossipers who each have a unique piece of initial information. They communicate by phone calls in which the participants exchange all their information, and the goal is to minimize the total number of phone calls necessary to inform each gossiper of his set of <i>relevant gossip</i> which he desires to learn.

[1]  Tao Jiang,et al.  On the complexity and approximation of syntenic distance , 1997, RECOMB '97.

[2]  Dana S. Richards,et al.  Generalizations of broadcasting and gossiping , 1988, Networks.

[3]  David Sankoff,et al.  Original Synteny , 1996, CPM.

[4]  Sridhar Hannenhalli,et al.  Polynomial-time Algorithm for Computing Translocation Distance Between Genomes , 1995, Discret. Appl. Math..

[5]  Caj Cor Hurkens Spreading gossip efficiently , 2000 .

[6]  Alberto Caprara,et al.  Sorting Permutations by Reversals and Eulerian Cycle Decompositions , 1999, SIAM J. Discret. Math..

[7]  Arthur L. Liestman,et al.  A survey of gossiping and broadcasting in communication networks , 1988, Networks.

[8]  Shietung Peng,et al.  A 2-Approximation Algorithm for Genome Rearrangements by Reversals and Transpositions , 1999, Theor. Comput. Sci..

[9]  Gerard J. Chang,et al.  Set to Set Broadcasting in Communication Networks , 1992, Discret. Appl. Math..

[10]  D. Sankoff,et al.  Synteny conservation and chromosome rearrangements during mammalian evolution. , 1997, Genetics.

[11]  Marie-France Sagot,et al.  Further Thoughts on the Syntenic Distance between Genomes , 2002, Algorithmica.

[12]  David Liben-Nowell On the Structure of Syntenic Distance , 1999, CPM.

[13]  Vineet Bafna,et al.  Sorting by Transpositions , 1998, SIAM J. Discret. Math..

[14]  E. C. Milner,et al.  A Cure for the Telephone Disease , 1972, Canadian Mathematical Bulletin.

[15]  David A. Christie,et al.  A 3/2-approximation algorithm for sorting by reversals , 1998, SODA '98.

[16]  Gerard J. Chang,et al.  The partial gossiping problem , 1996, Discret. Math..

[17]  Jon M. Kleinberg,et al.  Structural Properties and Tractability Results for Linear Synteny , 2000, CPM.

[18]  David Sankoff,et al.  Conserved Synteny As a Measure of Genomic Distance , 1996, Discret. Appl. Math..

[19]  Pavel A. Pevzner,et al.  Transforming men into mice (polynomial algorithm for genomic distance problem) , 1995, Proceedings of IEEE 36th Annual Foundations of Computer Science.

[20]  Brenda S. Baker,et al.  Gossips and telephones , 1972, Discret. Math..

[21]  R. Bumby A Problem with Telephones , 1981 .

[22]  R. Ravi,et al.  Of mice and men: algorithms for evolutionary distances between genomes with translocation , 1995, SODA '95.

[23]  D. Sankoff,et al.  Comparative Genomics: "Empirical And Analytical Approaches To Gene Order Dynamics, Map Alignment And The Evolution Of Gene Families" , 2000 .

[24]  Vineet Bafna,et al.  Genome rearrangements and sorting by reversals , 1993, Proceedings of 1993 IEEE 34th Annual Foundations of Computer Science.

[25]  Jon Kleinberg,et al.  The Syntenic Diameter of the Space of N-Chromosome Genomes , 2000 .