Consistency of Sequence-Based Gene Clusters

In comparative genomics, various combinatorial models can be used to specify gene clusters--groups of genes that are co-located in a set of genomes. Several approaches have been proposed to reconstruct putative ancestral gene clusters based on the gene order of contemporary species. One prevalent and natural reconstruction criterion is consistency: For a set of reconstructed gene clusters, there should exist a gene order that comprises all given clusters. In this paper, we discuss the consistency problem for different gene cluster models on sequences with restricted gene multiplicities. Our results range from linear-time algorithms for the simple model of adjacencies to NP completeness for more complex models like common intervals.

[1]  Roland Wittler Phylogeny-based analysis of gene clusters , 2010 .

[2]  Ján Manuch,et al.  The Complexity of the Gapped Consecutive-Ones Property Problem for Matrices of Bounded Maximum Degree , 2010, RECOMB-CG.

[3]  Ján Manuch,et al.  On the Gapped Consecutive-Ones Property , 2009, Electron. Notes Discret. Math..

[4]  J R Roth,et al.  Selfish operons: horizontal transfer may drive the evolution of gene clusters. , 1996, Genetics.

[5]  Sven Rahmann,et al.  Integer Linear Programs for Discovering Approximate Gene Clusters , 2006, WABI.

[6]  Jayme Luiz Szwarcfiter,et al.  Hamilton Paths in Grid Graphs , 1982, SIAM J. Comput..

[7]  Cédric Chauve,et al.  A Methodological Framework for the Reconstruction of Contiguous Regions of Ancestral Genomes and Its Application to Mammalian Genomes , 2008, PLoS Comput. Biol..

[8]  Charles E. Chapple,et al.  Genome duplication in the teleost fish Tetraodon nigroviridis reveals the early vertebrate proto-karyotype , 2004, Nature.

[9]  Salim Haddadi A note on the NP–hardness of the consecutive block minimization problem , 2002 .

[10]  R. Page,et al.  From gene to organismal phylogeny: reconciled trees and the gene tree/species tree problem. , 1997, Molecular phylogenetics and evolution.

[11]  Cedric Chauve,et al.  Formal Models of Gene Clusters , 2007 .

[12]  Takeaki Uno,et al.  Fast Algorithms to Enumerate All Common Intervals of Two Permutations , 1997, Algorithmica.

[13]  Kellogg S. Booth,et al.  Testing for the Consecutive Ones Property, Interval Graphs, and Graph Planarity Using PQ-Tree Algorithms , 1976, J. Comput. Syst. Sci..

[14]  Dannie Durand,et al.  The Incompatible Desiderata of Gene Cluster Properties , 2005, Comparative Genomics.

[15]  Haim Kaplan,et al.  Four Strikes Against Physical Mapping of DNA , 1995, J. Comput. Biol..

[16]  Laurent Viennot,et al.  Lex-BFS and partition refinement, with applications to transitive orientation, interval graph recognition and consecutive ones testing , 2000, Theor. Comput. Sci..

[17]  Jens Stoye,et al.  Finding Nested Common Intervals Efficiently , 2009, RECOMB-CG.

[18]  David Sankoff,et al.  Tests for gene clustering , 2002, RECOMB '02.

[19]  Mikl'os CsHuros,et al.  Mathematical Framework for Phylogenetic Birth-And-Death Models , 2009, 0902.0970.

[20]  J. Risler,et al.  Identification of genomic features using microsyntenies of domains: domain teams. , 2005, Genome research.

[21]  Arjun Bhutkar,et al.  Inferring genome-scale rearrangement phylogeny and ancestral gene order: a Drosophila case study , 2007, Genome Biology.

[22]  David Sankoff,et al.  Common Intervals and Symmetric Difference in a Model-Free Phylogenomics, with an Application to Streptophyte Evolution , 2006, Comparative Genomics.

[23]  Jens Stoye,et al.  Computation of Median Gene Clusters , 2009, J. Comput. Biol..

[24]  Ján Manuch,et al.  Consistency of Sequence-Based Gene Clusters , 2011, J. Comput. Biol..

[25]  Christos H. Papadimitriou,et al.  Computational complexity , 1993 .

[26]  Xin He,et al.  Identifying Conserved Gene Clusters in the Presence of Homology Families , 2005, J. Comput. Biol..

[27]  Annie Chateau,et al.  Reconstructing Ancestral Gene Orders Using Conserved Intervals , 2004, WABI.

[28]  R. Overbeek,et al.  The use of gene clusters to infer functional coupling. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[29]  A. Hughes,et al.  Gene duplication and the structure of eukaryotic genomes. , 2001, Genome research.

[30]  Jens Stoye,et al.  On the Similarity of Sets of Permutations and Its Applications to Genome Comparison , 2006, J. Comput. Biol..

[31]  Bernard B. Suh,et al.  Reconstructing contiguous regions of an ancestral genome. , 2006, Genome research.

[32]  Wen-Lian Hsu A Simple Test for the Consecutive Ones Property , 2002, J. Algorithms.

[33]  P Bork,et al.  Inversions and the dynamics of eukaryotic gene order. , 2001, Trends in genetics : TIG.

[34]  Wen-Lian Hsu,et al.  PQ Trees, PC Trees, and Planar Graphs , 2004, Handbook of Data Structures and Applications.

[35]  Jens Stoye,et al.  A Unified Approach for Reconstructing Ancient Gene Clusters , 2009, TCBB.