Consistency of Sequence-Based Gene Clusters

In comparative genomics, differences and similarities in gene order are used to predict functional relations between genes or phylogenetic relations between genomes. For this purpose, various combinatorial models can be used to specify gene clusters, that is, groups of genes that are co-located in a set of genomes. Several approaches have been proposed to reconstruct putative ancestral gene clusters from the gene orders of contemporary species. One prevalent and natural reconstruction criterion is consistency: for a set of reconstructed gene clusters, there should exist a gene order that comprises all given clusters. For permutation-based gene cluster models, efficient methods exist to verify this condition. In this article, we discuss the consistency problem for different gene cluster models on sequences with restricted gene multiplicities. Our results range from linear-time algorithms for the simple model of adjacencies to NP-completeness proofs for more complex models such as common intervals.
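To make the consistency criterion concrete, the following minimal sketch checks it for the simplest setting only: adjacencies on a linear gene order in which every gene occurs exactly once (the permutation case). It is not the article's algorithm for sequences with restricted gene multiplicities. The function name `adjacencies_consistent` and the representation of adjacencies as pairs of gene labels are assumptions made for illustration. Under these assumptions, a set of adjacencies is consistent exactly when the graph it defines is a collection of disjoint paths: no gene has more than two neighbors and no cycle exists.

```python
from collections import defaultdict


def adjacencies_consistent(adjacencies):
    """Return True if one linear order of distinct genes contains every adjacency.

    Illustrative assumption: each gene occurs once, and an adjacency is an
    unordered pair (a, b) meaning that genes a and b are direct neighbors.
    """
    neighbors = defaultdict(set)
    for a, b in adjacencies:
        if a == b:
            return False  # a gene cannot be adjacent to itself
        neighbors[a].add(b)
        neighbors[b].add(a)

    # In a linear order every gene has at most two direct neighbors.
    if any(len(adjacent) > 2 for adjacent in neighbors.values()):
        return False

    # Union-find cycle detection: a cycle cannot be laid out on a line.
    parent = {gene: gene for gene in neighbors}

    def find(gene):
        while parent[gene] != gene:
            parent[gene] = parent[parent[gene]]  # path halving
            gene = parent[gene]
        return gene

    seen_edges = set()
    for a, b in adjacencies:
        edge = frozenset((a, b))
        if edge in seen_edges:
            continue  # the same adjacency listed twice is harmless
        seen_edges.add(edge)
        root_a, root_b = find(a), find(b)
        if root_a == root_b:
            return False  # this adjacency would close a cycle
        parent[root_a] = root_b
    return True


if __name__ == "__main__":
    print(adjacencies_consistent([("a", "b"), ("b", "c")]))
    print(adjacencies_consistent([("a", "b"), ("b", "c"), ("c", "a")]))
```

The first call describes a path a, b, c and is consistent; the second closes a cycle, so no linear order can contain all three adjacencies. The check runs in nearly linear time in the number of adjacencies thanks to the union-find structure; a plain graph traversal would give a strictly linear bound for this simplified case.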
