3-way Networks: Application of Hypergraphs for Modelling Increased Complexity in Comparative Genomics

We present and develop the theory of 3-way networks, a type of hypergraph in which each edge models a relationship among a triplet of objects rather than between a pair of objects, as in standard network models. We explore approaches to pruning these 3-way networks, illustrate their utility in comparative genomics, and demonstrate, using a phylogenomic dataset of 211 bacterial genomes, that they find relationships that standard 2-way network models would miss.
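As a rough illustration of the data structure only, a 3-way network can be sketched as a mapping from unordered triplets of objects to edge weights. The sketch below assumes each object is described by a set of features (for example, gene families present in a genome), weights a triplet by the number of features shared by all three objects, and prunes by a simple weight threshold. The function names, the weighting, and the pruning rule are hypothetical choices made for this example and are not the method developed in the paper.

```python
from itertools import combinations


def three_way_edges(profiles, min_shared=1):
    """Build a toy 3-way network.

    profiles: dict mapping an object name to a set of features.
    Returns a dict mapping frozenset({a, b, c}) to the number of features
    shared by all three objects (an assumed, illustrative weight).
    """
    edges = {}
    for a, b, c in combinations(sorted(profiles), 3):
        shared = profiles[a] & profiles[b] & profiles[c]
        if len(shared) >= min_shared:
            edges[frozenset((a, b, c))] = len(shared)
    return edges


def prune(edges, threshold):
    """Keep only triplet edges whose weight is at least `threshold`
    (a hypothetical pruning rule, used here for illustration)."""
    return {edge: w for edge, w in edges.items() if w >= threshold}


# Hypothetical feature profiles for four objects.
profiles = {
    "A": {"g1", "g2", "g3"},
    "B": {"g1", "g2", "g4"},
    "C": {"g1", "g3", "g4"},
    "D": {"g5"},
}

edges = three_way_edges(profiles)
strong_edges = prune(edges, threshold=2)
```

Storing each edge as a `frozenset` makes the triplet unordered, so (A, B, C) and (C, A, B) refer to the same hyperedge. A real analysis would replace the raw intersection count with a proper similarity or significance measure.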
