A genome-scale metabolic network alignment method within a hypergraph-based framework using a rotational tensor-vector product

Biological network alignment aims to discover important similarities and differences and thus find a mapping between topological and/or functional components of different biological molecular networks. Then, the mapped components can be considered to correspond to both their places in the network topology and their biological attributes. Development and evolution of biological network alignment methods has been accelerated by the rapidly increasing availability of such biological networks, yielding a repertoire of tens of methods based upon graph theory. However, most biological processes, especially the metabolic reactions, are more sophisticated than simple pairwise interactions and contain three or more participating components. Such multi-lateral relations are not captured by graphs, and computational methods to overcome this limitation are currently lacking. This paper introduces hypergraphs and association hypergraphs to describe metabolic networks and their potential alignments, respectively. Within this framework, metabolic networks are aligned by identifying the maximal Z-eigenvalue of a symmetric tensor. A shifted higher-order power method was utilized to identify a solution. A rotational strategy has been introduced to accelerate the tensor-vector product by 250-fold on average and reduce the storage cost by up to 1,000-fold. The algorithm was implemented on a spark-based distributed computation cluster to significantly increase the convergence rate further by 50- to 80-fold. The parameters have been explored to understand their impact on alignment accuracy and speed. In particular, the influence of initial value selection on the stationary point has been simulated to ensure an accurate approximation of the global optimum. This framework was demonstrated by alignments among the genome-wide metabolic networks of Escherichia coli MG-1655 and Halophilic archaeon DL31. To our knowledge, this is the first genome-wide metabolic network alignment at both the metabolite level and the enzyme level. These results demonstrate that it can supply quite a few valuable insights into metabolic networks. First, this method can access the driving force of organic reactions through the chemical evolution of metabolic network. Second, this method can incorporate the chemical information of enzymes and structural changes of compounds to offer new way defining reaction class and module, such as those in KEGG. Third, as a vertex-focused treatment, this method can supply novel structural and functional annotation for ill-defined molecules. The related source code is available on request.

[1]  Tijana Milenkovic,et al.  MAGNA: Maximizing Accuracy in Global Network Alignment , 2013, Bioinform..

[2]  Chen Ling,et al.  The Best Rank-1 Approximation of a Symmetric Tensor and Related Spherical Optimization Problems , 2012, SIAM J. Matrix Anal. Appl..

[3]  Shi-Hua Zhang,et al.  Alignment of molecular networks by integer quadratic programming , 2007, Bioinform..

[4]  Johannes Berg,et al.  Cross-species analysis of biological networks by Bayesian alignment. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[5]  Nikos Paragios,et al.  Dense non-rigid surface registration using high-order graph matching , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[6]  Li Wang,et al.  Semidefinite Relaxations for Best Rank-1 Tensor Approximations , 2013, SIAM J. Matrix Anal. Appl..

[7]  Orland R. Gonzalez,et al.  Metabolism of halophilic archaea , 2008, Extremophiles.

[8]  Jugal K. Kalita,et al.  A comparison of algorithms for the pairwise alignment of biological networks , 2014, Bioinform..

[9]  Natasa Przulj,et al.  Integrative network alignment reveals large regions of global network similarity in yeast and human , 2011, Bioinform..

[10]  Arzucan Özgür,et al.  A comparative study of SMILES-based compound similarity functions for drug-target interaction prediction , 2016, BMC Bioinformatics.

[11]  Jean Ponce,et al.  A tensor-based algorithm for high-order graph matching , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[12]  Roded Sharan,et al.  PathBLAST: a tool for alignment of protein interaction networks , 2004, Nucleic Acids Res..

[13]  Pietro Hiram Guzzi,et al.  Survey of local and global biological network alignment: the need to reconcile the two sides of the same coin , 2017, Briefings Bioinform..

[14]  Steffen Klamt,et al.  Hypergraphs and Cellular Networks , 2009, PLoS Comput. Biol..

[15]  Christian E. V. Storm,et al.  Automatic clustering of orthologs and in-paralogs from pairwise species comparisons. , 2001, Journal of molecular biology.

[16]  R. Wilson,et al.  The Next-Generation Sequencing Revolution and Its Impact on Genomics , 2013, Cell.

[17]  J. Munkres ALGORITHMS FOR THE ASSIGNMENT AND TRANSIORTATION tROBLEMS* , 1957 .

[18]  Tingting Zhou,et al.  Computational reconstruction of metabolic networks from KEGG. , 2013, Methods in molecular biology.

[19]  Tom Michoel,et al.  Alignment and integration of complex networks by hypergraph-based spectral clustering , 2012, Physical review. E, Statistical, nonlinear, and soft matter physics.

[20]  Sourav Bandyopadhyay,et al.  Systematic identification of functional orthologs based on protein network comparison. , 2006, Genome research.

[21]  Duc Dong Do,et al.  ACOGNA2: A novel algorithm for maximizing accuracy in global network alignment , 2019, 2019 6th NAFOSTED Conference on Information and Computer Science (NICS).

[22]  Yongjun Liu,et al.  An always convergent algorithm for the largest eigenvalue of an irreducible nonnegative tensor , 2010, J. Comput. Appl. Math..

[23]  Roland A. Pache,et al.  A Novel Framework for the Comparative Analysis of Biological Networks , 2012, PloS one.

[24]  Behnam Neyshabur,et al.  NETAL: a new graph-based method for global alignment of protein-protein interaction networks , 2013, Bioinform..

[25]  Michael K. Ng,et al.  Finding the Largest Eigenvalue of a Nonnegative Tensor , 2009, SIAM J. Matrix Anal. Appl..

[26]  Phillip A. Regalia,et al.  On the Best Rank-1 Approximation of Higher-Order Supersymmetric Tensors , 2001, SIAM J. Matrix Anal. Appl..

[27]  Bonnie Berger,et al.  Pairwise Global Alignment of Protein Interaction Networks by Matching Neighborhood Topology , 2007, RECOMB.

[28]  Edwin R. Hancock,et al.  Graph matching through entropic manifold alignment , 2011, CVPR 2011.

[29]  Jotun Hein,et al.  Rahnuma: hypergraph-based tool for metabolic pathway prediction and network comparison , 2009, Bioinform..

[30]  Marcel J. T. Reinders,et al.  Efficient calculation of compound similarity based on maximum common subgraphs and its application to prediction of gene transcript levels , 2013, Int. J. Bioinform. Res. Appl..

[31]  Serafim Batzoglou,et al.  Automatic Parameter Learning for Multiple Network Alignment , 2008, RECOMB.

[32]  Tamer Kahveci,et al.  SubMAP: Aligning Metabolic Pathways with Subnetwork Mappings , 2010, RECOMB.

[33]  Gene H. Golub,et al.  Symmetric Tensors and Symmetric Tensor Rank , 2008, SIAM J. Matrix Anal. Appl..

[34]  Tamara G. Kolda,et al.  Triangular Alignment (TAME): A Tensor-Based Approach for Higher-Order Network Alignment , 2015, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[35]  T. Ideker,et al.  Modeling cellular machinery through biological network comparison , 2006, Nature Biotechnology.

[36]  T. Ideker,et al.  Integrative approaches for finding modular structure in biological networks , 2013, Nature Reviews Genetics.

[37]  Robert Patro,et al.  Global network alignment using multiscale spectral signatures , 2012, Bioinform..

[38]  A. Barabasi,et al.  High-Quality Binary Protein Interaction Map of the Yeast Interactome Network , 2008, Science.

[39]  Yiqun Cao,et al.  ChemMine tools: an online service for analyzing and clustering small molecules , 2011, Nucleic Acids Res..

[40]  Jacques van Helden,et al.  Network Analysis Tools: from biological networks to clusters and pathways , 2008, Nature Protocols.

[41]  R. Karp,et al.  Conserved pathways within bacteria and yeast as revealed by global protein network alignment , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[42]  Natasa Przulj,et al.  Biological network comparison using graphlet degree distribution , 2007, Bioinform..

[43]  Adam M. Feist,et al.  What do cells actually want? , 2016, Genome Biology.

[44]  Y. Dimitrienko,et al.  Tensor Analysis and Nonlinear Tensor Functions , 2002 .

[45]  R. Karp,et al.  From the Cover : Conserved patterns of protein interaction in multiple species , 2005 .

[46]  P. Lio’,et al.  Genome-scale metabolic network reconstruction. , 2015, Methods in molecular biology.

[47]  Tamara G. Kolda,et al.  Shifted Power Method for Computing Tensor Eigenpairs , 2010, SIAM J. Matrix Anal. Appl..

[48]  Antal F. Novak,et al.  networks Græmlin : General and robust alignment of multiple large interaction data , 2006 .

[49]  Minsu Cho,et al.  Hyper-graph matching via reweighted random walks , 2011, CVPR 2011.