GMAlign: A new network aligner for revealing large conserved functional components

The alignment of protein-protein interaction (PPI) networks is an effective approach to uncover the functionally conserved sub-structure between networks. A wealth of approaches have been developed for global PPI network alignment in recent years. However, due to the computational intractability caused by its NP-completeness, global PPI network alignment remains challenging in finding large conserved components stably for various PPI network pairs. In this paper, we introduce a novel global network aligner based on graph matching method called GMAlign. We assess the outperformance of GMAlign over the state-of-the-art network aligners on various PPI network pairs from the largest BioGRID dataset. It is shown that GMAlign not only can produce larger size alignment, but also can find bigger and denser common connected subgraphs robustly for the first time. Moreover, we shows that GMAlign can produce both structurally and functionally meaningful results in detecting large conserved biological pathways between species. The GMAlign software, datasets and supplementary experimental results can be downloaded at https://github.com/yzlwhu/GMAlign.

[1]  Christie S. Chang,et al.  The BioGRID interaction database: 2013 update , 2012, Nucleic Acids Res..

[2]  Natasa Przulj,et al.  L-GRAAL: Lagrangian graphlet-based network aligner , 2015, Bioinform..

[3]  Tijana Milenkovic,et al.  MAGNA++: Maximizing Accuracy in Global Network Alignment via both node and edge conservation , 2015, Bioinform..

[4]  O. Kuchaiev,et al.  Simulating trait evolution for cross-cultural comparison , 2010, Philosophical Transactions of the Royal Society B: Biological Sciences.

[5]  Tijana Milenkovic,et al.  MAGNA: Maximizing Accuracy in Global Network Alignment , 2013, Bioinform..

[6]  A. von Haeseler,et al.  The evolution of the ribosome biogenesis pathway from a yeast perspective , 2013, Nucleic acids research.

[7]  Vesna Memisevic,et al.  Global G RAph A Lignment of Biological Networks , 2022 .

[8]  Jugal K. Kalita,et al.  A comparison of algorithms for the pairwise alignment of biological networks , 2014, Bioinform..

[9]  R. Karp,et al.  Conserved pathways within bacteria and yeast as revealed by global protein network alignment , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[10]  Natasa Przulj,et al.  Integrative network alignment reveals large regions of global network similarity in yeast and human , 2011, Bioinform..

[11]  B. Berger,et al.  Herpesviral Protein Networks and Their Interaction with the Human Proteome , 2006, Science.

[12]  Wojciech Szpankowski,et al.  Pairwise Alignment of Protein Interaction Networks , 2006, J. Comput. Biol..

[13]  R. Karp,et al.  From the Cover : Conserved patterns of protein interaction in multiple species , 2005 .

[14]  Jinbo Xu,et al.  HubAlign: an accurate and efficient method for global alignment of protein–protein interaction networks , 2014, Bioinform..

[15]  Ahmet Emre Aladag,et al.  SPINAL: scalable protein interaction network alignment , 2013, Bioinform..

[16]  M. Zaslavskiy,et al.  A Path Following Algorithm for the Graph Matching Problem , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[17]  Sourav Bandyopadhyay,et al.  Systematic identification of functional orthologs based on protein network comparison. , 2006, Genome research.

[18]  Jeffrey Xu Yu,et al.  High efficiency and quality: large graphs matching , 2011, CIKM '11.

[19]  Radu Horaud,et al.  Inexact Matching of Large and Sparse Graphs Using Laplacian Eigenvectors , 2009, GbRPR.

[20]  Roded Sharan,et al.  PathBLAST: a tool for alignment of protein interaction networks , 2004, Nucleic Acids Res..

[21]  Jaap Heringa,et al.  Lagrangian Relaxation Applied to Sparse Global Network Alignment , 2011, PRIB.

[22]  T. Tatusova,et al.  Entrez Gene: gene-centered information at NCBI , 2010, Nucleic Acids Res..

[23]  Nagiza F. Samatova,et al.  The Maximum Common Subgraph Problem: Faster Solutions via Vertex Cover , 2007, 2007 IEEE/ACS International Conference on Computer Systems and Applications.

[24]  Robert Patro,et al.  Global network alignment using multiscale spectral signatures , 2012, Bioinform..

[25]  Francis Bach,et al.  Global alignment of protein–protein interaction networks by graph matching methods , 2009, Bioinform..

[26]  Bonnie Berger,et al.  Pairwise Global Alignment of Protein Interaction Networks by Matching Neighborhood Topology , 2007, RECOMB.

[27]  Antal F. Novak,et al.  networks Græmlin : General and robust alignment of multiple large interaction data , 2006 .

[28]  P. Uetz,et al.  The binary protein-protein interaction landscape of Escherichia coli , 2014, Nature Biotechnology.

[29]  Behnam Neyshabur,et al.  NETAL: a new graph-based method for global alignment of protein-protein interaction networks , 2013, Bioinform..

[30]  Shinji Umeyama,et al.  An Eigendecomposition Approach to Weighted Graph Matching Problems , 1988, IEEE Trans. Pattern Anal. Mach. Intell..

[31]  Jiawei Han,et al.  Mining coherent dense subgraphs across massive biological networks for functional discovery , 2005, ISMB.

[32]  Serafim Batzoglou,et al.  Automatic Parameter Learning for Multiple Network Alignment , 2008, RECOMB.