Global Alignment of Protein-Protein Interaction Networks: A Survey

In this paper, we survey algorithms that perform global alignment of networks or graphs. Global network alignment aligns two or more given networks to find the best mapping from nodes in one network to nodes in other networks. Since graphs are a common method of data representation, graph alignment has become important with many significant applications. Protein-protein interactions can be modeled as networks and aligning these networks of protein interactions has many applications in biological research. In this survey, we review algorithms for global pairwise alignment highlighting various proposed approaches, and classify them based on their methodology. Evaluation metrics that are used to measure the quality of the resulting alignments are also surveyed. We discuss and present a comparison between selected aligners on the same datasets and evaluate using the same evaluation metrics. Finally, a quick overview of the most popular databases of protein interaction networks is presented focusing on datasets that have been used recently.

[1]  Giorgios Kollias,et al.  Network Similarity Decomposition (NSD): A Fast and Scalable Approach to Network Alignment , 2012, IEEE Transactions on Knowledge and Data Engineering.

[2]  Jugal K. Kalita,et al.  A multiobjective memetic algorithm for PPI network alignment , 2015, Bioinform..

[3]  Damian Szklarczyk,et al.  The STRING database in 2011: functional interaction networks of proteins, globally integrated and scored , 2010, Nucleic Acids Res..

[4]  Cheng-Yu Ma,et al.  Optimizing a global alignment of protein interaction networks , 2013, Bioinform..

[5]  Meng Xu,et al.  NetAlign: a web-based tool for comparison of protein interaction networks , 2006, Bioinform..

[6]  Bonnie Berger,et al.  Global alignment of multiple protein interaction networks with application to functional orthology detection , 2008, Proceedings of the National Academy of Sciences.

[7]  Wayne Hayes,et al.  Optimal Network Alignment with Graphlet Degree Vectors , 2010, Cancer informatics.

[8]  Philip Resnik,et al.  Using Information Content to Evaluate Semantic Similarity in a Taxonomy , 1995, IJCAI.

[9]  Bonnie Berger,et al.  IsoBase: a database of functionally related proteins across PPI networks , 2010, Nucleic Acids Res..

[10]  Jennifer M. Rust,et al.  The BioGRID Interaction Database , 2011 .

[11]  Gunnar W. Klau,et al.  A new graph-based method for pairwise global network alignment , 2009, BMC Bioinformatics.

[12]  Vesna Memisevic,et al.  Global G RAph A Lignment of Biological Networks , 2022 .

[13]  Mário J. Silva,et al.  Measuring semantic similarity between Gene Ontology terms , 2007, Data Knowl. Eng..

[14]  Bonnie Berger,et al.  Pairwise Global Alignment of Protein Interaction Networks by Matching Neighborhood Topology , 2007, RECOMB.

[15]  Wei Wang,et al.  Graph Database Indexing Using Structured Graph Decomposition , 2007, 2007 IEEE 23rd International Conference on Data Engineering.

[16]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[17]  Rafael C. Jimenez,et al.  The IntAct molecular interaction database in 2012 , 2011, Nucleic Acids Res..

[18]  Roded Sharan,et al.  PathBLAST: a tool for alignment of protein interaction networks , 2004, Nucleic Acids Res..

[19]  Dmitrij Frishman,et al.  MIPS: analysis and annotation of proteins from whole genomes in 2005 , 2005, Nucleic Acids Res..

[20]  Jaap Heringa,et al.  Lagrangian Relaxation Applied to Sparse Global Network Alignment , 2011, PRIB.

[21]  Natasa Przulj,et al.  Biological network comparison using graphlet degree distribution , 2007, Bioinform..

[22]  Knut Reinert,et al.  NetCoffee: a fast and accurate global alignment approach to identify functionally conserved proteins in multiple networks , 2014, Bioinform..

[23]  Roded Sharan,et al.  QNet: A Tool for Querying Protein Interaction Networks , 2007, RECOMB.

[24]  Shahin Mohammadi,et al.  A fast approach to global alignment of protein-protein interaction networks , 2013, BMC Research Notes.

[25]  Serafim Batzoglou,et al.  Automatic Parameter Learning for Multiple Network Alignment , 2008, RECOMB.

[26]  Tamer Kahveci,et al.  Topac: Alignment of gene Regulatory Networks Using Topology-Aware Coloring , 2012, J. Bioinform. Comput. Biol..

[27]  Srinivasan Parthasarathy,et al.  Scalable global alignment for multiple biological networks , 2012, BMC Bioinformatics.

[28]  T. Ideker,et al.  Modeling cellular machinery through biological network comparison , 2006, Nature Biotechnology.

[29]  GusfieldDan Introduction to the IEEE/ACM Transactions on Computational Biology and Bioinformatics , 2004 .

[30]  O. Kuchaiev,et al.  Topological network alignment uncovers biological function and phylogeny , 2008, Journal of The Royal Society Interface.

[31]  Wan Kyu Kim,et al.  Age-Dependent Evolution of the Yeast Protein Interaction Network Suggests a Limited Role of Gene Duplication and Divergence , 2008, PLoS Comput. Biol..

[32]  Sourav Bandyopadhyay,et al.  Systematic identification of functional orthologs based on protein network comparison. , 2006, Genome research.

[33]  Ricard V. Solé,et al.  A Model of Large-Scale proteome Evolution , 2002, Adv. Complex Syst..

[34]  Sandhya Rani,et al.  Human Protein Reference Database—2009 update , 2008, Nucleic Acids Res..

[35]  R. Karp,et al.  From the Cover : Conserved patterns of protein interaction in multiple species , 2005 .

[36]  Tijana Milenkovic,et al.  MAGNA: Maximizing Accuracy in Global Network Alignment , 2013, Bioinform..

[37]  Philip S. Yu,et al.  Graph indexing: a frequent structure-based approach , 2004, SIGMOD '04.

[38]  Robert H. Halstead,et al.  Matrix Computations , 2011, Encyclopedia of Parallel Computing.

[39]  Connor Clark,et al.  Multiobjective Optimization for the Alignment of Protein Networks , 2014 .

[40]  Olivier Dameron,et al.  Semantic Particularity Measure for Functional Characterization of Gene Sets Using Gene Ontology , 2014, PloS one.

[41]  Ahmet Emre Aladag,et al.  SPINAL: scalable protein interaction network alignment , 2013, Bioinform..

[42]  Ioannis Xenarios,et al.  DIP: The Database of Interacting Proteins: 2001 update , 2001, Nucleic Acids Res..

[43]  Robert Preis,et al.  Linear Time 1/2-Approximation Algorithm for Maximum Weighted Matching in General Graphs , 1999, STACS.

[44]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.

[45]  A. Vespignani,et al.  Modeling of Protein Interaction Networks , 2001, Complexus.

[46]  angesichts der Corona-Pandemie,et al.  UPDATE , 1973, The Lancet.

[47]  Gabriele Ausiello,et al.  MINT: the Molecular INTeraction database , 2006, Nucleic Acids Res..

[48]  Byung-Jun Yoon,et al.  RESQUE: Network reduction using semi-Markov random walk scores for efficient querying of biological networks , 2012, Bioinform..

[49]  Rajeev Motwani,et al.  The PageRank Citation Ranking : Bringing Order to the Web , 1999, WWW 1999.

[50]  Predrag Radivojac,et al.  Information-theoretic evaluation of predicted ontological annotations , 2013, Bioinform..

[51]  Mam Riess Jones Color Coding , 1962, Human factors.

[52]  C. D. Gelatt,et al.  Optimization by Simulated Annealing , 1983, Science.

[53]  Behnam Neyshabur,et al.  NETAL: a new graph-based method for global alignment of protein-protein interaction networks , 2013, Bioinform..

[54]  Debnath Pal,et al.  On gene ontology and function annotation , 2006, Bioinformation.

[55]  Shijie Zhang,et al.  TreePi: A Novel Graph Indexing Method , 2007, 2007 IEEE 23rd International Conference on Data Engineering.

[56]  Wilfred Ng,et al.  Efficient query processing on graph databases , 2009, TODS.

[57]  Robert Patro,et al.  Global network alignment using multiscale spectral signatures , 2012, Bioinform..

[58]  Francis Bach,et al.  Global alignment of protein–protein interaction networks by graph matching methods , 2009, Bioinform..

[59]  M. Bernardine Dias,et al.  The Dynamic Hungarian Algorithm for the Assignment Problem with Changing Costs , 2007 .

[60]  Antal F. Novak,et al.  networks Græmlin : General and robust alignment of multiple large interaction data , 2006 .

[61]  Martial Hebert,et al.  A spectral technique for correspondence problems using pairwise constraints , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[62]  Anirban Banerjee,et al.  Structural distance and evolutionary relationship of networks , 2008, Biosyst..

[63]  Warren P. Adams,et al.  Improved Linear Programming-based Lower Bounds for the Quadratic Assignment Proglem , 1993, Quadratic Assignment and Related Problems.

[64]  Roded Sharan,et al.  QPath: a method for querying pathways in a protein-protein interaction network , 2006, BMC Bioinformatics.

[65]  Han Zhao,et al.  Global Network Alignment in the Context of Aging , 2013, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[66]  Mario Cannataro,et al.  Semantic similarity analysis of protein data: assessment with biological features and issues , 2012, Briefings Bioinform..

[67]  Bonnie Berger,et al.  IsoRankN: spectral methods for global alignment of multiple protein networks , 2009, Bioinform..

[68]  Christie S. Chang,et al.  The BioGRID interaction database: 2013 update , 2012, Nucleic Acids Res..

[69]  Natasa Przulj,et al.  L-GRAAL: Lagrangian graphlet-based network aligner , 2015, Bioinform..

[70]  Charlotte M Deane,et al.  Evolutionary analysis reveals low coverage as the major challenge for protein interaction network alignment. , 2010, Molecular bioSystems.

[71]  Jugal K. Kalita,et al.  A comparison of algorithms for the pairwise alignment of biological networks , 2014, Bioinform..

[72]  Natasa Przulj,et al.  Integrative network alignment reveals large regions of global network similarity in yeast and human , 2011, Bioinform..

[73]  D. Higgins,et al.  T-Coffee: A novel method for fast and accurate multiple sequence alignment. , 2000, Journal of molecular biology.

[74]  R. Solé,et al.  Evolving protein interaction networks through gene duplication. , 2003, Journal of theoretical biology.

[75]  Scott Kirkpatrick,et al.  Optimization by Simmulated Annealing , 1983, Sci..

[76]  Chittibabu Guda,et al.  Comparative Analysis of Protein-Protein Interactions in Cancer-Associated Genes , 2009, Genom. Proteom. Bioinform..

[77]  Cesim Erten,et al.  BEAMS: backbone extraction and merge strategy for the global many-to-many alignment of multiple PPI networks , 2014, Bioinform..

[78]  Richard M. Karp,et al.  The traveling-salesman problem and minimum spanning trees: Part II , 1971, Math. Program..

[79]  Michael Lässig,et al.  Local graph alignment and motif search in biological networks. , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[80]  Wojciech Szpankowski,et al.  Pairwise Alignment of Protein Interaction Networks , 2006, J. Comput. Biol..

[81]  Tamer Kahveci,et al.  Color distribution can accelerate network alignment , 2013, BCB.

[82]  Tamer Kahveci,et al.  Incremental network querying in biological networks , 2014, BCB.

[83]  Byung-Jun Yoon,et al.  SMETANA: Accurate and Scalable Algorithm for Probabilistic Alignment of Large-Scale Biological Networks , 2013, PloS one.

[84]  Byung-Jun Yoon,et al.  A Network Synthesis Model for Generating Protein Interaction Network Families , 2012, PloS one.

[85]  Phillip W. Lord,et al.  Semantic Similarity in Biomedical Ontologies , 2009, PLoS Comput. Biol..

[86]  Dimitri P. Bertsekas,et al.  A forward/reverse auction algorithm for asymmetric assignment problems , 1992, Comput. Optim. Appl..

[87]  Stephen A. Cook,et al.  The complexity of theorem-proving procedures , 1971, STOC.

[88]  Adam J. Smith,et al.  The Database of Interacting Proteins: 2004 update , 2004, Nucleic Acids Res..

[89]  Bonnie Berger,et al.  Local Optimization for Global Alignment of Protein Interaction Networks , 2010, Pacific Symposium on Biocomputing.

[90]  Michael J. E. Sternberg,et al.  PINALOG: a novel approach to align protein interaction networks—implications for complex detection and function prediction , 2012, Bioinform..

[91]  Fan Chung Graham,et al.  Local Graph Partitioning using PageRank Vectors , 2006, 2006 47th Annual IEEE Symposium on Foundations of Computer Science (FOCS'06).

[92]  Philip S. Yu,et al.  Graph indexing based on discriminative frequent structure analysis , 2005, TODS.