Global network alignment using multiscale spectral signatures

MOTIVATION Protein interaction networks provide an important system-level view of biological processes. One of the fundamental problems in biological network analysis is the global alignment of a pair of networks, which puts the proteins of one network into correspondence with the proteins of another network in a manner that conserves their interactions while respecting other evidence of their homology. By providing a mapping between the networks of different species, alignments can be used to inform hypotheses about the functions of unannotated proteins, the existence of unobserved interactions, the evolutionary divergence between the two species and the evolution of complexes and pathways. RESULTS We introduce GHOST, a global pairwise network aligner that uses a novel spectral signature to measure topological similarity between subnetworks. It combines a seed-and-extend global alignment phase with a local search procedure and exceeds state-of-the-art performance on several network alignment tasks. We show that the spectral signature used by GHOST is highly discriminative, whereas the alignments it produces are also robust to experimental noise. When compared with other recent approaches, we find that GHOST is able to recover larger and more biologically significant, shared subnetworks between species. AVAILABILITY An efficient and parallelized implementation of GHOST, released under the Apache 2.0 license, is available at http://cbcb.umd.edu/kingsford_group/ghost CONTACT rob@cs.umd.edu.

[1]  Bonnie Berger,et al.  Local Optimization for Global Alignment of Protein Interaction Networks , 2010, Pacific Symposium on Biocomputing.

[2]  A. Mortazavi,et al.  Genome-Wide Mapping of in Vivo Protein-DNA Interactions , 2007, Science.

[3]  David M. Mount,et al.  Isomorphism of graphs with bounded eigenvalue multiplicity , 1982, STOC '82.

[4]  Bonnie Berger,et al.  IsoBase: a database of functionally related proteins across PPI networks , 2010, Nucleic Acids Res..

[5]  Roded Sharan,et al.  Global alignment of protein-protein interaction networks. , 2013, Methods in molecular biology.

[6]  R. Karp,et al.  From the Cover : Conserved patterns of protein interaction in multiple species , 2005 .

[7]  Bonnie Berger,et al.  Global alignment of multiple protein interaction networks with application to functional orthology detection , 2008, Proceedings of the National Academy of Sciences.

[8]  Gunnar W. Klau,et al.  A new graph-based method for pairwise global network alignment , 2009, BMC Bioinformatics.

[9]  Antal F. Novak,et al.  networks Græmlin : General and robust alignment of multiple large interaction data , 2006 .

[10]  Jean Ponce,et al.  A tensor-based algorithm for high-order graph matching , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[11]  Leonidas J. Guibas,et al.  A metric for distributions with applications to image databases , 1998, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271).

[12]  Jaap Heringa,et al.  Lagrangian Relaxation Applied to Sparse Global Network Alignment , 2011, PRIB.

[13]  Anirban Banerjee,et al.  Structural distance and evolutionary relationship of networks , 2008, Biosyst..

[14]  Roberto Marcondes Cesar Junior,et al.  Sparse Representations for Efficient Shape Matching , 2010, 2010 23rd SIBGRAPI Conference on Graphics, Patterns and Images.

[15]  Victor Y. Pan,et al.  The complexity of the matrix eigenproblem , 1999, STOC '99.

[16]  Martial Hebert,et al.  A spectral technique for correspondence problems using pairwise constraints , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[17]  Serafim Batzoglou,et al.  Automatic Parameter Learning for Multiple Local Network Alignment , 2009, J. Comput. Biol..

[18]  Henryk Wozniakowski,et al.  Estimating the Largest Eigenvalue by the Power and Lanczos Algorithms with a Random Start , 1992, SIAM J. Matrix Anal. Appl..

[19]  Julie A. Hines,et al.  A proteome-wide protein interaction map for Campylobacter jejuni , 2007, Genome Biology.

[20]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.

[21]  S. Fields,et al.  A novel genetic system to detect protein–protein interactions , 1989, Nature.

[22]  Sean R. Collins,et al.  Toward a Comprehensive Atlas of the Physical Interactome of Saccharomyces cerevisiae*S , 2007, Molecular & Cellular Proteomics.

[23]  J. Kuczy,et al.  Estimating the Largest Eigenvalue by the Power and Lanczos Algorithms with a Random Start , 1992 .

[24]  Susumu Goto,et al.  The KEGG resource for deciphering the genome , 2004, Nucleic Acids Res..

[25]  Victor M. Preciado,et al.  From local measurements to network spectral properties: Beyond degree distributions , 2010, 49th IEEE Conference on Decision and Control (CDC).

[26]  U. Leser,et al.  Combining modularity, conservation, and interactions of proteins significantly increases precision and coverage of protein function prediction , 2010, BMC Genomics.

[27]  Haruki Nakamura,et al.  HitPredict: a database of quality assessed protein–protein interactions in nine species , 2010, Nucleic Acids Res..

[28]  Fan Chung,et al.  Spectral Graph Theory , 1996 .

[29]  Natasa Przulj,et al.  Integrative network alignment reveals large regions of global network similarity in yeast and human , 2011, Bioinform..

[30]  Michael Werman,et al.  The Quadratic-Chi Histogram Distance Family , 2010, ECCV.

[31]  Sampsa Hautaniemi,et al.  Fast Gene Ontology based clustering for microarray experiments , 2008, BioData Mining.

[32]  Qianchuan He,et al.  BIOINFORMATICS ORIGINAL PAPER , 2022 .

[33]  Bonnie Berger,et al.  IsoRankN: spectral methods for global alignment of multiple protein networks , 2009, Bioinform..

[34]  Vladimir Kolmogorov,et al.  Feature Correspondence Via Graph Matching: Models and Global Optimization , 2008, ECCV.

[35]  Chong Su,et al.  The Modular Organization of Protein Interactions in Escherichia coli , 2009, PLoS Comput. Biol..

[36]  Sourav Bandyopadhyay,et al.  Systematic identification of functional orthologs based on protein network comparison. , 2006, Genome research.

[37]  Weng Leong Optimal network Alignment with Graphlet Degree Vectors , 2010 .

[38]  P. Bork,et al.  Proteome survey reveals modularity of the yeast cell machinery , 2006, Nature.

[39]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[40]  Phillip W. Lord,et al.  Semantic Similarity in Biomedical Ontologies , 2009, PLoS Comput. Biol..

[41]  H. Kuhn The Hungarian method for the assignment problem , 1955 .

[42]  Ping Zhu,et al.  A study of graph spectra for comparing graphs and trees , 2008, Pattern Recognit..

[43]  Chitta Baral,et al.  Pairwise Alignment of Interaction Networks by Fast Identification of Maximal Conserved Patterns , 2008, Pacific Symposium on Biocomputing.

[44]  Yasukazu Nakamura,et al.  A Large Scale Analysis of Protein–Protein Interactions in the Nitrogen-fixing Bacterium Mesorhizobium loti , 2008, DNA research : an international journal for rapid publication of reports on genes and genomes.

[45]  O. Kuchaiev,et al.  Simulating trait evolution for cross-cultural comparison , 2010, Philosophical Transactions of the Royal Society B: Biological Sciences.