SANA: simulated annealing far outperforms many other search algorithms for biological network alignment

Summary: Every alignment algorithm consists of two orthogonal components: an objective function M measuring the quality of an alignment, and a search algorithm that explores the space of alignments looking for ones scoring well according to M. We introduce a new search algorithm called SANA (Simulated Annealing Network Aligner) and apply it to protein‐protein interaction networks using S3 as the topological measure. Compared against 12 recent algorithms, SANA produces 5‐10 times as many correct node pairings as the others when the correct answer is known. We expose an anti‐correlation in many existing aligners between their ability to produce good topological vs. functional similarity scores, whereas SANA usually outscores other methods in both measures. If given the perfect objective function encoding the identity mapping, SANA quickly converges to the perfect solution while many other algorithms falter. We observe that when aligning networks with a known mapping and optimizing only S3, SANA creates alignments that are not perfect and yet whose S3 scores match that of the perfect alignment. We call this phenomenon saturation of the topological score. Saturation implies that a measure's correlation with alignment correctness falters before the perfect alignment is reached. This, combined with SANA's ability to produce the perfect alignment if given the perfect objective function, suggests that better objective functions may lead to dramatically better alignments. We conclude that future work should focus on finding better objective functions, and offer SANA as the search algorithm of choice. Availability and Implementation: Software available at http://sana.ics.uci.edu. Contact: whayes@uci.edu Supplementary information: Supplementary data are available at Bioinformatics online.

[1]  Donald Geman,et al.  Stochastic relaxation, Gibbs distributions, and the Bayesian restoration of images , 1984 .

[2]  H. Szu Fast simulated annealing , 1987 .

[3]  Jiong Guo,et al.  GEDEVO: An Evolutionary Graph Edit Distance Algorithm for Biological Network Alignment , 2013, GCB.

[4]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.

[5]  L. Ingber Very fast simulated re-annealing , 1989 .

[6]  Jinbo Xu,et al.  HubAlign: an accurate and efficient method for global alignment of protein–protein interaction networks , 2014, Bioinform..

[7]  Jugal K. Kalita,et al.  Global Alignment of Protein-Protein Interaction Networks: A Survey , 2016, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[8]  Han Zhao,et al.  Global Network Alignment in the Context of Aging , 2013, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[9]  Bonnie Berger,et al.  IsoRankN: spectral methods for global alignment of multiple protein networks , 2009, Bioinform..

[10]  Christie S. Chang,et al.  The BioGRID interaction database: 2013 update , 2012, Nucleic Acids Res..

[11]  Jie Tang,et al.  Simultaneous Optimization of both Node and Edge Conservation in Network Alignment via WAVE , 2014, WABI.

[12]  Knut Reinert,et al.  NetCoffee: a fast and accurate global alignment approach to identify functionally conserved proteins in multiple networks , 2014, Bioinform..

[13]  Roded Sharan,et al.  PathBLAST: a tool for alignment of protein interaction networks , 2004, Nucleic Acids Res..

[14]  Jaap Heringa,et al.  Lagrangian Relaxation Applied to Sparse Global Network Alignment , 2011, PRIB.

[15]  Jugal K. Kalita,et al.  A multiobjective memetic algorithm for PPI network alignment , 2015, Bioinform..

[16]  Bonnie Berger,et al.  Global alignment of multiple protein interaction networks with application to functional orthology detection , 2008, Proceedings of the National Academy of Sciences.

[17]  Gunnar W. Klau,et al.  A new graph-based method for pairwise global network alignment , 2009, BMC Bioinformatics.

[18]  Robert Patro,et al.  Global network alignment using multiscale spectral signatures , 2012, Bioinform..

[19]  B. Berger,et al.  Herpesviral Protein Networks and Their Interaction with the Human Proteome , 2006, Science.

[20]  Yi Li,et al.  RiMOM: A Dynamic Multistrategy Ontology Alignment Framework , 2009, IEEE Transactions on Knowledge and Data Engineering.

[21]  Lei Meng,et al.  The post-genomic era of biological network alignment , 2015, EURASIP J. Bioinform. Syst. Biol..

[22]  Vesna Memisevic,et al.  Global G RAph A Lignment of Biological Networks , 2022 .

[23]  Vipin Vijayan,et al.  Multiple Network Alignment via MultiMAGNA++ , 2016, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[24]  P. Radivojac,et al.  An integrated approach to inferring gene–disease associations in humans , 2008, Proteins.

[25]  T. Ito,et al.  Toward a protein-protein interaction map of the budding yeast: A comprehensive system to examine two-hybrid interactions in all possible combinations between the yeast proteins. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[26]  Tijana Milenkovic,et al.  MAGNA++: Maximizing Accuracy in Global Network Alignment via both node and edge conservation , 2015, Bioinform..

[27]  Natasa Przulj,et al.  Topology-function conservation in protein–protein interaction networks , 2015, Bioinform..

[28]  M. Zaslavskiy,et al.  A Path Following Algorithm for the Graph Matching Problem , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[29]  Sean R. Collins,et al.  Toward a Comprehensive Atlas of the Physical Interactome of Saccharomyces cerevisiae*S , 2007, Molecular & Cellular Proteomics.

[30]  Natasa Przulj,et al.  Fuse: multiple network alignment via data fusion , 2014, Bioinform..

[31]  Jugal K. Kalita,et al.  A comparison of algorithms for the pairwise alignment of biological networks , 2014, Bioinform..

[32]  Natasa Przulj,et al.  Integrative network alignment reveals large regions of global network similarity in yeast and human , 2011, Bioinform..

[33]  Natasa Przulj,et al.  L-GRAAL: Lagrangian graphlet-based network aligner , 2015, Bioinform..

[34]  D. Mitra,et al.  Convergence and finite-time behavior of simulated annealing , 1985, 1985 24th IEEE Conference on Decision and Control.

[35]  Wayne Hayes,et al.  Optimal Network Alignment with Graphlet Degree Vectors , 2010, Cancer informatics.

[36]  Yihan Sun,et al.  Fair evaluation of global network aligners , 2014, 2015 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).

[37]  Mario Albrecht,et al.  CytoGEDEVO - global alignment of biological networks with Cytoscape , 2016, Bioinform..

[38]  Tijana Milenkovic,et al.  MAGNA: Maximizing Accuracy in Global Network Alignment , 2013, Bioinform..

[39]  Aaron Striegel,et al.  Local versus global biological network alignment , 2015, Bioinform..

[40]  Ahmet Emre Aladag,et al.  SPINAL: scalable protein interaction network alignment , 2013, Bioinform..

[41]  Jie Tang,et al.  Social Network Integration: Towards Constructing the Social Graph , 2013, ArXiv.

[42]  C. D. Gelatt,et al.  Optimization by Simulated Annealing , 1983, Science.

[43]  Behnam Neyshabur,et al.  NETAL: a new graph-based method for global alignment of protein-protein interaction networks , 2013, Bioinform..

[44]  Rita Casadio,et al.  Algorithms in Bioinformatics, 5th International Workshop, WABI 2005, Mallorca, Spain, October 3-6, 2005, Proceedings , 2005, WABI.

[45]  V. Cerný Thermodynamical approach to the traveling salesman problem: An efficient simulation algorithm , 1985 .

[46]  S. Dreyfus,et al.  Thermodynamical Approach to the Traveling Salesman Problem : An Efficient Simulation Algorithm , 2004 .

[47]  Cesim Erten,et al.  BEAMS: backbone extraction and merge strategy for the global many-to-many alignment of multiple PPI networks , 2014, Bioinform..

[48]  Cheng-Yu Ma,et al.  Optimizing a global alignment of protein interaction networks , 2013, Bioinform..

[49]  Igor Jurisica,et al.  Modeling interactome: scale-free or geometric? , 2004, Bioinform..

[50]  Ning Ma,et al.  BLAST+: architecture and applications , 2009, BMC Bioinformatics.

[51]  Ying Wang,et al.  Algorithms for Large, Sparse Network Alignment Problems , 2009, 2009 Ninth IEEE International Conference on Data Mining.

[52]  Igor Jurisica,et al.  A Simulated Annealing Algorithm for Maximum Common Edge Subgraph Detection in Biological Networks , 2016, GECCO.

[53]  Tijana Milenkovic,et al.  GREAT: GRaphlet Edge-based network AlignmenT , 2014, 2015 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).

[54]  O. Kuchaiev,et al.  Simulating trait evolution for cross-cultural comparison , 2010, Philosophical Transactions of the Royal Society B: Biological Sciences.

[55]  Sean R. Collins,et al.  Global landscape of protein complexes in the yeast Saccharomyces cerevisiae , 2006, Nature.