From homogeneous to heterogeneous network alignment

Network alignment (NA) aims to find a node mapping between compared networks in order to uncover highly similar (i.e., conserved) network regions. Existing NA methods are homogeneous, i.e., they can deal only with networks containing nodes and edges of one type. However, due to increasing amounts of heterogeneous real-world network data with nodes and edges of different types, we extend three recent state-of-the-art homogeneous NA methods, WAVE, MAGNA++, and SANA, to allow for heterogeneous NA for the first time. We introduce several algorithmic novelties to achieve this. Namely, these existing NA methods compute homogeneous graphlet-based node similarities and then find high-scoring alignments with respect to these similarities, while simultaneously maximizing the number of conserved edges. Instead, we generalize homogeneous graphlets to their heterogeneous counterparts, which we then use to develop a new measure of heterogeneous node similarity. Also, we generalize $S^3$, a state-of-the-art measure of edge conservation for homogeneous NA, to its heterogeneous counterpart. Then, we find high-scoring alignments with respect to our heterogeneous node similarity and edge conservation measures. In evaluations on synthetic networks and real-world biological network data, our proposed heterogeneous NA methods lead to higher-quality alignments and better robustness to noise in the data than their homogeneous counterparts.

[1]  João Pedro de Magalhães Ageing research in the post-genome era: new technologies for an old problem. , 2009 .

[2]  Tijana Milenkovic,et al.  MAGNA: Maximizing Accuracy in Global Network Alignment , 2013, Bioinform..

[3]  Nataša Pržulj,et al.  Methods for biological data integration: perspectives and challenges , 2015, Journal of The Royal Society Interface.

[4]  Igor Jurisica,et al.  Modeling interactome: scale-free or geometric? , 2004, Bioinform..

[5]  Aaron Striegel,et al.  Local versus global biological network alignment , 2015, Bioinform..

[6]  Stephen A. Cook,et al.  The complexity of theorem-proving procedures , 1971, STOC.

[7]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.

[8]  T. Milenković,et al.  Systems biology MAGNA 11 : Maximizing Accuracy in Global Network Alignment via both node and edge conservation , 2015 .

[9]  Tijana Milenkovic,et al.  MAGNA++: Maximizing Accuracy in Global Network Alignment via both node and edge conservation , 2015, Bioinform..

[10]  Tijana Milenkoviæ,et al.  Uncovering Biological Network Function via Graphlet Degree Signatures , 2008, Cancer informatics.

[11]  João Pedro de Magalhães,et al.  Ageing research in the post-genome era: new technologies for an old problem. , 2009, SEB experimental biology series.

[12]  Han Zhao,et al.  Global network alignment in the context of aging , 2015, TCBB.

[13]  Carl W. Cotman,et al.  Gene expression changes in the course of normal brain aging are sexually dimorphic , 2008, Proceedings of the National Academy of Sciences.

[14]  Kara Dolinski,et al.  The BioGRID Interaction Database: 2008 update , 2008, Nucleic Acids Res..

[15]  M. Stratton,et al.  The COSMIC (Catalogue of Somatic Mutations in Cancer) database and website , 2004, British Journal of Cancer.

[16]  Wayne B. Hayes,et al.  SANA: simulated annealing far outperforms many other search algorithms for biological network alignment , 2017, Bioinform..

[17]  Jie Tang,et al.  Simultaneous Optimization of both Node and Edge Conservation in Network Alignment via WAVE , 2014, WABI.

[18]  C. Brayne,et al.  Microarray analysis of the astrocyte transcriptome in the aging brain: relationship to Alzheimer's pathology and APOE genotype , 2011, Neurobiology of Aging.