Structural representation learning for network alignment with self-supervised anchor links

Abstract Network alignment, the problem of identifying similar nodes across networks, is an emerging research topic due to its ubiquitous applications in many data domains such as social-network reconciliation and protein-network analysis. While traditional alignment methods struggle to scale to large graphs, the state-of-the-art representation-based methods often rely on pre-defined anchor links, which are unavailable or expensive to compute in many applications. In this paper, we propose NAWAL, a novel, end-to-end unsupervised embedding-based network alignment framework emphasizing on structural information. The model first embeds network nodes into a low-dimension space where the structural neighborhoodship on original network is captured by the distance on the space. As the space for the input networks are learnt independently, we further leverage a generative adversarial deep neural network to reconcile the spaces without relying on hand-crafted features or domain-specific supervision. The empirical results on three real-world datasets show that NAWAL significantly outperforms state-of-the-art baselines, by over 13% of accuracy against unsupervised methods and on par or better than supervised methods. Our technique also demonstrate the robustness against adversarial conditions, such as structural noises and graph size imbalance.

[1]  Wenpu Xing,et al.  Weighted PageRank algorithm , 2004, Proceedings. Second Annual Conference on Communication Networks and Services Research, 2004..

[2]  Lejian Liao,et al.  Structural Representation Learning for User Alignment Across Social Networks , 2020, IEEE Transactions on Knowledge and Data Engineering.

[3]  Aditya Khamparia,et al.  A comprehensive survey of edge prediction in social networks: Techniques, parameters and challenges , 2019, Expert Syst. Appl..

[4]  Weinan Zhang,et al.  A Bootstrapping Framework With Interactive Information Modeling for Network Alignment , 2018, IEEE Access.

[5]  Moustapha Cissé,et al.  Parseval Networks: Improving Robustness to Adversarial Examples , 2017, ICML.

[6]  Mehran Mohsenzadeh,et al.  Data-driven construction of SPARQL queries by approximate question graph alignment in question answering over knowledge graphs , 2020, Expert Syst. Appl..

[7]  Yiqun Liu,et al.  Online Social Network Profile Linkage , 2014, AIRS.

[8]  Ali Kamandi,et al.  A learning-based ontology alignment approach using inductive logic programming , 2019, Expert Syst. Appl..

[9]  Phuc Do,et al.  W-MetaPath2Vec: The topic-driven meta-path-based model for large-scaled content-based heterogeneous information network representation learning , 2019, Expert Syst. Appl..

[10]  Hongzhi Yin,et al.  Streaming Session-based Recommendation , 2019, KDD.

[11]  Quoc Viet Hung Nguyen,et al.  User Guidance for Efficient Fact Checking , 2019, Proc. VLDB Endow..

[12]  Matthias Weidlich,et al.  From Anomaly Detection to Rumour Detection using Data Streams of Social Platforms , 2019, Proc. VLDB Endow..

[13]  Ramayya Krishnan,et al.  HYDRA: large-scale social identity linkage via heterogeneous behavior modeling , 2014, SIGMOD Conference.

[14]  Zi Huang,et al.  Knowledge-Based Systems , 2022 .

[15]  Bai Wang,et al.  Attention Based Meta Path Fusion for Heterogeneous Information Network Embedding , 2018, PRICAI.

[16]  Bela Stantic,et al.  Diversifying Group Recommendation , 2018, IEEE Access.

[17]  Min Zhang,et al.  Structural correlation between communities and core-periphery structures in social networks: Evidence from Twitter data , 2017, Expert Syst. Appl..

[18]  Jiye Liang,et al.  Exploiting user-to-user topic inclusion degree for link prediction in social-information networks , 2018, Expert Syst. Appl..

[19]  Rui Yan,et al.  AIR: Attentional Intention-Aware Recommender Systems , 2019, 2019 IEEE 35th International Conference on Data Engineering (ICDE).

[20]  Kevin Chen-Chuan Chang,et al.  A Comprehensive Survey of Graph Embedding: Problems, Techniques, and Applications , 2017, IEEE Transactions on Knowledge and Data Engineering.

[21]  Karl Aberer,et al.  Pay-as-you-go reconciliation in schema matching networks , 2014, 2014 IEEE 30th International Conference on Data Engineering.

[22]  Karl Aberer,et al.  An Evaluation of Aggregation Techniques in Crowdsourcing , 2013, WISE.

[23]  Anuraj Mohan,et al.  A social recommender system using deep architecture and network embedding , 2018, Applied Intelligence.

[24]  Wenwu Zhu,et al.  Structural Deep Network Embedding , 2016, KDD.

[25]  François Laviolette,et al.  Domain-Adversarial Training of Neural Networks , 2015, J. Mach. Learn. Res..

[26]  Yong Cao,et al.  CoLink: An Unsupervised Framework for User Identity Linkage , 2018, AAAI.

[27]  Philip S. Yu,et al.  Inferring anchor links across multiple heterogeneous social networks , 2013, CIKM.

[28]  P. Schönemann,et al.  A generalized solution of the orthogonal procrustes problem , 1966 .

[29]  Lejian Liao,et al.  Image Captioning with Relational Knowledge , 2018, PRICAI.

[30]  Xiaoping Zhou,et al.  Cross-Platform Identification of Anonymous Identical Users in Multiple Social Media Networks , 2016, IEEE Transactions on Knowledge and Data Engineering.

[31]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[32]  Tsuyoshi Murata,et al.  Network Embedding Based on a Quasi-Local Similarity Measure , 2018, PRICAI.

[33]  Pradeep Kumar,et al.  HeteClass: A Meta-path based framework for transductive classification of objects in heterogeneous information networks , 2017, Expert Syst. Appl..

[34]  Mariá Cristina Vasconcelos Nascimento,et al.  GA-LP: A genetic algorithm based on Label Propagation to detect communities in directed networks , 2017, Expert Syst. Appl..

[35]  Limin Xiao,et al.  Revealing the densest communities of social networks efficiently through intelligent data space reduction , 2018, Expert Syst. Appl..

[36]  Mikhail Belkin,et al.  Laplacian Eigenmaps for Dimensionality Reduction and Data Representation , 2003, Neural Computation.

[37]  Fan Zhang,et al.  What's in a name?: an unsupervised approach to link users across communities , 2013, WSDM.

[38]  Karl Aberer,et al.  Minimizing Efforts in Validating Crowd Answers , 2015, SIGMOD Conference.

[39]  Jure Leskovec,et al.  node2vec: Scalable Feature Learning for Networks , 2016, KDD.

[40]  Hong Chen,et al.  MEgo2Vec: Embedding Matched Ego Networks for User Alignment Across Social Networks , 2018, CIKM.

[41]  Changqing Yao,et al.  Classification by multi-semantic meta path and active weight learning in heterogeneous information networks , 2019, Expert Syst. Appl..

[42]  Bonnie Berger,et al.  Global alignment of multiple protein interaction networks with application to functional orthology detection , 2008, Proceedings of the National Academy of Sciences.

[43]  Jure Leskovec,et al.  Inductive Representation Learning on Large Graphs , 2017, NIPS.

[44]  Derong Shen,et al.  Inferring Anchor Links Based on Social Network Structure , 2018, IEEE Access.

[45]  Si Zhang,et al.  Graph convolutional networks: a comprehensive review , 2019, Computational Social Networks.

[46]  Giorgios Kollias,et al.  Network Similarity Decomposition (NSD): A Fast and Scalable Approach to Network Alignment , 2012, IEEE Transactions on Knowledge and Data Engineering.

[47]  Ying Wang,et al.  Algorithms for Large, Sparse Network Alignment Problems , 2009, 2009 Ninth IEEE International Conference on Data Mining.

[48]  Peter Fankhauser,et al.  Identifying Users Across Social Tagging Systems , 2011, ICWSM.

[49]  Duncan J. Watts,et al.  Collective dynamics of ‘small-world’ networks , 1998, Nature.

[50]  J. Tenenbaum,et al.  A global geometric framework for nonlinear dimensionality reduction. , 2000, Science.

[51]  Jinbo Xu,et al.  HubAlign: an accurate and efficient method for global alignment of protein–protein interaction networks , 2014, Bioinform..

[52]  Philip S. Yu,et al.  Multiple Anonymized Social Networks Alignment , 2015, 2015 IEEE International Conference on Data Mining.