Variational Cross-Network Embedding for Anonymized User Identity Linkage

User identity linkage (UIL) task aims to infer the identical users between different social networks/platforms. Existing models leverage the labeled inter-linkages or high-quality user attributes to make predictions. Nevertheless, it is often difficult or even impossible to obtain such information in real-world applications. To this end, we in this paper focus on studying an Anonymized User Identity Linkage (AUIL) problem wherein neither labeled anchor users nor attributes are available. To handle such a practical and challenging task, we propose a novel and concise unsupervised embedding method, VCNE, by utilizing the network structural information. Concretely, considering the inherent properties of structural diversity in the AUIL problem, we introduce a variational cross-network embedding learning framework to jointly study the Gaussian embeddings instead of the existing deterministic embedding from the angle of vector space. The multi-facet experiments on both real-world and synthetic datasets demonstrate that VCNE not only outperforms all baselines to a large extent but also be more robust to the different-level diversities and sparsities of the networks.

[1]  Xiaolong Jin,et al.  Predict Anchor Links across Social Networks via an Embedding Approach , 2016, IJCAI.

[2]  Cynthia Dwork,et al.  Wherefore art thou r3579x?: anonymized social networks, hidden patterns, and structural steganography , 2007, WWW '07.

[3]  Jing Xiao,et al.  User Identity Linkage by Latent User Space Modelling , 2016, KDD.

[4]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[5]  Hongyuan Zha,et al.  Gromov-Wasserstein Learning for Graph Matching and Node Embedding , 2019, ICML.

[6]  Xiaolong Jin,et al.  Cross-Domain Recommendation: An Embedding and Mapping Approach , 2017, IJCAI.

[7]  R. A. Leibler,et al.  On Information and Sufficiency , 1951 .

[8]  Pushmeet Kohli,et al.  Graph Matching Networks for Learning the Similarity of Graph Structured Objects , 2019, ICML.

[9]  Philip S. Yu,et al.  Integrated Anchor and Social Link Predictions across Social Networks , 2015, IJCAI.

[10]  Xavier Bresson,et al.  Benchmarking Graph Neural Networks , 2020, ArXiv.

[11]  Li Liu,et al.  Aligning Users across Social Networks Using Network Embedding , 2016, IJCAI.

[12]  Philip S. Yu,et al.  PCT: Partial Co-Alignment of Social Networks , 2016, WWW.

[13]  REGAL , 2018, Proceedings of the 27th ACM International Conference on Information and Knowledge Management.

[14]  C. Givens,et al.  A class of Wasserstein metrics for probability distributions. , 1984 .

[15]  Mark Heimann,et al.  REGAL: Representation Learning-based Graph Alignment , 2018, CIKM.

[16]  Hongyuan Zha,et al.  Joint Link Prediction and Network Alignment via Cross-graph Embedding , 2019, IJCAI.

[17]  L. Ambrosio,et al.  Gradient Flows: In Metric Spaces and in the Space of Probability Measures , 2005 .

[18]  Zhoujun Li,et al.  Partially Shared Adversarial Learning For Semi-supervised Multi-platform User Identity Linkage , 2019, CIKM.

[19]  Changsheng Xu,et al.  Friend transfer: Cold-start friend recommendation with cross-platform transfer learning of social knowledge , 2013, 2013 IEEE International Conference on Multimedia and Expo (ICME).

[20]  Sergey Ioffe,et al.  Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.

[21]  MEgo2Vec , 2018, Proceedings of the 27th ACM International Conference on Information and Knowledge Management.

[22]  Hanghang Tong,et al.  FINAL: Fast Attributed Network Alignment , 2016, KDD.

[23]  Katarzyna Musial,et al.  Multi-level Graph Convolutional Networks for Cross-platform Anchor Link Prediction , 2020, KDD.

[24]  Steven Skiena,et al.  DeepWalk: online learning of social representations , 2014, KDD.

[25]  Ramayya Krishnan,et al.  HYDRA: large-scale social identity linkage via heterogeneous behavior modeling , 2014, SIGMOD Conference.

[26]  Philip S. Yu,et al.  Multiple Anonymized Social Networks Alignment , 2015, 2015 IEEE International Conference on Data Mining.

[27]  Xiaoming Zhang,et al.  Distribution Distance Minimization for Unsupervised User Identity Linkage , 2018, CIKM.

[28]  Philip S. Yu,et al.  Inferring anchor links across multiple heterogeneous social networks , 2013, CIKM.

[29]  Danai Koutra,et al.  BIG-ALIGN: Fast Bipartite Graph Alignment , 2013, 2013 IEEE 13th International Conference on Data Mining.

[30]  Jingping Bi,et al.  LRHNE: A Latent-Relation Enhanced Embedding Method for Heterogeneous Information Networks , 2020, CIKM.

[31]  Xiaoyong Du,et al.  Structure Based User Identification across Social Networks , 2018, IEEE Transactions on Knowledge and Data Engineering.

[32]  Huan Liu,et al.  Relational learning via latent social dimensions , 2009, KDD.

[33]  Max Welling,et al.  Auto-Encoding Variational Bayes , 2013, ICLR.

[34]  Leonidas J. Guibas,et al.  The Earth Mover's Distance as a Metric for Image Retrieval , 2000, International Journal of Computer Vision.

[35]  Max Welling,et al.  Semi-Supervised Classification with Graph Convolutional Networks , 2016, ICLR.

[36]  Jingping Bi,et al.  Cross-Network Embedding for Multi-Network Alignment , 2019, WWW.

[37]  Max Welling,et al.  Variational Graph Auto-Encoders , 2016, ArXiv.

[38]  Hong Chen,et al.  MEgo2Vec: Embedding Matched Ego Networks for User Alignment Across Social Networks , 2018, CIKM.

[39]  Claude Castelluccia,et al.  How Unique and Traceable Are Usernames? , 2011, PETS.

[40]  Mark Heimann,et al.  G-CREWE: Graph CompREssion With Embedding for Network Alignment , 2020, CIKM.

[41]  Bonnie Berger,et al.  IsoRankN: spectral methods for global alignment of multiple protein networks , 2009, Bioinform..

[42]  Mark Heimann,et al.  CONE-Align: Consistent Network Alignment with Proximity-Preserving Node Embedding , 2020, CIKM.