Multiple Anonymized Social Networks Alignment

Users nowadays are normally involved in multiple (usually more than two) online social networks simultaneously to enjoy more social network services. Some of the networks that users are involved in can share common structures either due to the analogous network construction purposes or because of the similar social network features. However, the social network datasets available in research are usually pre-anonymized and accounts of the shared users in different networks are mostly isolated without any known connections. In this paper, we want to identify such connections between the shared users' accounts in multiple social networks (i.e., the anchor links), which is formally defined as the M-NASA (Multiple Anonymized Social Networks Alignment) problem. M-NASA is very challenging to address due to (1) the lack of known anchor links to build models, (2) the studied networks are anonymized, where no users' personal profile or attribute information is available, and (3) the "transitivity law" and the "one-to-one property" based constraints on anchor links. To resolve these challenges, a novel two-phase network alignment framework UMA (Unsupervised Multi-network Alignment) is proposed in this paper. Extensive experiments conducted on multiple real-world partially aligned social networks demonstrate that UMA can perform very well in solving the M-NASA problem.

[1]  Jun Huan,et al.  GPM: A graph pattern matching kernel with diffusion for chemical compound classification , 2008, 2008 8th IEEE International Conference on BioInformatics and BioEngineering.

[2]  Philip S. Yu,et al.  Influence Maximization Across Partially Aligned Heterogenous Social Networks , 2015, PAKDD.

[3]  Philip S. Yu,et al.  Integrated Anchor and Social Link Predictions across Social Networks , 2015, IJCAI.

[4]  Philip S. Yu,et al.  Predicting Social Links for New Users across Aligned Heterogeneous Social Networks , 2013, 2013 IEEE 13th International Conference on Data Mining.

[5]  Nitesh V. Chawla,et al.  Data Mining for Imbalanced Datasets: An Overview , 2005, The Data Mining and Knowledge Discovery Handbook.

[6]  Bhaskar D. Rao,et al.  L_0-norm Minimization for Basis Selection , 2004, NIPS.

[7]  Bonnie Berger,et al.  Pairwise Global Alignment of Protein Interaction Networks by Matching Neighborhood Topology , 2007, RECOMB.

[8]  Marina Fruehauf,et al.  Nonlinear Programming Analysis And Methods , 2016 .

[9]  Bonnie Berger,et al.  IsoRankN: spectral methods for global alignment of multiple protein networks , 2009, Bioinform..

[10]  Mahantesh Halappanavar,et al.  New Effective Multithreaded Matching Algorithms , 2014, 2014 IEEE 28th International Parallel and Distributed Processing Symposium.

[11]  F. Stephan,et al.  Set theory , 2018, Mathematical Statistics with Applications in R.

[12]  Philip S. Yu,et al.  Partial Network Alignment with Anchor Meta Path and Truncated Generic Stable Matching , 2015, ArXiv.

[13]  Danai Koutra,et al.  BIG-ALIGN: Fast Bipartite Graph Alignment , 2013, 2013 IEEE 13th International Conference on Data Mining.

[14]  Bonnie Berger,et al.  IsoBase: a database of functionally related proteins across PPI networks , 2010, Nucleic Acids Res..

[15]  Philip S. Yu,et al.  MCD: Mutual Clustering across Multiple Social Networks , 2015, 2015 IEEE International Congress on Big Data.

[16]  Cynthia Dwork,et al.  Wherefore art thou r3579x?: anonymized social networks, hidden patterns, and structural steganography , 2007, WWW '07.

[17]  Philip S. Yu,et al.  Synergistic partitioning in multiple large scale social networks , 2014, 2014 IEEE International Conference on Big Data (Big Data).

[18]  Bonnie Berger,et al.  Global alignment of multiple protein interaction networks with application to functional orthology detection , 2008, Proceedings of the National Academy of Sciences.

[19]  Reza Zafarani,et al.  Connecting users across social media sites: a behavioral-modeling approach , 2013, KDD.

[20]  Lise Getoor,et al.  Collective entity resolution in relational data , 2007, TKDD.

[21]  Ying Wang,et al.  Algorithms for Large, Sparse Network Alignment Problems , 2009, 2009 Ninth IEEE International Conference on Data Mining.

[22]  Jeong-Hoon Lee,et al.  An In-depth Comparison of Subgraph Isomorphism Algorithms in Graph Databases , 2012, Proc. VLDB Endow..

[23]  Philip S. Yu,et al.  Meta-path based multi-network collective link prediction , 2014, KDD.

[24]  Shinji Umeyama,et al.  An Eigendecomposition Approach to Weighted Graph Matching Problems , 1988, IEEE Trans. Pattern Anal. Mach. Intell..

[25]  Philip S. Yu,et al.  Transferring heterogeneous links across location-based social networks , 2014, WSDM.

[26]  Roded Sharan,et al.  Fast and Accurate Alignment of Multiple Protein Networks , 2009, J. Comput. Biol..

[27]  Philip S. Yu,et al.  Inferring anchor links across multiple heterogeneous social networks , 2013, CIKM.

[28]  Pedro M. Domingos,et al.  Ontology Matching: A Machine Learning Approach , 2004, Handbook on Ontologies.

[29]  Philip S. Yu,et al.  PNA: Partial Network Alignment with Generic Stable Matching , 2015, 2015 IEEE International Conference on Information Reuse and Integration.

[30]  David F. Gleich,et al.  A multithreaded algorithm for network alignment via approximate matching , 2012, 2012 International Conference for High Performance Computing, Networking, Storage and Analysis.

[31]  Srinivasan Parthasarathy,et al.  Scalable global alignment for multiple biological networks , 2012, BMC Bioinformatics.

[32]  B. Rao,et al.  ℓâ‚€-norm Minimization for Basis Selection , 2004, NIPS 2004.

[33]  Mario Vento,et al.  Thirty Years Of Graph Matching In Pattern Recognition , 2004, Int. J. Pattern Recognit. Artif. Intell..

[34]  Sherali Zeadally,et al.  Multiple Account Identity Deception Detection in Social Media Using Nonverbal Behavior , 2014, IEEE Transactions on Information Forensics and Security.

[35]  Antal F. Novak,et al.  networks Græmlin : General and robust alignment of multiple large interaction data , 2006 .

[36]  Erhard Rahm,et al.  Similarity flooding: a versatile graph matching algorithm and its application to schema matching , 2002, Proceedings 18th International Conference on Data Engineering.

[37]  Philip S. Yu,et al.  Community Detection for Emerging Networks , 2015, SDM.

[38]  R. Karp,et al.  From the Cover : Conserved patterns of protein interaction in multiple species , 2005 .