Second-level degree-based entity resolution in online social networks

Abundance of online social platforms allows users to create more than one profile in different social networks. Several issues arise due to multiple user identities including data integration and information retrieval in social networks. Identifying user profiles across social platforms is known as entity resolution. In this paper, we propose a degree-based method to attack entity resolution problems. More precisely, we utilize the degree of users and their friends to identify user profiles. Our results show that, without help of critical information such as e-mail addresses, the proposed method can outperform existing string matching-based solutions as well as popular classifiers such as SVM and Naive Bayes.

[1]  Philip S. Yu,et al.  COSNET: Connecting Heterogeneous Social Networks with Local and Global Consistency , 2015, KDD.

[2]  Ben Shneiderman,et al.  D-Dupe: An Interactive Tool for Entity Resolution in Social Networks , 2006, Graph Drawing.

[3]  Lise Getoor,et al.  Collective entity resolution in relational data , 2007, TKDD.

[4]  Gergo Barta A Link-based Approach to Entity Resolution in Social Networks , 2014, CSE 2014.

[5]  Nor Badrul Anuar,et al.  Malicious accounts: Dark of the social networks , 2017, J. Netw. Comput. Appl..

[6]  Ben Shneiderman,et al.  D-Dupe: An Interactive Tool for Entity Resolution in Social Networks , 2006, 2006 IEEE Symposium On Visual Analytics Science And Technology.

[7]  David Guy Brizan,et al.  A. Survey of Entity Resolution and Record Linkage Methodologies , 2015, Communications of the IIMA.

[8]  Matthew Richardson,et al.  Markov Logic: A Language and Algorithms for Link Mining , 2010, Link Mining.

[9]  William M. Campbell,et al.  Cross-Domain Entity Resolution in Social Media , 2016, ArXiv.

[10]  Ashwin Machanavajjhala,et al.  Network sampling , 2013, KDD.

[11]  Kim-Kwang Raymond Choo,et al.  User profiling in intrusion detection: A review , 2016, J. Netw. Comput. Appl..

[12]  Miguel Labrador,et al.  An accurate way to cross reference users across Social Networks , 2017, SoutheastCon 2017.

[13]  Virgílio A. F. Almeida,et al.  Studying User Footprints in Different Online Social Networks , 2012, 2012 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining.

[14]  William E. Winkler,et al.  String Comparator Metrics and Enhanced Decision Rules in the Fellegi-Sunter Model of Record Linkage. , 1990 .

[15]  Vladimir I. Levenshtein,et al.  Binary codes capable of correcting deletions, insertions, and reversals , 1965 .

[16]  R. Real,et al.  The Probabilistic Basis of Jaccard's Index of Similarity , 1996 .

[17]  Anazida Zainal,et al.  Fraud detection system: A survey , 2016, J. Netw. Comput. Appl..

[18]  Timo Sztyler,et al.  An Approach for Incremental Entity Resolution at the Example of Social Media Data , 2014, AIMashup@ESWC.

[19]  Lior Rokach,et al.  Matching entities across online social networks , 2014, Neurocomputing.

[20]  JooYoung Lee,et al.  Estimating the Degrees of Neighboring Nodes in Online Social Networks , 2014, PRIMA.

[21]  Lior Rokach,et al.  Entity Matching in Online Social Networks , 2013, 2013 International Conference on Social Computing.

[22]  Ashwin Machanavajjhala,et al.  Entity Resolution: Theory, Practice & Open Challenges , 2012, Proc. VLDB Endow..