A Solution to Tweet-Based User Identification Across Online Social Networks

User identification helps build more complete user profiles and benefits many applications, and it has attracted considerable research attention. Existing approaches that perform well rely mainly on rich online data. However, because of privacy settings, such rich data is costly or even difficult to obtain. Besides, some profile attributes are not required to be unique and are easily faked by users for various purposes, which makes the existing schemes quite fragile. Users often publish their activities publicly on different social networks, which provides a way to overcome this problem. We aim to address user identification based only on users' tweets. We first formulate the tweet-based user identification problem and propose a tweet-based user identification model. We then present a supervised machine learning based solution. It consists of three key steps: first, we propose several algorithms to measure the spatial, temporal and content similarity of two tweets; second, we extract spatial, temporal and content features to exploit information redundancies; third, we employ a machine learning method for user identification. Experiments show that the proposed solution achieves F1 values of 89.79%, 86.78% and 86.24% on three ground truth datasets, respectively. This work demonstrates that user identification is possible with online data that is easily accessible and not easily impersonated.
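
The first two steps of the pipeline described above can be illustrated with a minimal sketch. The abstract does not specify the similarity algorithms or the feature definitions, so everything here is an assumption made for illustration: the `Tweet` fields, the exponential decay over haversine distance and time gap, the Jaccard token overlap, the `scale_km` and `scale_s` parameters, and the "average of best matches" aggregation in `pair_features` are all hypothetical stand-ins, not the paper's method.

```python
import math
from dataclasses import dataclass


@dataclass
class Tweet:
    # Hypothetical minimal tweet record; field names are assumptions.
    lat: float        # latitude in degrees
    lon: float        # longitude in degrees
    timestamp: float  # seconds since the epoch
    text: str


def spatial_similarity(a: Tweet, b: Tweet, scale_km: float = 1.0) -> float:
    """Map haversine distance to (0, 1]; 1.0 means the same location."""
    r = 6371.0  # mean Earth radius in km
    p1, p2 = math.radians(a.lat), math.radians(b.lat)
    dphi = p2 - p1
    dlmb = math.radians(b.lon - a.lon)
    h = math.sin(dphi / 2) ** 2 + math.cos(p1) * math.cos(p2) * math.sin(dlmb / 2) ** 2
    dist_km = 2 * r * math.asin(min(1.0, math.sqrt(h)))
    return math.exp(-dist_km / scale_km)


def temporal_similarity(a: Tweet, b: Tweet, scale_s: float = 3600.0) -> float:
    """Map the absolute time gap to (0, 1]; 1.0 means the same instant."""
    return math.exp(-abs(a.timestamp - b.timestamp) / scale_s)


def content_similarity(a: Tweet, b: Tweet) -> float:
    """Jaccard overlap of lowercased whitespace tokens."""
    ta, tb = set(a.text.lower().split()), set(b.text.lower().split())
    if not ta and not tb:
        return 0.0
    return len(ta & tb) / len(ta | tb)


def pair_features(tweets_a: list, tweets_b: list) -> list:
    """Build a 3-element feature vector for a candidate account pair.

    For each similarity measure, every tweet of account A is matched to its
    most similar tweet of account B, and the best-match scores are averaged.
    """
    if not tweets_a or not tweets_b:
        return [0.0, 0.0, 0.0]
    features = []
    for sim in (spatial_similarity, temporal_similarity, content_similarity):
        best = [max(sim(x, y) for y in tweets_b) for x in tweets_a]
        features.append(sum(best) / len(best))
    return features
```

Under these assumptions, the vector returned by `pair_features` for each candidate account pair, together with a label indicating whether the two accounts belong to the same person, would form the training data for whichever supervised classifier is chosen in the third step.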
