Identification and characterisation of Facebook user profiles considering interaction aspects

ABSTRACT The great number of social network users and the expansion of this kind of tool in the last years demand the storage of a great volume of information regarding user behaviour. In this article, we utilise interaction records from Facebook users and metrics from complex networks study, to identify different user behaviours using clustering techniques. We found three different user profiles regarding interactions performed in the social network: viewer, participant and content producer. Moreover, the groups we found were characterised by the C4.5 decision-tree algorithm. The 'viewer' mainly observes what happens in the network. The ‘participant’ interacts more often with the content, getting a higher value of closeness centrality. Therefore, users with a participant profile are responsible, for example, for the faster transmission of information in the virtual environment, a crucial function for the Facebook social network. We noted too that ‘content producer’ users had a greater quantity of publications in their pages, leading to a superior degree of input interactions than the other two profiles. Finally, we also verify that the profiles are not mutually exclusive, that is, the user of a profile can at determined moment perform the behaviour of another profile.

[1]  Hans-Peter Kriegel,et al.  A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise , 1996, KDD.

[2]  Nadeem Akhtar,et al.  Social Network Analysis Tools , 2014, 2014 Fourth International Conference on Communication Systems and Network Technologies.

[3]  Mohammed J. Zaki Data Mining and Analysis: Fundamental Concepts and Algorithms , 2014 .

[4]  Daniel T. Larose,et al.  Discovering Knowledge in Data: An Introduction to Data Mining , 2005 .

[5]  Stanley Wasserman,et al.  Social Network Analysis: Methods and Applications , 1994, Structural analysis in the social sciences.

[6]  Virgílio A. F. Almeida,et al.  Characterization and Analysis of User Profiles in Online Video Sharing Systems , 2010, J. Inf. Data Manag..

[7]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[8]  Ron Kohavi,et al.  A Study of Cross-Validation and Bootstrap for Accuracy Estimation and Model Selection , 1995, IJCAI.

[9]  Marcelo Maia,et al.  Identifying user behavior in online social networks , 2008, SocialNets '08.

[10]  J. Ross Quinlan,et al.  Induction of Decision Trees , 1986, Machine Learning.

[11]  Chi-Cheng Wu,et al.  What makes users share content on facebook? Compatibility among psychological incentive, social capital focus, and content type , 2017, Comput. Hum. Behav..

[12]  D. Edwards Data Mining: Concepts, Models, Methods, and Algorithms , 2003 .

[13]  M. S. Sishodia,et al.  Societal Networks: The Networks of Dynamics of Interpersonal Associations , 2014 .

[14]  Hawoong Jeong,et al.  Statistical properties of sampled networks. , 2005, Physical review. E, Statistical, nonlinear, and soft matter physics.

[15]  Arthur Zimek,et al.  Density-Based Clustering Validation , 2014, SDM.

[16]  Juan Liu,et al.  Characterizing user behavior and information propagation on a social multimedia network , 2013, 2013 IEEE International Conference on Multimedia and Expo Workshops (ICMEW).

[17]  Chris Chao Su,et al.  Predicting social capital on Facebook: The implications of use intensity, perceived content desirability, and Facebook-enabled communication practices , 2017, Comput. Hum. Behav..

[18]  Chris Arney,et al.  Networks, Crowds, and Markets: Reasoning about a Highly Connected World (Easley, D. and Kleinberg, J.; 2010) [Book Review] , 2013, IEEE Technology and Society Magazine.

[19]  L. Freeman Centrality in social networks conceptual clarification , 1978 .

[20]  P. Rousseeuw Silhouettes: a graphical aid to the interpretation and validation of cluster analysis , 1987 .

[21]  Thomas W. Valente,et al.  The stability of centrality measures when networks are sampled , 2003, Soc. Networks.

[22]  J. MacQueen Some methods for classification and analysis of multivariate observations , 1967 .

[23]  Arthur Zimek,et al.  A Framework for Clustering Uncertain Data , 2015, Proc. VLDB Endow..

[24]  Yi-Fen Chen,et al.  See you on Facebook: exploring influences on Facebook continuous usage , 2014, Behav. Inf. Technol..

[25]  Cristiane Neri Nobre,et al.  A influência das estratégias persuasivas no comportamento dos usuários no Facebook , 2014, IHC.

[26]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[27]  Virgílio A. F. Almeida,et al.  Tips, dones and todos: uncovering user profiles in foursquare , 2012, WSDM '12.

[28]  Ben Shneiderman,et al.  Analyzing Social Media Networks with NodeXL: Insights from a Connected World , 2010 .

[29]  Weijia Jia,et al.  Influence analysis in social networks: A survey , 2018, J. Netw. Comput. Appl..

[30]  Hans-Peter Kriegel,et al.  The R*-tree: an efficient and robust access method for points and rectangles , 1990, SIGMOD '90.