InfoFlow: Mining Information Flow Based on User Community in Social Networking Services

Online social networking services (SNSs) have grown rapidly and have become large data sources for social network analysis. The spread of user-generated content is crucial in SNSs, yet only a handful of studies address information diffusion and, more specifically, information diffusion flow. In this paper, we propose a novel method for discovering information diffusion processes from SNS data. The method first preprocesses the SNS data with a user-centric community detection algorithm based on modularity maximization, in order to reduce the complexity of the noisy data. The InfoFlow miner then generates information diffusion flow models among the user communities discovered from the data. The algorithm extends a traditional process discovery technique, the Flexible Heuristics Miner, and improves the visualization of the generated process model with a new measure called response weight, which captures and represents the interactions among communities. We conducted an experiment with Facebook data and visualized the information flow among user communities. We also assessed the quality of the models to demonstrate the effectiveness of the method. The resulting models allowed us to identify how information flows between communities and which users act as information disseminators and receptors within communities.
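To make the mining step more concrete, the following is a minimal sketch of how flows among communities could be scored with the standard dependency measure of the Heuristics Miner family, (|a>b| - |b>a|) / (|a>b| + |b>a| + 1). It is not the InfoFlow miner itself: the response weight measure is not defined in this abstract and is therefore not reproduced. The trace format (each diffusion trace as an ordered list of community labels), the labels `C1` to `C3`, the function names, and the threshold of 0.5 are all assumptions made for illustration.

```python
from collections import Counter


def directly_follows(traces):
    """Count how often community a is directly followed by community b."""
    counts = Counter()
    for trace in traces:
        for a, b in zip(trace, trace[1:]):
            counts[(a, b)] += 1
    return counts


def dependency(counts, a, b):
    """Heuristics Miner dependency measure between two communities."""
    ab = counts[(a, b)]
    ba = counts[(b, a)]
    if a == b:
        # Self-loop variant of the measure.
        return ab / (ab + 1)
    return (ab - ba) / (ab + ba + 1)


def flow_edges(traces, threshold=0.5):
    """Keep only the flows whose dependency reaches the (assumed) threshold."""
    counts = directly_follows(traces)
    edges = {}
    for (a, b) in counts:
        d = dependency(counts, a, b)
        if d >= threshold:
            edges[(a, b)] = d
    return edges


# Hypothetical diffusion traces: each list is the order in which
# communities reacted to one piece of content.
traces = [
    ["C1", "C2", "C3"],
    ["C1", "C2", "C3"],
    ["C1", "C3"],
    ["C2", "C1"],
]

counts = directly_follows(traces)
edges = flow_edges(traces, threshold=0.5)
for (a, b), d in sorted(edges.items()):
    print(f"{a} -> {b}: {d:.2f}")
```

In this toy log, the flow from `C1` to `C2` is observed twice but also once in reverse, so its dependency is only 0.25 and it is filtered out, whereas `C2` to `C3` and `C1` to `C3` are kept. In the method described above, the community labels would come from the modularity-based preprocessing step rather than being given by hand.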
