A heuristic approach to discovering user correlations from organized social stream data

Recently, with the widespread popularity of SNS (Social Network Service), such as Twitter, Facebook, people are increasingly accustomed to sharing feeling, experience and knowledge with each other on Internet. The high accessibility of these web sites has allowed the information to be spread across the social media more quickly and widely, which leads to more and more populations being engaged into this so-called social stream environment. All these make the organization of user relationships become increasingly important and necessary. In this study, we try to discover the potential and dynamical user correlations using those organized social streams in accordance with users’ current interests and needs, in order to assist the collaborative information seeking process. We develop a heuristic approach to build a Dynamically Socialized User Networking (DSUN) model, and define a set of measures (such as interest degree, and popularity degree) and concepts (such as complementary tie, weak tie, and strong tie), to discover and represent users’ current profiling and dynamical correlations. The corresponding algorithms are developed respectively. Finally, the architecture of the functional modules is presented, and the experiment results are demonstrated and discussed based on an application of the proposed model.

[1]  Dawid Weiss,et al.  A survey of Web clustering engines , 2009, CSUR.

[2]  Kirsten A. Johnson The effect of Twitter posts on students’ perceptions of instructor credibility , 2011 .

[3]  Jure Leskovec,et al.  Planetary-scale views on a large instant-messaging network , 2008, WWW.

[4]  Nargis Pervin,et al.  Fast, Scalable, and Context-Sensitive Detection of Trending Topics in Microblog Post Streams , 2013, TMIS.

[5]  Christopher C. Yang,et al.  Identifying implicit relationships between social media users to support social commerce , 2012, ICEC '12.

[6]  Xiaolong Wang,et al.  Topic sentiment analysis in twitter: a graph-based hashtag sentiment classification approach , 2011, CIKM '11.

[7]  Desney S. Tan,et al.  CHI '11 Extended Abstracts on Human Factors in Computing Systems , 2008, CHI 2011.

[8]  Rossano Schifanella,et al.  Friendship prediction and homophily in social media , 2012, TWEB.

[9]  Sudipto Guha,et al.  Clustering Data Streams , 2000, FOCS.

[10]  Qun Jin,et al.  Dynamical User Networking and Profiling Based on Activity Streams for Enhanced Social Learning , 2011, ICWL.

[11]  Zhiwei Xu,et al.  Discovering and Browsing of Power Users by Social Relationship Analysis in Large-Scale Online Communities , 2008, 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology.

[12]  Hiroki Itoh,et al.  Challenges to Supporting Federated Assurance , 2009, Computer.

[13]  Jing Huang,et al.  Lightweight problem determination in DBMSs using data stream analysis techniques , 2010, CASCON.

[14]  Sattar Hashemi,et al.  Adapted One-versus-All Decision Trees for Data Stream Classification , 2009, IEEE Transactions on Knowledge and Data Engineering.

[15]  Timothy W. Bickmore,et al.  Empirical Validation of an Accommodation Theory-Based Model of User-Agent Relationship , 2012, IVA.

[16]  Ed H. Chi,et al.  Information Seeking Can Be Social , 2009, Computer.

[17]  Hong Chen,et al.  Generating associative ripples of relevant information from a variety of data streams by throwing a heuristic stone , 2011, ICUIMC '11.

[18]  Jennifer Neville,et al.  Modeling relationship strength in online social networks , 2010, WWW '10.

[19]  Wanda Pratt,et al.  Descriptive analysis of physical activity conversations on Twitter , 2011, CHI Extended Abstracts.

[20]  Reynol Junco,et al.  The effect of Twitter on college student engagement and grades , 2011, J. Comput. Assist. Learn..

[21]  Qun Jin,et al.  User Correlation Discovery and Dynamical Profiling Based on Social Streams , 2012, AMT.

[22]  Rui Li,et al.  Multiple Location Profiling for Users and Relationships from Social Network and Content , 2012, Proc. VLDB Endow..

[23]  Sudipto Guha,et al.  Clustering Data Streams: Theory and Practice , 2003, IEEE Trans. Knowl. Data Eng..

[24]  Alberto Maria Segre,et al.  The Use of Twitter to Track Levels of Disease Activity and Public Concern in the U.S. during the Influenza A H1N1 Pandemic , 2011, PloS one.

[25]  Carlos Ordonez,et al.  Clustering binary data streams with K-means , 2003, DMKD '03.

[26]  Francesco Bonchi,et al.  Cold start link prediction , 2010, KDD.

[27]  Philip S. Yu,et al.  A Framework for Clustering Evolving Data Streams , 2003, VLDB.

[28]  Matthew Andrews,et al.  Reconstruction and analysis of Twitter conversation graphs , 2012, HotSocial '12.

[29]  S. Soddu,et al.  The Loss of the p53 Activator HIPK2 Is Responsible for Galectin-3 Overexpression in Well Differentiated Thyroid Carcinomas , 2011, PloS one.

[30]  Hong Chen,et al.  A Framework of Organic Streams: Integrating Dynamically Diversified Contents into Ubiquitous Personal Study , 2010, 2010 7th International Conference on Ubiquitous Intelligence & Computing and 7th International Conference on Autonomic & Trusted Computing.

[31]  Dominik Benz,et al.  Visit me, click me, be my friend: an analysis of evidence networks of user relationships in BibSonomy , 2010, HT '10.

[32]  Bo Hu,et al.  Learning the Strength of the Factors Influencing User Behavior in Online Social Networks , 2012, 2012 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining.

[33]  Yixin Chen,et al.  Stream Cube: An Architecture for Multi-Dimensional Analysis of Data Streams , 2005, Distributed and Parallel Databases.

[34]  Chetan Gupta,et al.  CHAOS: A Data Stream Analysis Architecture for Enterprise Applications , 2009, 2009 IEEE Conference on Commerce and Enterprise Computing.

[35]  Di Jiang,et al.  Limosa: a system for geographic user interest analysis in Twitter , 2013, EDBT '13.

[36]  Shonali Krishnaswamy,et al.  Mining data streams: a review , 2005, SGMD.

[37]  Chris J. Musselle Rethinking Concepts of the Dendritic Cell Algorithm for Multiple Data Stream Analysis , 2012, ICARIS.

[38]  Ana Paiva,et al.  A Model for Social Regulation of User-Agent Relationships , 2012, IVA.

[39]  Bhavani M. Thuraisingham,et al.  Cloud-based malware detection for evolving data streams , 2011, ACM Trans. Manag. Inf. Syst..

[40]  Sean P. Goggins,et al.  Twitter zombie: architecture for capturing, socially transforming and analyzing the twittersphere , 2012, GROUP.

[41]  Yanggon Kim,et al.  Automated Twitter data collecting tool and case study with rule-based analysis , 2012, IIWAS '12.