Social Stream Data: Formalism, Properties and Queries

A social stream, which refers to the data stream that records a series of social stream entities and the dynamic relations between entities, and each entity created by one producer. It is not only can used to model user generate content in online social network services, but also a multitude of systems in which records are combined by graph and stream data. Thus, the research efforts in the area about social stream is one of the hot spots recently. Although the term of “social stream” have appeared frequently, we note there are rarely formal definitions and lacks a unified view on the data. In this paper, we formally define the social stream data model trying to explain the graph stream generating mechanism from the perspective of producers. Then several properties describing social stream data are introduced. Furthermore, we summarize a set of basic operators that are essential to analytic queries based on social stream data, describe their semantics in detail. A classification scheme based on query time window is provided and difficulties lies behind each type are discussed. Finally, three real life datasets are used for the experiment of calculating properties to reveal differences between different datasets and analyze how they may exacerbate hardness of queries.

[1]  Afonso Ferreira,et al.  Building a reference combinatorial model for MANETs , 2004, IEEE Network.

[2]  Jari Saramäki,et al.  Temporal Networks , 2011, Encyclopedia of Social Network Analysis and Mining.

[3]  Aoying Zhou,et al.  Workload-Aware Cache for Social Media Data , 2013, APWeb.

[4]  Christos Faloutsos,et al.  Graph evolution: Densification and shrinking diameters , 2006, TKDD.

[5]  Amol Deshpande,et al.  Managing large dynamic graphs efficiently , 2012, SIGMOD Conference.

[6]  Hosung Park,et al.  What is Twitter, a social network or a news media? , 2010, WWW '10.

[7]  S. Bornholdt,et al.  Scale-free topology of e-mail networks. , 2002, Physical review. E, Statistical, nonlinear, and soft matter physics.

[8]  Aristides Gionis,et al.  Social piggybacking: leveraging common friends to generate event streams , 2012, SNS '12.

[9]  Kazuyuki Aihara,et al.  Quantifying Collective Attention from Tweet Stream , 2013, PloS one.

[10]  Nicola Santoro,et al.  Time-Varying Graphs and Social Network Analysis: Temporal Indicators and Metrics , 2011, ArXiv.

[11]  Michalis Faloutsos,et al.  A simple conceptual model for the Internet topology , 2001, GLOBECOM'01. IEEE Global Telecommunications Conference (Cat. No.01CH37270).

[12]  Mark E. J. Newman,et al.  The Structure and Function of Complex Networks , 2003, SIAM Rev..

[13]  Michalis Faloutsos,et al.  On power-law relationships of the Internet topology , 1999, SIGCOMM '99.

[14]  Aoying Zhou,et al.  Towards modeling popularity of microblogs , 2013, Frontiers of Computer Science.

[15]  S. Redner How popular is your paper? An empirical study of the citation distribution , 1998, cond-mat/9804163.

[16]  Divesh Srivastava,et al.  Dense subgraph maintenance under streaming edge weight updates for real-time story identification , 2012, The VLDB Journal.

[17]  Ko Fujimura,et al.  Improving tweet stream classification by detecting changes in word probability , 2012, SIGIR '12.

[18]  Pablo Rodriguez,et al.  The little engine(s) that could: scaling online social networks , 2010, SIGCOMM '10.

[19]  Johan Bollen,et al.  Modeling Public Mood and Emotion: Twitter Sentiment and Socio-Economic Phenomena , 2009, ICWSM.

[20]  Andrey E. Miroshnichenko,et al.  Extended SSH Model: Non-Local Couplings and Non-Monotonous Edge States , 2018, Physics.

[21]  Jianjun Xie,et al.  Modeling microblogging communication based on human dynamics , 2011, 2011 Eighth International Conference on Fuzzy Systems and Knowledge Discovery (FSKD).

[22]  Alan Bensky,et al.  Technologies and applications , 2019, Short-range Wireless Communication.

[23]  Junghoo Cho,et al.  Topical semantics of twitter links , 2011, WSDM '11.

[24]  Walter Quattrociocchi,et al.  Selection in scientific networks , 2010, Social Network Analysis and Mining.

[25]  Ramanathan V. Guha,et al.  Information diffusion through blogspace , 2004, SKDD.

[26]  S. Redner,et al.  Connectivity of growing random networks. , 2000, Physical review letters.

[27]  Hai Jin,et al.  Minimizing inter-server communications by exploiting self-similarity in online social networks , 2012, ICNP.

[28]  Duncan J. Watts,et al.  Collective dynamics of ‘small-world’ networks , 1998, Nature.

[29]  Joaquín Salvachúa,et al.  Social Stream, a social network framework , 2012, The First International Conference on Future Generation Communication Technologies.

[30]  M. Newman,et al.  Coauthorship and citation patterns in the Physical Review. , 2013, Physical review. E, Statistical, nonlinear, and soft matter physics.