Precursors and Laggards: An Analysis of Semantic Temporal Relationships on a Blog Network

Most current methods of quantifying the contribution of nodes in blog networks do not account for temporal relationships. We provide a method for measuring how early or late bloggers typically are, in the topic flow of a network of related blogs. Furthermore, we show that this type of analysis adds to the knowledge that can be extracted by studying the network only at the structural level of URL links. We present an algorithm to automatically detect fine-grained discussion topics, characterized by n-grams and time intervals. We then propose a probabilistic model to estimate the temporal relationships that blogs have with one another. We define the precursor score of blog A in relation to blog B as the probability that A enters a new topic before B, discounting the effect created by asymmetric posting rates. Network-level metrics of precursor and laggard behavior are derived from these dyadic precursor score estimations. This model is used to analyze a network of French political blogs. The scores are compared to traditional link degree metrics. We obtain insights into the dynamics of topic participation on this network, as well as the relationship between precursor/laggard and linking behaviors. We validate and analyze results with the help of an expert on the French blogosphere. Finally, we propose possible applications to the improvement of search engine ranking algorithms.

[1]  Gérard Lenclud,et al.  La culture s'attrape-t-elle ? , 1998 .

[2]  Xiang Ji,et al.  Topic evolution and social interactions: how authors effect research , 2006, CIKM '06.

[3]  D. Sperber,et al.  Explaining Culture: A Naturalistic Approach , 1998 .

[4]  Eytan Adar,et al.  Implicit Structure and the Dynamics of Blogspace , 2004 .

[5]  KleinbergJon Bursty and Hierarchical Structure in Streams , 2003 .

[6]  Ramanathan V. Guha,et al.  Information diffusion through blogspace , 2004, WWW '04.

[7]  Rajeev Motwani,et al.  The PageRank Citation Ranking : Bringing Order to the Web , 1999, WWW 1999.

[8]  Akshay Java,et al.  Tracking Influence and Opinions in Social Media , 2006 .

[9]  Bernardo A. Huberman,et al.  Predicting the Future with Social Media , 2010, Web Intelligence.

[10]  Ravi Kumar,et al.  On the Bursty Evolution of Blogspace , 2003, WWW '03.

[11]  T. Valente Social network thresholds in the diffusion of innovations , 1996 .

[12]  Jon M. Kleinberg,et al.  The structure of information pathways in a social communication network , 2008, KDD.

[13]  Steven Skiena,et al.  Newspapers vs. Blogs: Who Gets the Scoop? , 2006, AAAI Spring Symposium: Computational Approaches to Analyzing Weblogs.

[14]  Christos Faloutsos,et al.  Cascading Behavior in Large Blog Graphs , 2007 .

[15]  Jon M. Kleinberg,et al.  Bursty and Hierarchical Structure in Streams , 2002, Data Mining and Knowledge Discovery.

[16]  Gilad Mishne,et al.  Why Are They Excited? Identifying and Explaining Spikes in Blog Mood Levels , 2006, EACL.

[17]  Dan Sperber,et al.  Why Modeling Cultural Evolution Is Still Such a Challenge , 2006 .

[18]  Gilad Mishne,et al.  Capturing Global Mood Levels using Blog Posts , 2006, AAAI Spring Symposium: Computational Approaches to Analyzing Weblogs.

[19]  Bernardo A. Huberman,et al.  Predicting the Future with Social Media , 2010, 2010 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology.

[20]  Camille Roth,et al.  Socio-semantic Dynamics in a Blog Network , 2009, 2009 International Conference on Computational Science and Engineering.

[21]  D. Watts,et al.  Influentials, Networks, and Public Opinion Formation , 2007 .

[22]  Jeremy Ginsberg,et al.  Detecting influenza epidemics using search engine query data , 2009, Nature.

[23]  Helmut Schmidt,et al.  Probabilistic part-of-speech tagging using decision trees , 1994 .

[24]  Jure Leskovec,et al.  Meme-tracking and the dynamics of the news cycle , 2009, KDD.

[25]  Krishna P. Gummadi,et al.  Measuring User Influence in Twitter: The Million Follower Fallacy , 2010, ICWSM.

[26]  Tim Oates,et al.  Modeling the Spread of Influence on the Blogosphere , 2006 .