Identifying the Large-Scale Structure of the Blogosphere

We analyze the topological structure of networks formed by the entries and trackbacks in the blogosphere, a collection of weblog articles. The analysis combines community extraction, network visualization, and keyword analysis. We show that the large-scale structure of the blogosphere is globally sparse but locally dense: the entries within a community form a dense structure, while the trackbacks that interconnect communities are sparse. The visualized networks show patterns resembling sparkling fireworks. We then characterize the communities using a tf-idf technique and find that each community discusses specific topics. These results help to identify the communities in which particular topics are discussed and to detect trends in the blogosphere.
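As an illustration of the keyword-analysis step, the following is a minimal sketch of tf-idf scoring in which each community is treated as one document. It is not the authors' implementation: the function names `community_tfidf` and `top_keywords`, the plain `tf * log(N / df)` weighting, and the toy token lists are all assumptions made for this example.

```python
import math
from collections import Counter


def community_tfidf(community_tokens):
    """Score terms per community, treating each community as one document.

    community_tokens: dict mapping a community id to a list of tokens
    (hypothetical input format, assumed for this sketch).
    """
    n_communities = len(community_tokens)
    # Term counts inside each community.
    term_counts = {c: Counter(toks) for c, toks in community_tokens.items()}
    # Document frequency: number of communities in which a term appears.
    doc_freq = Counter()
    for counts in term_counts.values():
        doc_freq.update(counts.keys())

    scores = {}
    for c, counts in term_counts.items():
        total = sum(counts.values())
        scores[c] = {
            term: (count / total) * math.log(n_communities / doc_freq[term])
            for term, count in counts.items()
        }
    return scores


def top_keywords(scores, k=3):
    """Return the k highest-scoring terms per community (ties broken by term)."""
    return {
        c: [t for t, _ in sorted(s.items(), key=lambda kv: (-kv[1], kv[0]))[:k]]
        for c, s in scores.items()
    }


# Toy data, invented for illustration only.
communities = {
    "A": ["soccer", "match", "goal", "blog", "soccer"],
    "B": ["election", "vote", "blog", "policy", "election"],
}
scores = community_tfidf(communities)
keywords = top_keywords(scores, k=1)
```

In this toy input, a term that occurs in every community (here "blog") receives a score of zero, so the terms that remain at the top are those specific to one community.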
