Extracting Latent Weblog Communities-A Partitioning Algorithm for Bipartite Graphs -

Abstract: I propose the concept of a latent weblog community (LBC), as a means to promote the autonomous organization of knowledge on the Internet. Such communities can be illustrated in terms of bipartite graphs based on weblog update information, and they can effectively function to create meeting spaces for bloggers who write about similar or closely related topics but do not know each other. To extract these communities from blogspace, I developed a partitioning algorithm known as the Weakest Pair (WP) algorithm, which separates the weakest pairs of bloggers and webpages, respectively, using co-citation information. As a result of numerical evaluation, the WP algorithm is more effective than the Shortest Path Betweenness (SPB) algorithm in terms of information loss and completeness of bipartite graphs. I will provide three examples of LBC extracted using the WP algorithm and report its secondary effects, i.e. personae detection, the detection of a set of weblogs owned by a single blogger.

[1]  Krishna Bharat,et al.  Improved algorithms for topic distillation in a hyperlinked environment , 1998, SIGIR '98.

[2]  Jon M. Kleinberg,et al.  Inferring Web communities from link topology , 1998, HYPERTEXT '98.

[3]  M. KleinbergJon Authoritative sources in a hyperlinked environment , 1999 .

[4]  D. Libby RDF Site Summary (RSS) 0.9 official DTD , 1999 .

[5]  Ravi Kumar,et al.  Trawling the Web for Emerging Cyber-Communities , 1999, Comput. Networks.

[6]  M. Newman,et al.  Scientific collaboration networks. II. Shortest paths, weighted networks, and centrality. , 2001, Physical review. E, Statistical, nonlinear, and soft matter physics.

[7]  M E J Newman,et al.  Community structure in social and biological networks , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[8]  M E J Newman,et al.  Fast algorithm for detecting community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[9]  M E J Newman,et al.  Finding and evaluating community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.