A Concentric-Circle Model for Community Mining in Graph Structures

Discovering communities from a graph structure such as the Web has become an interesting research problem recently. In this paper, comparing with the state-of-the-art authority detecting and graph partitioning methods, we propose a concentric-circle model to more accurately define communities. With this model, a community could be described as a set of concentric-circles. The most important objects representing the concept of a whole community lie in the center and are called core objects. Affiliated objects, which are related to the core objects, surround the core with different ranks. Base on the concentric-circle model, a novel algorithm is developed to discover communities conforming to this model. We also conducted a case study to automatically discover research interest groups in the computer science domain from the Web. Experiments show that our method is very effective to generate high-quality communities with more clear structure and more tunable granularity.

[1]  C. Lee Giles,et al.  Clustering and identifying temporal trends in document databases , 2000, Proceedings IEEE Advances in Digital Libraries 2000.

[2]  Tomasz Imielinski,et al.  Mining association rules between sets of items in large databases , 1993, SIGMOD Conference.

[3]  Jon M. Kleinberg,et al.  Inferring Web communities from link topology , 1998, HYPERTEXT '98.

[4]  Rajeev Motwani,et al.  The PageRank Citation Ranking : Bringing Order to the Web , 1999, WWW 1999.

[5]  AgrawalRakesh,et al.  Mining association rules between sets of items in large databases , 1993 .

[6]  David Harlan Wood,et al.  Discovering Shared Interests Among People Using Graph Analysis , 1993 .

[7]  Jon M Kleinberg,et al.  Hubs, authorities, and communities , 1999, CSUR.

[8]  Bart Selman,et al.  Referral Web: combining social networks and collaborative filtering , 1997, CACM.

[9]  Ravi Kumar,et al.  Trawling the Web for Emerging Cyber-Communities , 1999, Comput. Networks.

[10]  Sharon L. Milgram,et al.  The Small World Problem , 1967 .

[11]  David Cohn,et al.  Learning to Probabilistically Identify Authoritative Documents , 2000, ICML.

[12]  Bing Liu,et al.  Searching People on the Web According to Their Interests , 2002 .

[13]  Les Carr,et al.  Trailblazing the literature of hypertext: author co-citation analysis (1989–1998) , 1999, HYPERTEXT '99.

[14]  Jon M. Kleinberg,et al.  Mining the Web's Link Structure , 1999, Computer.

[15]  C. Lee Giles,et al.  Efficient identification of Web communities , 2000, KDD '00.

[16]  Bart Selman,et al.  The Hidden Web , 1997, AI Mag..

[17]  Martin van den Berg,et al.  Focused Crawling: A New Approach to Topic-Specific Web Resource Discovery , 1999, Comput. Networks.

[18]  Lada A. Adamic,et al.  Friends and neighbors on the Web , 2003, Soc. Networks.

[19]  Michael McGill,et al.  Introduction to Modern Information Retrieval , 1983 .