Organization of Information for the Web Using Hierarchical Fuzzy Clustering Algorithm Based on Co-occurrence Networks

In this paper, we present a Hierarchical Fuzzy Clustering algorithm which uses domain knowledge to automatically determine the number of clusters and their initial values. The algorithm is applied on a collection of web pages and the results are compared with existing algorithms in the literature.

[1]  Weina Wang,et al.  On fuzzy cluster validity indices , 2007, Fuzzy Sets Syst..

[2]  Shyi-Ming Chen,et al.  A new method for fuzzy information retrieval based on fuzzy hierarchical clustering and fuzzy inference techniques , 2005, IEEE Transactions on Fuzzy Systems.

[3]  Inderjit S. Dhillon,et al.  Enhanced word clustering for hierarchical text classification , 2002, KDD.

[4]  Guy Melançon,et al.  Revealing Hidden Community Structures and Identifying Bridges in Complex Networks: An Application to Analyzing Contents of Web Pages for Browsing , 2009, 2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology.

[5]  Jeffrey Zeldman,et al.  Taking Your Talent to the Web: A Guide for the Transitioning Designer , 2001 .

[6]  Sushmita Mitra,et al.  Web mining: a survey in the fuzzy framework , 2004, Fuzzy Sets Syst..

[7]  William-Chandra Tjhi,et al.  Possibilistic fuzzy co-clustering of large document collections , 2007, Pattern Recognit..

[8]  Guy Melançon,et al.  Identifying the presence of communities in complex networks through topological decomposition and component densities , 2010, EGC.

[9]  George Karypis,et al.  A Comparison of Document Clustering Techniques , 2000 .

[10]  J. Bezdek Cluster Validity with Fuzzy Sets , 1973 .

[11]  E. Trauwaert On the meaning of Dunn's partition coefficient for fuzzy clusters , 1988 .

[12]  Albert,et al.  Emergence of scaling in random networks , 1999, Science.

[13]  Gloria Bordogna,et al.  Hierarchical-Hyperspherical Divisive Fuzzy C-Means (H2D-FCM) Clustering for Information Retrieval , 2009, 2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology.