An Effective Web User Analysis and Clustering using FPCM Algorithm

Internet is one of the users friendly to the world and also gives lot of information for every activity. Based on this there is huge amount of development in World Wide Web. There are varieties of issues connected with the existing web usage mining approaches. Applying data mining techniques to the discovery of usage patterns from Web data and various applications is the process for web usage mining. Practical applicability is one of the problems in web usage mining. The essence of accurate fast time routine of web user information systems to the needs of their user. Preprocessing and clustering of web user are important evolve in this paper. Preprocessing is one of the key processes to remove noisy log data and to produce a clear data as an effective mining process. Examine a novel approach to removing local and global noise and web robots and by preprocessing, clustering Web site users into different groups and generating common user profiles. This type of profile is useful for the user to develop their own personalize website and for various applications. FPCM (Fuzzy Possibilistic C Means) is one of the algorithms to use easily. FPCM gives comparable results with FCM (Fuzzy C Means) reported in the literature of web mining. Anonymous Microsoft Web Dataset is used for evaluating the proposed preprocessing technique and clustering process.

[1]  Kyuseok Shim,et al.  Data mining and the Web: past, present and future , 1999, WIDM '99.

[2]  F. Moura-Pires,et al.  A fuzzy clustering model of data and fuzzy c-means , 2000, Ninth IEEE International Conference on Fuzzy Systems. FUZZ- IEEE 2000 (Cat. No.00CH37063).

[3]  Oren Etzioni,et al.  Adaptive Web Sites: an AI Challenge , 1997, IJCAI.

[4]  J. Bezdek,et al.  FCM: The fuzzy c-means clustering algorithm , 1984 .

[5]  Umeshwar Dayal,et al.  From User Access Patterns to Dynamic Hypertext Linking , 1996, Comput. Networks.

[6]  Philip S. Yu Data mining and personalization technologies , 1999, Proceedings. 6th International Conference on Advanced Systems for Advanced Applications.

[7]  Vir V. Phoha,et al.  Web user clustering from access log using belief function , 2001, K-CAP '01.

[8]  Jian Xiao,et al.  Robust Interval Type-2 Possibilistic C-means Clustering and its Application for Fuzzy Modeling , 2009, 2009 Sixth International Conference on Fuzzy Systems and Knowledge Discovery.

[9]  William-Chandra Tjhi,et al.  Fuzzy co-clustering of Web documents , 2005, 2005 International Conference on Cyberworlds (CW'05).

[10]  James C. Bezdek,et al.  Pattern Recognition with Fuzzy Objective Function Algorithms , 1981, Advanced Applications in Pattern Recognition.

[11]  Rong Jin,et al.  Speedup of fuzzy and possibilistic kernel c-means for large-scale clustering , 2011, 2011 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE 2011).

[12]  Jaideep Srivastava,et al.  Web mining: information and pattern discovery on the World Wide Web , 1997, Proceedings Ninth IEEE International Conference on Tools with Artificial Intelligence.

[13]  Peter Brusilovsky,et al.  Open user profiles for adaptive news systems: help or harm? , 2007, WWW '07.

[14]  Jaideep Srivastava,et al.  Creating adaptive Web sites through usage-based clustering of URLs , 1999, Proceedings 1999 Workshop on Knowledge and Data Engineering Exchange (KDEX'99) (Cat. No.PR00453).

[15]  R. Selmic,et al.  Web User Clustering and Its Application to Prefetching Using ART Neural Networks , 2003 .

[16]  J. C. Dunn,et al.  A Fuzzy Relative of the ISODATA Process and Its Use in Detecting Compact Well-Separated Clusters , 1973 .

[17]  Fernando Moura-Pires,et al.  Multiple Prototype Model for Fuzzy Clustering , 1999, IDA.

[18]  Tadeusz Morzy,et al.  Web Users Clustering , 2002 .

[19]  A. Vaishnavi,et al.  Effective Web personalization system using Modified Fuzzy Possibilistic C Means , 2011 .