Web user clustering analysis based on KMeans algorithm

As one of the most important tasks of Web Usage Mining (WUM), web user clustering, which establishes groups of users exhibiting similar browsing patterns, provides useful knowledge to personalized web services. In this paper, we cluster web users with KMeans algorithm based on web user log data. Given a set of web users and their associated historical web usage data, we study their behavior characteristic and cluster them. Experiment results show the feasibility and efficiency of such algorithm application. Web user clusters generated in this way can provide novel and useful knowledge for various personalized web applications.

[1]  Tao Luo,et al.  Effective personalization based on association rule discovery from web usage data , 2001, WIDM '01.

[2]  Dhananjay S. Phatak,et al.  Clustering for personalized mobile Web usage , 2002, 2002 IEEE World Congress on Computational Intelligence. 2002 IEEE International Conference on Fuzzy Systems. FUZZ-IEEE'02. Proceedings (Cat. No.02CH37291).

[3]  Qiang Yang,et al.  Mining web logs for prediction models in WWW caching and prefetching , 2001, KDD '01.

[4]  Jaideep Srivastava,et al.  Web usage mining: discovery and application of interesting patterns from web data , 2000 .

[5]  LalmasMounia Dempster-Shafer's theory of evidence applied to structured documents , 1997 .

[6]  Mounia Lalmas,et al.  Dempster-Shafer's theory of evidence applied to structured documents: modelling uncertainty , 1997, SIGIR '97.

[7]  Oren Etzioni,et al.  Adaptive Web Sites: Automatically Synthesizing Web Pages , 1998, AAAI/IAAI.

[8]  Jaideep Srivastava,et al.  Creating adaptive Web sites through usage-based clustering of URLs , 1999, Proceedings 1999 Workshop on Knowledge and Data Engineering Exchange (KDEX'99) (Cat. No.PR00453).

[9]  Olfa Nasraoui,et al.  A New Evolutionary Approach to Web Usage and Context Sensitive Associations Mining , 2002 .

[10]  Jaideep Srivastava,et al.  Web usage mining: discovery and applications of usage patterns from Web data , 2000, SKDD.

[11]  Ke Wang,et al.  Classification Pruning for Web-request Prediction , 2001, WWW Posters.

[12]  Vir V. Phoha,et al.  Web user clustering from access log using belief function , 2001, K-CAP '01.