Applying and Comparing Hidden Markov Model and Fuzzy Clustering Algorithms to Web Usage Data for Recommender Systems

In this study, we apply and compare some of the methods of usage pattern discovery, like simple k-means clustering algorithm, fuzzy relational subtractive clustering algorithm, fuzzy mean field annealing (MFA) clustering and Hidden Markov Model (HMM), for recommender systems. We use metrics like prediction strength, hit ratio, precision, prediction ability and F-Score to compare the applied methods on the Web usage data. Fuzzy MFA and HMM acted better than other methods due to fuzzy nation of human behavior in navigation and extra information utilized in sequence analysis.

[1]  Jr. G. Forney,et al.  The viterbi algorithm , 1973 .

[2]  Nasser M. Nasrabadi,et al.  Pattern Recognition and Machine Learning , 2006, Technometrics.

[3]  Won Don Lee,et al.  A Mean Field Annealing Algorithm for Fuzzy Clustering , 2007, Fourth International Conference on Fuzzy Systems and Knowledge Discovery (FSKD 2007).

[4]  Anupam,et al.  Mining Web Access Logs Using Relational Competitive Fuzzy Clustering , 1999 .

[5]  Padhraic Smyth,et al.  Model-Based Clustering and Visualization of Navigation Patterns on a Web Site , 2003, Data Mining and Knowledge Discovery.

[6]  Radford M. Neal Pattern Recognition and Machine Learning , 2007, Technometrics.

[7]  Cyrus Shahabi,et al.  Knowledge discovery from users Web-page navigation , 1997, Proceedings Seventh International Workshop on Research Issues in Data Engineering. High Performance Database Management for Large-Scale Applications.

[8]  Anupam Joshi,et al.  On Mining Web Access Logs , 2000, ACM SIGMOD Workshop on Research Issues in Data Mining and Knowledge Discovery.

[9]  Nematollaah Shiri,et al.  An Efficient Technique for Mining Usage Profiles Using Relational Fuzzy Subtractive Clustering , 2005, International Workshop on Challenges in Web Information Retrieval and Integration.

[10]  Oren Etzioni,et al.  The World-Wide Web: quagmire or gold mine? , 1996, CACM.

[11]  L. Baum,et al.  A Maximization Technique Occurring in the Statistical Analysis of Probabilistic Functions of Markov Chains , 1970 .

[12]  Ajith Abraham,et al.  Business Intelligence from Web Usage Mining , 2003, J. Inf. Knowl. Manag..

[13]  Tao Luo,et al.  Using sequential and non-sequential patterns in predictive Web usage mining tasks , 2002, 2002 IEEE International Conference on Data Mining, 2002. Proceedings..

[14]  Jaideep Srivastava,et al.  Incorporating Concept Hierarchies into Usage Mining Based Recommendations , 2006, WEBKDD.

[15]  Jaideep Srivastava,et al.  Web usage mining: discovery and applications of usage patterns from Web data , 2000, SKDD.

[16]  M. S. Ryan,et al.  The Viterbi Algorithm 1 1 The Viterbi Algorithm . , 2009 .