Fuzzy Category and Fuzzy Interest for Web User Understanding

Web usage mining is a research field for searching potentially useful and valuable information from web log file. Web log file is a simple list of pages that users refer. Therefore, it is not easy to analyze user's current interest field from web log file. This paper presents web usage mining method for finding users' current interest based on Fuzzy category. We consider not only how many times a user visits pages but also when he visits. We describe a user's current interest with a fuzzy interest degree to categories. Based on fuzzy categories and fuzzy interest degrees, we also propose a method for understanding web users. For this, we define the category vector space. We also present experiment results which shows how our method helps to understand web users.

[1]  Rakesh Agarwal,et al.  Fast Algorithms for Mining Association Rules , 1994, VLDB 1994.

[2]  Myra Spiliopoulou,et al.  Web usage mining for Web site evaluation , 2000, CACM.

[3]  Attila Gyenesei,et al.  A Fuzzy Approach for Mining Quantitative Association Rules , 2000, Acta Cybern..

[4]  Arbee L. P. Chen,et al.  Enabling personalized recommendation on the Web based on user interests and behaviors , 2001, Proceedings Eleventh International Workshop on Research Issues in Data Engineering. Document Management for Data Intensive Business and Scientific Applications. RIDE 2001.

[5]  Jaideep Srivastava,et al.  Web mining: information and pattern discovery on the World Wide Web , 1997, Proceedings Ninth IEEE International Conference on Tools with Artificial Intelligence.

[6]  Jaideep Srivastava,et al.  Automatic personalization based on Web usage mining , 2000, CACM.

[7]  Ramakrishnan Srikant,et al.  Fast Algorithms for Mining Association Rules in Large Databases , 1994, VLDB.

[8]  Sung-Hae Jun,et al.  Fuzzy Web Usage Mining for User Modeling , 2002, Int. J. Fuzzy Log. Intell. Syst..

[9]  Jaideep Srivastava,et al.  Data Preparation for Mining World Wide Web Browsing Patterns , 1999, Knowledge and Information Systems.