How to Derive Fuzzy User Categories for Web Personalization

Today, Web personalization offers valid tools for the development of applications that have the attractive property to meet in a more effective manner the needs of their users. To do this, Web developers have to address an important challenge concerning the discovery of knowledge about interests that users exhibit during their interactions with Web sites. Web Usage Mining (WUM) is an active research area aimed at the discovery of useful patterns of typical user behaviors by exploiting usage data. Among the different proposed techniques for WUM, clustering has been widely employed in order to categorize users by grouping together users sharing similar interests. In particular, fuzzy clustering reveals to be an approach especially suitable to derive user categories from Web usage data available in log files. Usually, fuzzy clustering is based on the use of distance-based metrics (such as the Euclidean measure) to evaluate similarity between user preferences. However, the use of such measures may lead to ineffective results by identifying user categories that do not capture the semantic information incorporated in the original Web usage data. In particular, in this chapter, we propose an approach based on a relational fuzzy clustering algorithm equipped with a fuzzy similarity measure to derive user categories. As an application example, we apply the proposed approach on usage data extracted from log files of a real Web site. A comparison with the results obtained using the cosine measure is shown to demonstrate the effectiveness of the fuzzy similarity measure.

[1]  Anupam Joshi,et al.  On Mining Web Access Logs , 2000, ACM SIGMOD Workshop on Research Issues in Data Mining and Knowledge Discovery.

[2]  Giovanna Castellano,et al.  LODAP: a log data preprocessor for mining web browsing patterns , 2007 .

[3]  Beatrice Lazzerini,et al.  A Hierarchical Fuzzy Clustering-based System to Create User Profiles , 2007, Soft Comput..

[4]  Anupam Joshi,et al.  Low-complexity fuzzy relational clustering algorithms for Web mining , 2001, IEEE Trans. Fuzzy Syst..

[5]  Xiaozhe Wang,et al.  Intelligent web traffic mining and analysis , 2005, J. Netw. Comput. Appl..

[6]  Ejub Kajan Information Technology Encyclopedia and Acronyms , 2002, Springer Berlin Heidelberg.

[7]  Giovanna Castellano,et al.  Relational fuzzy approach for mining user profiles , 2007 .

[8]  Ajith Abraham i-Miner: a Web usage mining framework using hierarchical intelligent systems , 2003, The 12th IEEE International Conference on Fuzzy Systems, 2003. FUZZ '03..

[9]  Constantin V. Negoita,et al.  On Fuzzy Systems , 1978 .

[10]  Thomas A. Runkler,et al.  Web mining with relational clustering , 2003, Int. J. Approx. Reason..

[11]  Nematollaah Shiri,et al.  An Efficient Technique for Mining Usage Profiles Using Relational Fuzzy Subtractive Clustering , 2005, International Workshop on Challenges in Web Information Retrieval and Integration.

[12]  Anupam Joshi,et al.  Relational clustering based on a new robust estimator with application to Web mining , 1999, 18th International Conference of the North American Fuzzy Information Processing Society - NAFIPS (Cat. No.99TH8397).

[13]  Sushmita Mitra,et al.  Web mining: a survey in the fuzzy framework , 2004, Fuzzy Sets Syst..

[14]  M. Amparo Vila,et al.  Obtaining User Profiles Via Web Usage Mining , 2008, IADIS European Conf. Data Mining.

[15]  Michalis Vazirgiannis,et al.  Cluster validity methods: part I , 2002, SGMD.

[16]  Manfred Broy Software Engineering From Auxiliary to Key Technology , 2002, Software Pioneers.

[17]  Jaideep Srivastava,et al.  Automatic personalization based on Web usage mining , 2000, CACM.

[18]  Bamshad Mobasher,et al.  Web Usage Mining and Personalization , 2004, The Practical Handbook of Internet Computing.

[19]  Wolfgang Grellmann,et al.  Crack resistance behavior of polyvinylchloride , 1997 .

[20]  Anupam Joshi,et al.  Automatic Web User Profiling and Personalization Using Robust Fuzzy Relational Clustering , 2002 .

[21]  Athena Vakali,et al.  An Overview of Web Data Clustering Practices , 2004, EDBT Workshops.

[22]  Anupam Joshi,et al.  Extracting Web User Profiles Using Relational Competitive Fuzzy Clustering , 2000, Int. J. Artif. Intell. Tools.

[23]  Pengfei Shi,et al.  Similarity measures on intuitionistic fuzzy sets , 2003, Pattern Recognit. Lett..

[24]  James C. Bezdek,et al.  Pattern Recognition with Fuzzy Objective Function Algorithms , 1981, Advanced Applications in Pattern Recognition.

[25]  Pier Luca Lanzi,et al.  Mining interesting knowledge from weblogs: a survey , 2005, Data Knowl. Eng..

[26]  Yves Lechevallier,et al.  Dissimilarities for Web Usage Mining , 2006, Data Science and Classification.

[27]  H. Hers,et al.  Lysosomes and storage diseases , 1973 .