Exploiting Domain Knowledge by Automated Taxonomy Generation in Recommender Systems

The effectiveness of incorporating domain knowledge into recommender systems to address their sparseness problem and improve their prediction accuracy has been discussed in many research works. However, this technique is usually restrained in practice because of its high computational expense. Although cluster analysis can alleviate the computational complexity of the recommendation procedure, it is not satisfactory in preserving pair-wise item similarities, which would severely impair the recommendation quality. In this paper, we propose an efficient approach based on the technique of Automated Taxonomy Generation to exploit relational domain knowledge in recommender systems so as to achieve high system scalability and prediction accuracy. Based on the domain knowledge, a hierarchical data model is synthesized in an offline phase to preserve the original pairwise item similarities. The model is then used by online recommender systems to facilitate the similarity calculation and keep their recommendation quality comparable to those systems by means of real-time exploiting domain knowledge. Experiments were conducted upon real datasets to evaluate our approach.

[1]  Alfred Kobsa,et al.  The Adaptive Web, Methods and Strategies of Web Personalization , 2007, The Adaptive Web.

[2]  Shui-Lung Chuang,et al.  Towards automatic generation of query taxonomy: a hierarchical query clustering approach , 2002, 2002 IEEE International Conference on Data Mining, 2002. Proceedings..

[3]  Lars Schmidt-Thieme,et al.  Taxonomy-driven computation of product recommendations , 2004, CIKM '04.

[4]  George Karypis,et al.  Evaluation of hierarchical clustering algorithms for document datasets , 2002, CIKM '02.

[5]  W. Bruce Croft,et al.  Generating hierarchical summaries for web searches , 2003, SIGIR '03.

[6]  Gediminas Adomavicius,et al.  Toward the next generation of recommender systems: a survey of the state-of-the-art and possible extensions , 2005, IEEE Transactions on Knowledge and Data Engineering.

[7]  Stuart E. Middleton,et al.  Ontological user profiling in recommender systems , 2004, TOIS.

[8]  David G. Stork,et al.  Pattern Classification (2nd ed.) , 1999 .

[9]  Shigeo Abe DrEng Pattern Classification , 2001, Springer London.

[10]  Pu-Jen Cheng,et al.  Auto-generation of topic hierarchies for web images from users' perspectives , 2003, CIKM '03.

[11]  Shourya Roy,et al.  A hierarchical monothetic document clustering algorithm for summarization and browsing search results , 2004, WWW '04.

[12]  David G. Stork,et al.  Pattern classification, 2nd Edition , 2000 .

[13]  B. Mobasher 3 Data Mining for Web Personalization , 2007 .

[14]  Mary Shapcott,et al.  Generating semantically enriched user profiles for Web personalization , 2007, TOIT.

[15]  Bamshad Mobasher,et al.  Intelligent Techniques for Web Personalization , 2005, Lecture Notes in Computer Science.

[16]  Pádraig Cunningham,et al.  Ontology Discovery for the Semantic Web Using Hierarchical Clustering , 2002 .

[17]  Philip S. Yu,et al.  LinkClus: efficient clustering via heterogeneous semantic links , 2006, VLDB.

[18]  Bamshad Mobasher,et al.  A Road Map to More Effective Web Personalization: Integrating Domain Knowledge with Web Usage Mining , 2003, International Conference on Internet Computing.

[19]  Daniel Boley,et al.  Hierarchical Taxonomies using Divisive Partitioning , 1998 .

[20]  Raghu Krishnapuram,et al.  Automatic Taxonomy Generation: Issues and Possibilities , 2003, IFSA.

[21]  Bernard De Baets,et al.  Fuzzy Sets and Systems — IFSA 2003 , 2003, Lecture Notes in Computer Science.