Integrating Semantic Knowledge with Web Usage Mining for Personalization

Web usage mining has been used effectively as an approach to automatic personalization and as a way to overcome deficiencies of traditional approaches such as collaborative filtering. Despite their success, such systems, as in more traditional ones, do not take into account the semantic knowledge about the underlying domain. Without such semantic knowledge, personalization systems cannot recommend different types of complex objects based on their underlying properties and attributes. Nor can these systems possess the ability to automatically explain or reason about the user models or user recommendations. The integration of semantic knowledge is, in fact, the primary challenge for the next generation of personalization systems. In this chapter we provide an overview of approaches for incorporating semantic knowledge into Web usage mining and personalization processes. In particular, we discuss the issues and requirements for successful integration of semantic knowledge from different sources, such as the content and the structure of Web sites for personalization. Finally, we present a general framework for fully integrating domain ontologies with Web usage This chapter appears in the book, Web Mining: Applications and Techniques, edited by Anthony Scime. Copyright © 2005, Idea Group Inc. Copying or distributing in print or electronic forms without written permission of Idea Group Inc. is prohibited. 701 E. Chocolate Avenue, Suite 200, Hershey PA 17033-1240, USA Tel: 717/533-8845; Fax 717/533-8661; URL-http://www.idea-group.com IDEA GROUP PUBLISHING Integrating Semantic Knowledge with Web Usage Mining for Personalization 277 Copyright © 2005, Idea Group Inc. Copying or distributing in print or electronic forms without written permission of Idea Group Inc. is prohibited. mining and personalization processes at different stages, including the preprocessing and pattern discovery phases, as well as in the final stage where the discovered patterns are used for personalization. INTRODUCTION With the continued growth and proliferation of e-commerce, Web services, and Web-based information systems, personalization has emerged as a critical application that is essential to the success of a Website. It is now common for Web users to encounter sites that provide dynamic recommendations for products and services, targeted banner advertising, and individualized link selections. Indeed, nowhere is this phenomenon more apparent as in the business-to-consumer e-commerce arena. The reason is that, in today’s highly competitive e-commerce environment, the success of a site often depends on the site’s ability to retain visitors and turn casual browsers into potential customers. Automatic personalization and recommender system technologies have become critical tools, precisely because they help engage visitors at a deeper and more intimate level by tailoring the site’s interaction with a visitor to her needs and interests. Web personalization can be defined as any action that tailors the Web experience to a particular user, or a set of users (Mobasher, Cooley & Srivastava, 2000a). The experience can be something as casual as browsing a Website or as (economically) significant as trading stocks or purchasing a car. Principal elements of Web personalization include modeling of Web objects (pages, etc.) and subjects (users), categorization of objects and subjects, matching between and across objects and/or subjects, and determination of the set of actions to be recommended for personalization. The actions can range from simply making the presentation more pleasing to anticipating the needs of a user and providing customized information. Traditional approaches to personalization have included both content-based and user-based techniques. Content-based techniques use personal profiles of users and recommend other items or pages based on their content similarity to the items or pages that are in the user’s profile. The underlying mechanism in these systems is usually the comparison of sets of keywords representing pages or item descriptions. Examples of such systems include Letizia (Lieberman, 1995) and WebWatcher (Joachims, Freitag & Mitchell, 1997). While these systems perform well from the perspective of the end user who is searching the Web for information, they are less useful in e-commerce applications, partly due to the lack of server-side control by site owners, and partly because techniques based on content similarity alone may miss other types of semantic relationships among objects (for example, the associations among products or services that are semantically different, but are often used together). User-based techniques for personalization, on the other hand, primarily focus on the similarities among users rather than item-based similarities. The most widely used technology user-based personalization is collaborative filtering (CF) (Herlocker, Konstan, Borchers & Riedl, 1999). Given a target user’s record of activity or preferences, CF-based techniques compare that record with the historical records of other users in order to find the users with similar interests. This is the so-called neighborhood of the current user. The mapping of a visitor record to its neighborhood could be based on similarity in ratings of items, access to similar content or pages, or purchase of similar items. The identified 29 more pages are available in the full version of this document, which may be purchased using the "Add to Cart" button on the publisher's webpage: www.igi-global.com/chapter/integrating-semantic-knowledgeweb-usage/31143

[1]  John G. Gammack Constructive design environments: implementing end-user systems development , 1999 .

[2]  Jaideep Srivastava,et al.  Web mining: information and pattern discovery on the World Wide Web , 1997, Proceedings Ninth IEEE International Conference on Tools with Artificial Intelligence.

[3]  Claude Ghaoui,et al.  Encyclopedia of Human Computer Interaction , 2005 .

[4]  Oren Etzioni,et al.  Adaptive Web Sites: Automatically Synthesizing Web Pages , 1998, AAAI/IAAI.

[5]  R. Natarajan,et al.  EDSS for product maintenance in a global environment , 1991 .

[6]  Panagiotis Germanakos,et al.  Intelligent User Interfaces: Adaptation and Personalization Systems and Technologies , 2008 .

[7]  Luigi Palopoli,et al.  Uniform Techniques for Deriving Similarities of Objects and Subschemes in Heterogeneous Databases , 2003, IEEE Trans. Knowl. Data Eng..

[8]  Tao Luo,et al.  Discovery and Evaluation of Aggregate Usage Profiles for Web Personalization , 2004, Data Mining and Knowledge Discovery.

[9]  B. Mobasher,et al.  USING SEMANTIC SIMILARITY TO ENHANCE ITEM-BASED COLLABORATIVE FILTERING , 2003 .

[10]  John Riedl,et al.  An algorithmic framework for performing collaborative filtering , 1999, SIGIR '99.

[11]  Mo Adam Mahmood,et al.  A New Approach to Evaluating Business Ethics : An Artificial Neural Networks Application , 2018 .

[12]  Jaideep Srivastava,et al.  Automatic personalization based on Web usage mining , 2000, CACM.

[13]  Martin van den Berg,et al.  Focused Crawling: A New Approach to Topic-Specific Web Resource Discovery , 1999, Comput. Networks.

[14]  Natalya Sergeeva,et al.  Towards a theoretical framework for creative participation : how personal characteristics influence employees' willingness to contribute ideas , 2011 .

[15]  Dieter Pfoser Indexing the Trajectories of Moving Objects , 2002 .

[16]  Andry Rakotonirainy,et al.  In-Vehicle Avatars to Elicit Social Response and Change Driving Behaviour , 2009, Int. J. Technol. Hum. Interact..

[17]  Mark Claypool,et al.  Combining Content-Based and Collaborative Filters in an Online Newspaper , 1999, SIGIR 1999.

[18]  Jaideep Srivastava,et al.  Web usage mining: discovery and applications of usage patterns from Web data , 2000, SKDD.

[19]  David C. Rine,et al.  Integrated approaches in information technology and web engineering : advancing organizational knowledge sharing / [edited by] Ghazi Alkhatib, David Rine. , 2008 .

[20]  Tom M. Mitchell,et al.  Improving Text Classification by Shrinkage in a Hierarchy of Classes , 1998, ICML.

[21]  Yeliz Yesilada,et al.  How Do People Use Their Mobile Phones?: A Field Study of Small Device Users , 2011, Int. J. Mob. Hum. Comput. Interact..

[22]  Ian Horrocks,et al.  Ontology Reasoning in the SHOQ(D) Description Logic , 2001, IJCAI.

[23]  Siegfried Handschuh,et al.  Ontology-Focused Crawling of Documents and Relational Metadata , 2003, WWW 2003.

[24]  Jaideep Srivastava,et al.  Data Preparation for Mining World Wide Web Browsing Patterns , 1999, Knowledge and Information Systems.

[25]  Mervyn A. Jack,et al.  Multimedia Technology in the Financial Services Sector: Customer Satisfaction with Alternatives to Face-to-Face Interaction in Mortgage Sales , 2011, Int. J. Technol. Hum. Interact..

[26]  Gheorghita Ghinea,et al.  Impact of Cognitive Style on User Perception of Dynamic Video Content , 2009 .

[27]  Steffen Staab,et al.  Discovering Conceptual Relations from Text , 2000, ECAI.

[28]  Thorsten Joachims,et al.  Web Watcher: A Tour Guide for the World Wide Web , 1997, IJCAI.

[29]  Themistoklis Palpanas,et al.  Web prefetching using partial match prediction , 1998 .

[30]  Jiawei Han,et al.  Discovery of Multiple-Level Association Rules from Large Databases , 1995, VLDB.

[31]  Eitel J. M. Lauría Exploring the Behavioral Dimension of Client/Server Technology Implementation: An Empirical Investigation , 2006, Int. J. Technol. Hum. Interact..

[32]  Pádraig Cunningham,et al.  Ontology Discovery for the Semantic Web Using Hierarchical Clustering , 2002 .

[33]  Steffen Staab,et al.  Ontology-based text clustering , 2001, IJCAI 2001.

[34]  Pedro M. Domingos,et al.  Relational Markov models and their application to adaptive web navigation , 2002, KDD.

[35]  Bernd Carsten Stahl,et al.  Responsibility for Information Assurance and Privacy: A Problem of Individual Ethics? , 2004, J. Organ. End User Comput..

[36]  José Palazzo Moreira de Oliveira,et al.  Concept-based knowledge discovery in texts extracted from the Web , 2000, SKDD.

[37]  Myra Spiliopoulou,et al.  Analysis of navigation behaviour in web sites integrating multiple information systems , 2000, The VLDB Journal.

[38]  Steve Clarke,et al.  End-user Computing: Concepts, Methodologies, Tools and Applications , 2008 .

[39]  Sergio Greco,et al.  Proceedings of the European Conference on Logics in Artificial Intelligence , 2002 .

[40]  Raymond J. Mooney,et al.  Content-boosted collaborative filtering for improved recommendations , 2002, AAAI/IAAI.

[41]  Anabela Mesquita,et al.  Technology for Creativity and Innovation: Tools, Techniques and Applications , 2011 .

[42]  Jon M. Kleinberg,et al.  Automatic Resource Compilation by Analyzing Hyperlink Structure and Associated Text , 1998, Comput. Networks.

[43]  Christoph Igel,et al.  From E-Learning Tools to Assistants by Learner Modelling and Adaptive Behavior , 2007 .

[44]  Jennifer Widom,et al.  Exploiting hierarchical domain structure to compute similarity , 2003, TOIS.

[45]  Andrew E. Fano,et al.  Building Recommender Systems using a Knowledge Base of Product Semantics , 2002 .

[46]  Tao Luo,et al.  Effective personalization based on association rule discovery from web usage data , 2001, WIDM '01.

[47]  C. Lee Giles,et al.  CiteSeer: an autonomous Web agent for automatic retrieval and identification of interesting publications , 1998, AGENTS '98.

[48]  Ben Taskar,et al.  Learning Probabilistic Models of Relational Structure , 2001, ICML.

[49]  Henry Lieberman,et al.  Letizia: An Agent That Assists Web Browsing , 1995, IJCAI.

[50]  Tao Luo,et al.  Integrating Web Usage and Content Mining for More Effective Personalization , 2000, EC-Web.

[51]  Myra Spiliopoulou,et al.  Web usage mining for Web site evaluation , 2000, CACM.

[52]  Bamshad Mobasher,et al.  Using Ontologies to Discover Domain-Level Web Usage Profiles , 2002 .

[53]  Max J. Egenhofer,et al.  Determining Semantic Similarity among Entity Classes from Different Ontologies , 2003, IEEE Trans. Knowl. Data Eng..

[54]  T. Joachims WebWatcher : A Tour Guide for the World Wide Web , 1997 .

[55]  A. M. Shapiro Site Maps for Hypertext , 2006 .

[56]  Bamshad Mobasher,et al.  An Adaptive Agent for Web Exploration Based on Concept Hierarchies , 2001 .

[57]  Gerd Stumme,et al.  Fast Computation of Concept lattices Using Data Mining Techniques , 2000, KRDB.

[58]  Mari Ervasti,et al.  Touch-Based Access to Mobile Internet: User Experience Findings , 2009, Int. J. Mob. Hum. Comput. Interact..

[59]  Michael J. Pazzani,et al.  A Framework for Collaborative, Content-Based and Demographic Filtering , 1999, Artificial Intelligence Review.

[60]  Tom M. Mitchell,et al.  Learning to construct knowledge bases from the World Wide Web , 2000, Artif. Intell..

[61]  Andreas Hotho,et al.  Towards Semantic Web Mining , 2002, SEMWEB.

[62]  Ricardo Gomez Success Factors in Public Access Computing for Development , 2011, Int. J. Inf. Commun. Technol. Hum. Dev..

[63]  Thomas Lukasiewicz,et al.  P-SHOQ(D): A Probabilistic Extension of SHOQ(D) for Probabilistic Ontologies in the Semantic Web , 2002, JELIA.