A framework for mining meaningful usage patterns within a semantically enhanced web portal

Semantic Web (SW) is a new trend in the evolution of the current Web aimed at extending its basic functionalities by providing computer-readable semantic meta-data about the Web content. The meta-data is typically organized into a domain ontology where key concepts and relations from the domain appear. The benefits of such a representation are manifold: a more topical information seeking process, better content adaptation and higher interoperability even on the current, still largely syntactical, Web, to name only a few. As the SW is, arguably, the future of the Web, it is only too natural that Web mining, i.e., the application of data mining techniques to web-related data, tackles the processing semantically annotated data. In this context, we study the detecting of typical navigation scenarios on an ontology-powered Web portal, i.e., an instance of usage mining on the SW. In the present paper, we tackle the fundamental aspects of the underlying mining problem and clarify the impact a fully-fledged ontology has on the data and pattern languages. Indeed, current ontology-aware mining approaches tend to limit their scope to the core conceptual hierarchy (taxonomy) of an ontology whereas in a realistic settings there will be a lot more knowledge in the ontology, in particular, on semantic relations between domain concepts, the way they instantiate into links between content objects, etc. We show that reflecting domain relations in the navigation patterns results in a new pattern structure that combines elements from sequential, generalized and graph pattern mining and therefore requires a dedicated mining strategy. After characterizing the underlying pattern space, we describe a dedicated level-wise mining method and present some empirical evidence of its viability.

[1]  Michalis Vazirgiannis,et al.  Introducing Semantics in Web Personalization: The Role of Ontologies , 2005, EWMF/KDO.

[2]  Duncan Dubugras Alcoba Ruiz,et al.  Ontology-Based Filtering Mechanisms for Web Usage Patterns Retrieval , 2005, EC-Web.

[3]  Gultekin Özsoyoglu,et al.  Taxonomy-superimposed graph mining , 2008, EDBT '08.

[4]  Ramakrishnan Srikant,et al.  Mining generalized association rules , 1995, Future Gener. Comput. Syst..

[5]  Chabane Djeraba,et al.  Toward Recommendation Based on Ontology-Powered Web-Usage Mining , 2007, IEEE Internet Computing.

[6]  Jan Rauch,et al.  Ontology-Enhanced Association Mining , 2005, EWMF/KDO.

[7]  Georgios Paliouras,et al.  Web Usage Mining as a Tool for Personalization: A Survey , 2003, User Modeling and User-Adapted Interaction.

[8]  Ramakrishnan Srikant,et al.  Mining sequential patterns , 1995, Proceedings of the Eleventh International Conference on Data Engineering.

[9]  Rakesh Agarwal,et al.  Fast Algorithms for Mining Association Rules , 1994, VLDB 1994.

[10]  Bamshad Mobasher,et al.  Using Ontologies to Discover Domain-Level Web Usage Profiles , 2002 .

[11]  Takashi Washio,et al.  An Apriori-Based Algorithm for Mining Frequent Substructures from Graph Data , 2000, PKDD.

[12]  Jiawei Han,et al.  Frequent pattern mining: current status and future directions , 2007, Data Mining and Knowledge Discovery.

[13]  Ramakrishnan Srikant,et al.  Mining Sequential Patterns: Generalizations and Performance Improvements , 1996, EDBT.

[14]  Ramakrishnan Srikant,et al.  Fast Algorithms for Mining Association Rules in Large Databases , 1994, VLDB.

[15]  Mehdi Adda Intégration des connaissances ontologiques dans la fouille de motifs séquentiels avec application à la personnalisation web , 2008 .

[16]  Joost N. Kok,et al.  Efficient Frequent Query Discovery in FARMER , 2003, PKDD.

[17]  Andreas Hotho,et al.  Towards Semantic Web Mining , 2002, SEMWEB.

[18]  Tomasz Imielinski,et al.  Mining association rules between sets of items in large databases , 1993, SIGMOD Conference.