PNP: mining of profile navigational patterns

Web usage mining is a key knowledge discovery research and as such has been well researched. So far, this research has focused mainly on databases containing access log data only. However, many real-world databases contain users profile data and current solutions for this situation are still insufficient. In this paper we have a large database containing of user profile information together with user web-pages navigation patterns. The user profile data includes quantitative attributes, such as salary or age, and categorical attributes, such as sex or marital status. We introduce the concept of profile navigation patterns, which discusses the problem of relating user profile information to navigational behavior. An example of such profile navigation pattern might be 20% of married people between age 25 and 30 have the similar navigational behavior <(a,c)(c,h)(h,i)(i,h)(h,l)>, where a, c, h, i, l are web pages in a web site. The navigation patterns may contain the generic traversal behavior, e.g. trend to backward moves, cycles etc. The objective of mining profile navigation patterns is to identify browser profile for web personalization. We give an algorithm for mining such profile navigation patterns. Our method (algorithm PNP) can discover profile navigation patterns efficiently. We also present new inclination measurements to identify the interesting profile navigational patterns. Experimental results show the efficiency and scalability of PNP.

[1]  Philip S. Yu,et al.  Efficient Data Mining for Path Traversal Patterns , 1998, IEEE Trans. Knowl. Data Eng..

[2]  Prabhakar Raghavan,et al.  Mining the Link Structure of the World Wide Web , 1998 .

[3]  Jian Pei,et al.  Mining Access Patterns Efficiently from Web Logs , 2000, PAKDD.

[4]  Philip S. Yu,et al.  Data Mining: An Overview from a Database Perspective , 1996, IEEE Trans. Knowl. Data Eng..

[5]  Myra Spiliopoulou,et al.  WUM: A tool for Web Utilization analysis , 1999 .

[6]  Jaideep Srivastava,et al.  Web mining: information and pattern discovery on the World Wide Web , 1997, Proceedings Ninth IEEE International Conference on Tools with Artificial Intelligence.

[7]  WebUsersMyra Spiliopoulou,et al.  A Data Miner analyzing the Navigational Behaviour of , 1999 .

[8]  Jiawei Han,et al.  Discovering Web access patterns and trends by applying OLAP and data mining technology on Web logs , 1998, Proceedings IEEE International Forum on Research and Technology Advances in Digital Libraries -ADL'98-.

[9]  Jaideep Srivastava,et al.  Grouping Web page references into transactions for mining World Wide Web browsing patterns , 1997, Proceedings 1997 IEEE Knowledge and Data Engineering Exchange Workshop.

[10]  Tom M. Mitchell,et al.  Learning to Extract Symbolic Knowledge from the World Wide Web , 1998, AAAI/IAAI.

[11]  Philip S. Yu,et al.  SpeedTracer: A Web Usage Mining and Analysis Tool , 1998, IBM Syst. J..