Different Aspects of Web Log Mining

The expansion of the World Wide Web has resulted in a large amount of data that is now freely available for user access. The data have to be managed and organized in such a way that the user can access them efficiently. For this reason the application of data mining techniques on the Web is now the focus of an increasing number of researchers. One key issue is the investigation of user navigational behavior from different aspects. For this reason different types of data mining techniques can be applied on the log file collected on the servers. In this paper three of the most important approaches are introduced for web log mining. All the three methods are based on the frequent pattern mining approach. The three types of patterns that can be used for obtain useful information about the navigational behavior of the users are page set, page sequence and page graph mining.

[1]  Kyuseok Shim,et al.  Data mining and the Web: past, present and future , 1999, WIDM '99.

[2]  Xin Jin,et al.  Web usage mining based on probabilistic latent semantic analysis , 2004, KDD.

[3]  Jian Pei,et al.  Mining Access Patterns Efficiently from Web Logs , 2000, PAKDD.

[4]  Mário J. Silva,et al.  Mining Web Access Logs of an On-line Newspaper , 2002 .

[5]  Yanchun Zhang,et al.  Effectively Finding Relevant Web Pages from Linkage Information , 2003, IEEE Trans. Knowl. Data Eng..

[6]  Yannis Manolopoulos,et al.  Finding Generalized Path Patterns for Web Log Data Mining , 2000, ADBIS-DASFAA.

[7]  MAGDALINI EIRINAKI,et al.  Web mining for web personalization , 2003, TOIT.

[8]  Yannis Manolopoulos,et al.  A Data Mining Algorithm for Generalized Web Prefetching , 2003, IEEE Trans. Knowl. Data Eng..

[9]  Fillia Makedon,et al.  Mining the Most Interesting Web Access Associations , 2000, WebNet.

[10]  Jon M. Kleinberg,et al.  The Web as a Graph: Measurements, Models, and Methods , 1999, COCOON.

[11]  Yannis Manolopoulos,et al.  Exploiting Web Log Mining for Web Cache Enhancement , 2001, WEBKDD.

[12]  Piotr Indyk,et al.  Enhanced hypertext categorization using hyperlinks , 1998, SIGMOD '98.

[13]  Philip S. Yu,et al.  Data mining for path traversal patterns in a web environment , 1996, Proceedings of 16th International Conference on Distributed Computing Systems.

[14]  Hendrik Blockeel,et al.  Web mining research: a survey , 2000, SKDD.

[15]  Renata Iváncsy,et al.  A time- and memory-efficient frequent itemset discovering algorithm for association rule mining , 2006, Int. J. Comput. Appl. Technol..

[16]  Ramez Elmasri,et al.  Learning Rules for Conceptual Structure on the Web , 2004, Journal of Intelligent Information Systems.

[17]  Torben Bach Pedersen,et al.  Evaluating the markov assumption for web usage mining , 2003, WIDM '03.

[18]  Soumen Chakrabarti,et al.  Data mining for hypertext: a tutorial survey , 2000, SKDD.

[19]  Qiang Yang,et al.  Web-Log Mining for Predictive Web Caching , 2003, IEEE Trans. Knowl. Data Eng..

[20]  Jaideep Srivastava,et al.  Data Preparation for Mining World Wide Web Browsing Patterns , 1999, Knowledge and Information Systems.

[21]  Mark Levene,et al.  Data Mining of User Navigation Patterns , 1999, WEBKDD.

[22]  Yoav Shoham,et al.  Learning Information Retrieval Agents: Experiments with Automated Web Browsing , 1995 .

[23]  Yanchun Zhang,et al.  Efficiently computing frequent tree-like topology patterns in a Web environment , 1999, Proceedings Technology of Object-Oriented Languages and Systems (Cat. No.PR00393).

[24]  Jaideep Srivastava,et al.  Web usage mining: discovery and applications of usage patterns from Web data , 2000, SKDD.

[25]  István Vajk,et al.  Efficient sequential pattern mining algorithms , 2005 .

[26]  Mohammed J. Zaki,et al.  Web Usage Mining — Languages and Algorithms , 2003 .

[27]  Jaideep Srivastava,et al.  Revised Papers from the Third International Workshop on Mining Web Log Data Across All Customers Touch Points , 2001 .

[28]  Yannis Manolopoulos,et al.  Mining patterns from graph traversals , 2001, Data Knowl. Eng..

[29]  Jiawei Han,et al.  Discovering Web access patterns and trends by applying OLAP and data mining technology on Web logs , 1998, Proceedings IEEE International Forum on Research and Technology Advances in Digital Libraries -ADL'98-.

[30]  Ali A. Ghorbani,et al.  The reconstruction of user sessions from a server log using improved time-oriented heuristics , 2004, Proceedings. Second Annual Conference on Communication Networks and Services Research, 2004..