Knowledge Discovery from Web Usage Data: An Efficient Implementation of Web Log Preprocessing Techniques

Web Usage Mining (WUM) refers to extraction of knowledge from the web log data by application of data mining techniques. WUM generally consists of Web Log Preprocessing, Web Log Knowledge Discovery and Web Log Pattern Analysis. Web Log Preprocessing is a major and complex task of WUM. Elimination of noise and irrelevant data, thereby reducing the burden on the system leads to efficient discovery of patterns by further stages of WUM. In this paper, Web Log Preprocessing Methods to efficiently identify users and user sessions have been implemented and results have been analyzed.

[1]  Arumugam Gurusamy,et al.  Optimal Algorithms for Generation of User Session Sequences Using Server Side Web User Logs , 2009, 2009 International Conference on Network and Service Security.

[2]  Bing Liu,et al.  Web Data Mining: Exploring Hyperlinks, Contents, and Usage Data , 2006, Data-Centric Systems and Applications.

[3]  Ibrahim Türkoglu,et al.  Creating meaningful data from web logs for improving the impressiveness of a website by using path analysis method , 2009, Expert Syst. Appl..

[4]  K. Sudheer Reddy,et al.  Preprocessing the web server logs: an illustrative approach for effective usage mining , 2012, SOEN.

[5]  K. Sudheer Reddy,et al.  An effective data preprocessing method for Web Usage Mining , 2013, 2013 International Conference on Information Communication and Embedded Systems (ICICES).

[6]  R.M. Suresh,et al.  An Overview of Data Preprocessing in Data and Web Usage Mining , 2007, 2006 1st International Conference on Digital Information Management.

[7]  P. Sumathi,et al.  Novel pre-processing technique for web log mining by removing global noise and web robots , 2012, 2012 NATIONAL CONFERENCE ON COMPUTING AND COMMUNICATION SYSTEMS.

[8]  Supriya Kumar De,et al.  Clustering web transactions using rough approximation , 2004, Fuzzy Sets Syst..

[9]  Zhang Huiying,et al.  An intelligent algorithm of data pre-processing in Web usage mining , 2004, Fifth World Congress on Intelligent Control and Automation (IEEE Cat. No.04EX788).

[10]  N. V. Subba Reddy,et al.  Knowledge Discovery from Web Usage Data: A Survey of Web Usage Pre-processing Techniques , 2010, BAIP.

[11]  Brijesh Bakariya,et al.  "An Inclusive Survey on Data Preprocessing Methods Used in Web Usage Mining" , 2012, BIC-TA.

[12]  Theint Theint Aye,et al.  Web log cleaning for mining of web usage patterns , 2011, 2011 3rd International Conference on Computer Research and Development.