"An Inclusive Survey on Data Preprocessing Methods Used in Web Usage Mining"

Several data mining techniques are applied in Web usage mining applications to discover user access patterns from Web log data. Web-based applications need this knowledge to understand their users and provide better services. Web usage mining is one type of Web mining, which is the technique of extracting knowledge from Web content, structure, and usage. It is a collection of technologies for realizing the potential of extracting valuable knowledge from the World Wide Web and its usage patterns. Web mining makes it possible to find relevant results in Web data, including Web documents, hyperlinks between documents, and website usage logs. There are three main areas of Web mining research: content, structure, and usage. This paper provides an overview of previous and existing work in all three areas, and also gives an overview of the data preprocessing steps used in Web usage mining: data cleaning, user identification, session identification, transaction identification, and path completion.
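As an illustration of the first three preprocessing steps named above, the following is a minimal Python sketch, not the method of any surveyed paper. It assumes the log is in Apache/NCSA Combined Log Format, identifies a user by the pair (IP address, user agent), and starts a new session after 30 minutes of inactivity. The log format, the ignored file suffixes, the "bot" substring check, the timeout value, and all function names are assumptions made for this example only.

```python
import re
from collections import defaultdict
from datetime import datetime, timedelta

# Assumption: Apache/NCSA Combined Log Format; referrer and agent are optional.
LOG_RE = re.compile(
    r'(?P<ip>\S+) \S+ \S+ \[(?P<ts>[^\]]+)\] '
    r'"(?P<method>\S+) (?P<url>\S+)[^"]*" (?P<status>\d{3}) \S+'
    r'(?: "(?P<referrer>[^"]*)" "(?P<agent>[^"]*)")?'
)
# Assumption: embedded resources are irrelevant to usage patterns.
IGNORED_SUFFIXES = (".gif", ".jpg", ".jpeg", ".png", ".css", ".js", ".ico")
# Assumption: a 30-minute inactivity gap ends a session.
SESSION_TIMEOUT = timedelta(minutes=30)


def parse(line):
    """Turn one log line into a dict, or None if it does not match."""
    m = LOG_RE.match(line)
    if m is None:
        return None
    rec = m.groupdict()
    rec["ts"] = datetime.strptime(rec["ts"], "%d/%b/%Y:%H:%M:%S %z")
    rec["status"] = int(rec["status"])
    rec["agent"] = rec["agent"] or ""
    return rec


def clean(records):
    """Data cleaning: keep successful GET page requests from non-robots."""
    for r in records:
        if r is None:
            continue
        if r["method"] != "GET" or r["status"] != 200:
            continue
        if r["url"].lower().split("?")[0].endswith(IGNORED_SUFFIXES):
            continue
        if "bot" in r["agent"].lower():  # crude robot filter
            continue
        yield r


def sessionize(records):
    """User and session identification by (IP, agent) and a timeout."""
    by_user = defaultdict(list)
    for r in records:
        by_user[(r["ip"], r["agent"])].append(r)

    sessions = []
    for user, recs in by_user.items():
        recs.sort(key=lambda r: r["ts"])
        current = [recs[0]]
        for prev, cur in zip(recs, recs[1:]):
            if cur["ts"] - prev["ts"] > SESSION_TIMEOUT:
                sessions.append((user, [x["url"] for x in current]))
                current = []
            current.append(cur)
        sessions.append((user, [x["url"] for x in current]))
    return sessions
```

Calling `sessionize(clean(parse(line) for line in lines))` yields a list of `(user, [urls])` pairs. Transaction identification and path completion would build on these sessions and are not covered by this sketch.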
