A Novel Technique for Sessions Identification in Web Usage Mining Preprocessing

The growth of World Wide Web is incredible as it can be seen in present days. Users find it very difficult to extract useful and relevant information from the huge amount of information. The problems can be solved by Web Usage Mining which involves preprocessing, pattern discovery and pattern analysis. Preprocessing is an important process which converts raw web log data into transactions. Application of mining techniques to group user‟s behavior for personalization is effectively done on transactions constructed from sessions. Sessionization is the identification of sessions and is defined as a set of pages visited by the same user within the duration of one particular visit to a web-site. In this research paper, a new technique for identifying sessions is being proposed for extraction of user patterns. The experimental results show that the proposed Session Identification technique is an effective one to construct sessions accurately.

[1]  Yan Li,et al.  Research on Path Completion Technique in Web Usage Mining , 2008, 2008 International Symposium on Computer Science and Computational Technology.

[2]  Wei Liu,et al.  Research for Web Usage Mining Model , 2006, 2006 International Conference on Computational Inteligence for Modelling Control and Automation and International Conference on Intelligent Agents Web Technologies and International Commerce (CIMCA'06).

[3]  Murat Ali Bayir,et al.  Smart Miner: a new framework for mining large scale web usage data , 2009, WWW '09.

[4]  Pablo E. Román,et al.  Web User Session Reconstruction Using Integer Programming , 2008, 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology.

[5]  Jaideep Srivastava,et al.  Data Preparation for Mining World Wide Web Browsing Patterns , 1999, Knowledge and Information Systems.

[6]  Masahiro Terabe,et al.  A Novel Web Usage Mining Method - Mining and Clustering of DAG Access Patterns Considering Page Browsing Time , 2008, WEBIST.

[7]  Jaideep Srivastava,et al.  Web mining: information and pattern discovery on the World Wide Web , 1997, Proceedings Ninth IEEE International Conference on Tools with Artificial Intelligence.

[8]  Myra Spiliopoulou,et al.  A Framework for the Evaluation of Session Reconstruction Heuristics in Web-Usage Analysis , 2003, INFORMS J. Comput..

[9]  Murat Ali Bayir,et al.  Discovering More Accurate Frequent Web Usage Patterns , 2008, ArXiv.

[10]  Zhang Huiying,et al.  An intelligent algorithm of data pre-processing in Web usage mining , 2004, Fifth World Congress on Intelligent Control and Automation (IEEE Cat. No.04EX788).

[11]  Marco Furini,et al.  International Journal of Computer and Applications , 2010 .

[12]  Mehdi Heydari,et al.  A graph-based web usage mining method considering client side data , 2009, 2009 International Conference on Electrical Engineering and Informatics.

[13]  Siu Cheung Hui,et al.  An Effective Approach for Periodic Web Personalization , 2006, 2006 IEEE/WIC/ACM International Conference on Web Intelligence (WI 2006 Main Conference Proceedings)(WI'06).

[14]  Javier Lorenzo,et al.  A Tool for Web Usage Mining , 2007, IDEAL.

[15]  Jaideep Srivastava,et al.  Automatic personalization based on Web usage mining , 2000, CACM.