Discovering potential user browsing behaviors using custom-built apriori algorithm

Most of the organizations put information on the web because they want it to be seen by the world. Their goal is to have visitors come to the site, feel comfortable and stay a while and try to know completely about the running organization. As educational system increasingly requires data mining, the opportunity arises to mine the resulting large amounts of student information for hidden useful information (patterns like rule, clustering, and classification, etc). The education domain offers ground for many interesting and challenging data mining applications like astronomy, chemistry, engineering, climate studies, geology, oceanography, ecology, physics, biology, health sciences and computer science. Collecting the interesting patterns using the required interestingness measures, which help us in discovering the sophisticated patterns that are ultimately used for developing the site. We study the application of data mining to educational log data collected from Guru Nanak Institute of Technology, Ibrahimpatnam, India. We have proposed a custom-built apriori algorithm to find the effective pattern analysis. Finally, analyzing web logs for usage and access trends can not only provide important information to web site developers and administrators, but also help in creating adaptive web sites.

[1]  Mike Thelwall,et al.  Web log file analysis: backlinks and queries , 2001, Aslib Proc..

[2]  Victor Ciesielski,et al.  Data Mining of Web Access Logs From an Academic Web Site , 2003, HIS.

[3]  Yongjian Fu,et al.  A Framework for Personal Web Usage Mining , 2002, International Conference on Internet Computing.

[4]  Jyh-haw Yeh,et al.  World Wide Web Usage Mining Systems and Technologies , 2003 .

[5]  István Vajk,et al.  Different Aspects of Web Log Mining , 2005 .

[6]  Jaideep Srivastava,et al.  Web usage mining: discovery and applications of usage patterns from Web data , 2000, SKDD.

[7]  Zhenglu Yang,et al.  An Effective System for Mining Web Log , 2006, APWeb.

[8]  Wang Tong,et al.  Web Log Mining by an Improved AprioriAll Algorithm , 2007 .

[9]  Ramakrishnan Srikant,et al.  Mining web logs to improve website organization , 2001, WWW '01.

[10]  Yves Lechevallier,et al.  Mining Web Usage Data for Discovering Navigation Clusters , 2006, 11th IEEE Symposium on Computers and Communications (ISCC'06).

[11]  Mitesh Sharma,et al.  Efficient Support Coupled Frequent Pattern Mining Over Progressive Databases , 2010, ArXiv.

[12]  Osmar R. Zaïane,et al.  Web Usage Mining for a Better Web-Based Learning Environment , 2001 .

[13]  Wei-Ying Ma,et al.  Log mining to improve the performance of site search , 2002, Proceedings of the Third International Conference on Web Information Systems Engineering (Workshops), 2002..

[14]  MAGDALINI EIRINAKI,et al.  Web mining for web personalization , 2003, TOIT.

[15]  Pier Luca Lanzi,et al.  Recent Developments in Web Usage Mining Research , 2003, DaWaK.

[16]  Giorgio Maria Di Nunzio,et al.  Web Log Mining : A Study of User Sessions , 2007 .

[17]  Sujni Paul,et al.  An Optimized Distributed Association Rule Mining Algorithm in Parallel and Distributed Data Mining with XML Data for Improved Response Time , 2010 .