Hybrid Data Aggregation Technique to Categorize the Web Users to Discover Knowledge About the Web Users

Web usage mining is a knowledge discovery technique where a data analyst can discover useful information from the web users’ data. Web contains billions of web pages. The web access behaviour of one web user differs from that of another and also it changes with respect to their temporal property. By analyzing the users’ data, the web administrator can personalize the web pages according to individual web users’ interest. Personalizing the web page gives various advantages in this fast era such as low search time, less data transfer, higher availability of data, lower bandwidth traffic, targeted advertisement and identifying the threaded web users and high web users satisfaction. Due to the above advantages, it is very much essential in the present World Wide Web. Various algorithms, techniques and tools are available in the field of web usage mining. Although there are various techniques, algorithms and tools developed related to web usage mining, new techniques are required to make the discovery of knowledge more accurate. In this paper, a novel technique is proposed by using various methods such as web log, web ranking, web rating and web review based method to identify the success rate of various web pages and summarize the value to identify the accurate success rate of each web page. The success rate is normalized and aggregated into three categories for personalizing the web user. Personalizing the web user based on grouping relevant web access behaviour reduces the calculation complexity. It is very effective in very large websites. This technique is very much effective for analyzing the outreach of web advertisement to the web users.

[1]  Cheng Fang,et al.  Request Dependency Graph: A Model for Web Usage Mining in Large-Scale Web of Things , 2016, IEEE Internet of Things Journal.

[2]  Jiawei Han,et al.  Data Mining: Concepts and Techniques , 2000 .

[3]  Huan Liu,et al.  An Unsupervised Feature Selection Framework for Social Media Data , 2014, IEEE Transactions on Knowledge and Data Engineering.

[4]  Ed Huai-hsin Chi Improving Web Usability Through Visualization , 2002, IEEE Internet Comput..

[5]  Georgios Paliouras,et al.  Personalizing Web Directories with the Aid of Web Usage Data , 2010, IEEE Transactions on Knowledge and Data Engineering.

[6]  Ajith Abraham i-Miner: a Web usage mining framework using hierarchical intelligent systems , 2003, The 12th IEEE International Conference on Fuzzy Systems, 2003. FUZZ '03..

[7]  Maguelonne Teisseire,et al.  Web usage mining: extracting unexpected periods from web logs , 2005, Data Mining and Knowledge Discovery.

[8]  Tzung-Pei Hong,et al.  Web usage mining with intentional browsing data , 2008, Expert Syst. Appl..

[9]  Bidisha Roy,et al.  Dynamic Recommendation System Using Web Usage Mining for E-commerce Users , 2015 .

[10]  A. Joshi,et al.  Web mining: research and practice , 2004, Computing in Science & Engineering.

[11]  Mamata Jenamani,et al.  Online Customized Index Synthesis in Commercial Web Sites , 2002, IEEE Intell. Syst..

[12]  Jiming Liu,et al.  Characterizing Web usage regularities with information foraging agents , 2004, IEEE Transactions on Knowledge and Data Engineering.

[13]  James Nga-Kwok Liu,et al.  iJADE Web-miner: an intelligent agent framework for Internet shopping , 2004, IEEE Transactions on Knowledge and Data Engineering.

[14]  Anjali Patel,et al.  Web usage Mining:Frequent Pattern Generation using Association Rule Mining and Clustering , 2015 .

[15]  Yoon Ho Cho,et al.  Application of Web usage mining and product taxonomy to collaborative recommendations in e-commerce , 2004, Expert Syst. Appl..

[16]  Jie Lu,et al.  Web-Page Recommendation Based on Web Usage and Domain Knowledge , 2014 .

[17]  Chhavi Rana,et al.  A Study of Web Usage Mining Research Tools , 2012 .

[18]  Jeffrey Heer,et al.  WebQuilt : A Proxy-based Approach to Remote Web Usability Testing , 2001 .

[19]  Pinar Senkul,et al.  Improving pattern quality in web usage mining by using semantic information , 2012, Knowledge and Information Systems.

[20]  Chabane Djeraba,et al.  Toward Recommendation Based on Ontology-Powered Web-Usage Mining , 2007, IEEE Internet Computing.

[21]  Darshak B. Mehta,et al.  Web Usage Mining Using Association Rule Mining on Clustered Data for Pattern Discovery , 2013 .

[22]  Balaji Padmanabhan,et al.  GHIC: a hierarchical pattern-based clustering algorithm for grouping Web transactions , 2005, IEEE Transactions on Knowledge and Data Engineering.

[23]  Maguelonne Teisseire,et al.  Real time Web usage mining: a heuristic based distributed miner , 2001, Proceedings of the Second International Conference on Web Information Systems Engineering.

[24]  Jeff Tian,et al.  Improving Web Navigation Usability by Comparing Actual and Anticipated Usage , 2015, IEEE Transactions on Human-Machine Systems.

[25]  V. Valli Kumari,et al.  Understanding User Behavior using Web Usage Mining , 2010 .

[26]  Anour F. A. Dafa-Alla,et al.  Extracting knowledge from web server logs using web usage mining , 2013, 2013 INTERNATIONAL CONFERENCE ON COMPUTING, ELECTRICAL AND ELECTRONIC ENGINEERING (ICCEEE).

[27]  Shian-Shyong Tseng,et al.  Data types generalization for data mining algorithms , 1999, IEEE SMC'99 Conference Proceedings. 1999 IEEE International Conference on Systems, Man, and Cybernetics (Cat. No.99CH37028).

[28]  Xuegang Hu,et al.  Discovery of user frequent access patterns on Web usage mining , 2004, 8th International Conference on Computer Supported Cooperative Work in Design.

[29]  Qingtian Han,et al.  Research of an Algorithm Based on Web Usage Mining , 2009, 2009 International Workshop on Intelligent Systems and Applications.

[30]  Vladimir B. Bajic,et al.  An Intelligent System for Vertebrate Promoter Recognition , 2002, IEEE Intell. Syst..

[31]  Brigitte Trousse,et al.  Advanced data preprocessing for intersites Web usage mining , 2004, IEEE Intelligent Systems.

[32]  Sung Ho Ha,et al.  Helping Online Customers Decide through Web Personalization , 2002, IEEE Intell. Syst..

[33]  Georgios Paliouras,et al.  Large-Scale Mining of Usage Data on Web Sites , 2000 .

[34]  Siu Cheung Hui,et al.  Generation of Personalized Ontology Based on Consumer Emotion and Behavior Analysis , 2012, IEEE Transactions on Affective Computing.

[35]  Antonio Badia,et al.  A Web Usage Mining Framework for Mining Evolving User Profiles in Dynamic Web Sites , 2008, IEEE Transactions on Knowledge and Data Engineering.

[36]  Philip S. Yu,et al.  SpeedTracer: A Web Usage Mining and Analysis Tool , 1998, IBM Syst. J..

[37]  Jaideep Srivastava,et al.  Web usage mining: discovery and applications of usage patterns from Web data , 2000, SKDD.

[38]  Georgios Paliouras,et al.  Web Usage Mining as a Tool for Personalization: A Survey , 2003, User Modeling and User-Adapted Interaction.

[39]  Sung Ho Ha,et al.  Fuzzy Web Ad Selector Based on Web Usage Mining , 2003, IEEE Intell. Syst..