Prioritizing web links based on web usage and content data

Web has grown enormously and is still growing rapidly day by day. With this huge amount of information in the Web it has become difficult for the search engines to retrieve the required and relevant information efficiently. Web mining techniques, using different approaches, have contributed a lot in providing the relevant information to the user query. This paper introduces a new method for prioritizing the Web pages based on Web usage and Web content data. The proposed method uses Genetic Algorithm for providing good quality Web pages as a result of user query. Prioritization of Web pages falls in the category of NP-complete problems. Genetic algorithm is used to deal with this. The method includes the parameters from both Web usage and Web content mining. Experimental results show that the proposed approach performed better than the existing approach.

[1]  Harris Wu,et al.  The effects of fitness functions on genetic programming-based ranking discovery forWeb search , 2004, J. Assoc. Inf. Sci. Technol..

[2]  Jaideep Srivastava,et al.  Web usage mining: discovery and applications of usage patterns from Web data , 2000, SKDD.

[3]  Nicolas Monmarché,et al.  GeniMiner: Web Mining with a Genetic-Based Algorithm , 2002, ICWI.

[4]  Chang Wook Ahn,et al.  On the practical genetic algorithms , 2005, GECCO '05.

[5]  Tom V. Mathew Genetic Algorithm , 2022 .

[6]  Melanie Mitchell,et al.  An introduction to genetic algorithms , 1996 .

[7]  Weiguo Fan,et al.  Genetic Programming-Based Discovery of Ranking Functions for Effective Web Search , 2005, J. Manag. Inf. Syst..

[8]  Ajith Abraham,et al.  Web usage mining using artificial ant colony clustering and linear genetic programming , 2003, The 2003 Congress on Evolutionary Computation, 2003. CEC '03..

[9]  Goldberg,et al.  Genetic algorithms , 1993, Robust Control Systems with Genetic Algorithms.

[10]  Dell Zhang,et al.  A novel Web usage mining approach for search engines , 2002, Comput. Networks.

[11]  Xin Jin,et al.  Web usage mining based on probabilistic latent semantic analysis , 2004, KDD.

[12]  S. Ramkumar A Web Usage Mining Framework for Mining Evolving User Profiles in Dynamic Web Sites , 2014 .

[13]  A Young Data mining, text mining and their business applications , 2005 .

[14]  Zbigniew Michalewicz,et al.  Fundamentals of genetic algorithms , 2000 .

[15]  Kobra Etminani,et al.  Web usage mining: Discovery of the users' navigational patterns using SOM , 2009, 2009 First International Conference on Networked Digital Technologies.

[16]  Ramón García-Martínez,et al.  Web Usage Mining Using Self Organized Maps , 2007 .

[17]  Angus R. Simpson,et al.  Genetic algorithms compared to other techniques for pipe optimization , 1994 .

[18]  A. Tjoa,et al.  Information and Communication Technologies in Tourism , 1996, Springer Vienna.

[19]  Sankar K. Pal,et al.  Web mining in soft computing framework: relevance, state of the art and future directions , 2002, IEEE Trans. Neural Networks.

[20]  Sebastián Ventura,et al.  Rule Discovery In Web-based EducationalSystems Using Grammar-Based GeneticProgramming , 2005 .

[21]  Mohamed A. El-Sharkawi,et al.  Fundamentals of Genetic Algorithms , 2008 .

[22]  Chang-Chun Lin,et al.  Optimal Web site reorganization considering information overload and search depth , 2006, Eur. J. Oper. Res..

[23]  Maria Lexhagen,et al.  Web Usage Mining in Tourism - A Query Term Analysis and Clustering Approach , 2010, ENTER.

[24]  Mahmudur Rahman,et al.  Pattern Discovery of Web Usage Mining , 2009, 2009 International Conference on Computer Technology and Development.