Semantic Web Prefetching Scheme using Naïve Bayes Classifier

Web prefetching is an effective technique to minimize user’s web access latency. Web page content provides useful information for making the predictions that is used to perform prefetching of web objects. In this paper we propose semantic prefetching scheme that uses anchor texts present in the web page to make effective predictions. The scheme applies Naïve Bayes Classifier for computing the probability values of anchor texts, based on which the predictions are generated and given as input to the prefetch unit. Predictions are dynamic and are based on the browsing pattern exhibited by the user in a session. The browsing pattern of user should be specific towards a particular topic of interest to achieve good hit rate. When user has long browsing sessions, predictions and prefetching are more effective and helps to increase the hit rate and minimize access latency.

[1]  Kevin S. McCurley,et al.  Analysis of anchor text for web search , 2003, SIGIR.

[2]  Georgios Paliouras,et al.  Web Usage Mining as a Tool for Personalization: A Survey , 2003, User Modeling and User-Adapted Interaction.

[3]  Anoop Gupta,et al.  Parallel computer architecture - a hardware / software approach , 1998 .

[4]  Jason J. Jung Collaborative Web Browsing Based on Semantic Extraction of User Interests with Bookmarks , 2005, J. Univers. Comput. Sci..

[5]  Alexander P. Pons Semantic prefetching objects of slower web site pages , 2006, J. Syst. Softw..

[6]  Hai Zhuge,et al.  Active e-document framework ADF: model and tool , 2003, Inf. Manag..

[7]  Wei-Ying Ma,et al.  A unified framework for Web link analysis , 2002, Proceedings of the Third International Conference on Web Information Systems Engineering, 2002. WISE 2002..

[8]  Jaideep Srivastava,et al.  Discovery of Interesting Usage Patterns from Web Data , 1999, WEBKDD.

[9]  Stephen E. Robertson,et al.  Effective site finding using link anchor information , 2001, SIGIR '01.

[10]  Ioan Pop,et al.  An approach of the Naive Bayes classifier for the document classification 1 , 2006 .

[11]  Jon M. Kleinberg,et al.  Automatic Resource Compilation by Analyzing Hyperlink Structure and Associated Text , 1998, Comput. Networks.

[12]  Cheng-Zhong Xu,et al.  Neural nets based predictive prefetching to tolerate WWW latency , 2000, Proceedings 20th IEEE International Conference on Distributed Computing Systems.

[13]  Thomas J. Watson,et al.  An empirical study of the naive Bayes classifier , 2001 .

[14]  Jason J. Jung Semantic Preprocessing of Web Request Streams for Web Usage Mining , 2005, J. Univers. Comput. Sci..

[15]  Chris H. Q. Ding,et al.  PageRank, HITS and a unified framework for link analysis , 2002, SIGIR '02.

[16]  Sergey Brin,et al.  The Anatomy of a Large-Scale Hypertextual Web Search Engine , 1998, Comput. Networks.

[17]  Brian D. Davison Predicting web actions from HTML content , 2002, HYPERTEXT '02.

[18]  Paul Barford,et al.  The network effects of prefetching , 1998, Proceedings. IEEE INFOCOM '98, the Conference on Computer Communications. Seventeenth Annual Joint Conference of the IEEE Computer and Communications Societies. Gateway to the 21st Century (Cat. No.98.

[19]  Brian D. Davison Topical locality in the Web , 2000, SIGIR '00.

[20]  Cheng-Zhong Xu,et al.  A keyword-based semantic prefetching approach in Internet news services , 2004, IEEE Transactions on Knowledge and Data Engineering.

[21]  A. K. Sharma,et al.  Analyzing Anchor-Links to Extract Semantic Inferences of a Web Page , 2007, 10th International Conference on Information Technology (ICIT 2007).

[22]  Mario Cannataro,et al.  The knowledge grid , 2003, CACM.