Using current web page structure to improve prefetching performance

Web prefetching is a technique aimed at reducing user-perceived latencies in the World Wide Web. The spatial locality shown by user accesses makes it possible to predict future accesses from the previous ones. A prefetching engine uses these predictions to prefetch web objects before the user demands them. The existing prediction algorithms achieved an acceptable performance when they were proposed but the high increase in the number of embedded objects per page has reduced their effectiveness in the current web. In this paper, we show that most of the predictions made by the existing algorithms are not useful to reduce the user-perceived latency because these algorithms do not take into account the structure of the current web pages, i.e., an HTML object with several embedded objects. Thus, they predict the accesses to the embedded objects in an HTML after reading the HTML itself. For this reason, the prediction is not made early enough to prefetch the objects and, therefore, there is no latency reduction. In this paper we present the double dependency graph (DDG) algorithm that distinguishes between container objects (HTML) and embedded objects to create a new prediction model according to the structure of the current web. Results show that, for the same number of extra requests to the server, DDG reduces the perceived latency, on average, 40% more than the existing algorithms. Moreover, DDG distributes latency reductions more homogeneously among users.

[1]  Dan Duchamp,et al.  Prefetching Hyperlinks , 1999, USENIX Symposium on Internet Technologies and Systems.

[2]  Ian H. Witten,et al.  Text Compression , 1990, 125 Problems in Text Algorithms.

[3]  Arun Venkataramani,et al.  NPS: A Non-Interfering Deployable Web Prefetching System , 2003, USENIX Symposium on Internet Technologies and Systems.

[4]  Ana Pont,et al.  Web prefetching performance metrics: A survey , 2006, Perform. Evaluation.

[5]  Roy T. Fielding,et al.  Hypertext Transfer Protocol - HTTP/1.1 , 1997, RFC.

[6]  Alexander P. Pons Improving the performance of client Web object retrieval , 2005, J. Syst. Softw..

[7]  Ana Pont,et al.  Modeling continuous changes of the user's dynamic behavior in the WWW , 2005, WOSP '05.

[8]  Xin Chen,et al.  A Popularity-Based Prediction Model for Web Prefetching , 2003, Computer.

[9]  Christos Bouras,et al.  Predictive Prefetching on the Web and Its Potential Impact in the Wide Area , 2004, World Wide Web.

[10]  Zhimin Gu,et al.  An online PPM prediction model for web prefetching , 2007, WIDM '07.

[11]  Wei Lin,et al.  Web prefetching between low-bandwidth clients and proxies: potential and performance , 1999, SIGMETRICS '99.

[12]  Ana Pont,et al.  DDG: An Efficient Prefetching Algorithm for Current Web Generation , 2006, 2006 1st IEEE Workshop on Hot Topics in Web Systems and Technologies.

[13]  Kam-yiu Lam,et al.  Temporal pre-fetching of dynamic web pages , 2006, Inf. Syst..

[14]  Yin-Fu Huang,et al.  Mining web logs to improve hit ratios of prefetching and caching , 2008, Knowl. Based Syst..

[15]  Yannis Manolopoulos,et al.  Prefetching in Content Distribution Networks via Web Communities Identification and Outsourcing , 2007, World Wide Web.

[16]  Jim Griffioen,et al.  Reducing File System Latency using a Predictive Approach , 1994, USENIX Summer.

[17]  Zhimin Gu,et al.  A PPM Prediction Model Based on Stochastic Gradient Descent for Web Prefetching , 2008, 22nd International Conference on Advanced Information Networking and Applications (aina 2008).

[18]  George Pallis,et al.  A clustering-based prefetching scheme on a Web cache environment , 2008, Comput. Electr. Eng..

[19]  Darin Fisher,et al.  Link Prefetching in Mozilla: A Server-Driven Approach , 2003, WCW.

[20]  Yannis Manolopoulos,et al.  A Data Mining Algorithm for Generalized Web Prefetching , 2003, IEEE Trans. Knowl. Data Eng..

[21]  Shen Jun-yi,et al.  A new Markov model for Web access prediction , 2002 .

[22]  Ana Pont,et al.  The Impact of the Web Prefetching Architecture on the Limits of Reducing User's Perceived Latency , 2006, 2006 IEEE/WIC/ACM International Conference on Web Intelligence (WI 2006 Main Conference Proceedings)(WI'06).

[23]  Ajay D. Kshemkalyani,et al.  Objective-optimal algorithms for long-term Web prefetching , 2006, IEEE Transactions on Computers.

[24]  Ana Pont,et al.  How current web generation affects prediction algorithms performance , 2005 .

[25]  Javed I. Khan,et al.  Exploiting Webspace organization for accelerating Web prefetching , 2003, Proceedings IEEE/WIC International Conference on Web Intelligence (WI 2003).

[26]  Ana Pont,et al.  A user-focused evaluation of web prefetching algorithms , 2007, Comput. Commun..

[27]  Alexander P. Pons Object prefetching using semantic links , 2006, DATB.

[28]  Giovanni Squillero,et al.  Dynamic prediction of Web requests , 2003, The 2003 Congress on Evolutionary Computation, 2003. CEC '03..

[29]  Ana Pont,et al.  An experimental framework for testing Web prefetching techniques , 2004 .

[30]  Jeffrey C. Mogul,et al.  Using predictive prefetching to improve World Wide Web latency , 1996, CCRV.

[31]  Ramesh R. Sarukkai,et al.  Link prediction and path analysis using Markov chains , 2000, Comput. Networks.