A Survey of Web Caching and Prefetching

Web caching and prefetching are popular techniques that play a key role in improving Web performance by keeping Web objects that are likely to be requested in the near future closer to the client. Web caching can work independently or be integrated with Web prefetching. The two techniques complement each other: Web caching exploits temporal locality to predict that previously requested objects will be revisited, while Web prefetching exploits spatial locality to predict which related Web objects will be requested next. This paper reviews the principles of Web caching and prefetching and some existing approaches. Conventional and intelligent Web caching techniques are investigated and discussed. Web prefetching techniques are summarized and classified, and the limitations of these approaches are compared. The paper also presents and discusses studies that consider the impact of integrating Web caching and Web prefetching.
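
The complementary roles of temporal and spatial locality can be illustrated with a minimal sketch. It is not taken from any of the surveyed approaches: it assumes an LRU replacement policy for the cache and a simple first-order "which URL followed which" counter for the prefetcher, and the names `LRUCache`, `NextPagePredictor`, and `replay` are hypothetical, introduced only for this illustration.

```python
from collections import Counter, OrderedDict, defaultdict


class LRUCache:
    """Fixed-capacity cache that evicts the least recently used object."""

    def __init__(self, capacity):
        self.capacity = capacity
        self.store = OrderedDict()

    def __contains__(self, url):
        return url in self.store

    def touch(self, url):
        """Mark an already cached url as most recently used."""
        self.store.move_to_end(url)

    def put(self, url):
        """Insert url, evicting the least recently used entry if full."""
        self.store[url] = True
        self.store.move_to_end(url)
        if len(self.store) > self.capacity:
            self.store.popitem(last=False)


class NextPagePredictor:
    """Assumed first-order model: counts which URL followed which."""

    def __init__(self):
        self.followers = defaultdict(Counter)

    def observe(self, prev_url, url):
        self.followers[prev_url][url] += 1

    def predict(self, url):
        """Return the most frequent follower of url, or None if unseen."""
        counts = self.followers.get(url)
        if not counts:
            return None
        return counts.most_common(1)[0][0]


def replay(requests, capacity, prefetch=True):
    """Replay a request sequence and return the number of cache hits."""
    cache, predictor = LRUCache(capacity), NextPagePredictor()
    hits, prev = 0, None
    for url in requests:
        if url in cache:          # temporal locality: a revisit is a hit
            hits += 1
            cache.touch(url)
        else:
            cache.put(url)
        if prev is not None:
            predictor.observe(prev, url)
        if prefetch:              # spatial locality: fetch the likely next object
            guess = predictor.predict(url)
            if guess is not None and guess not in cache:
                cache.put(guess)
        prev = url
    return hits
```

On a toy cyclic trace such as `["a", "b", "c"] * 3` with a capacity of 2, `replay(..., prefetch=False)` yields 0 hits because LRU evicts each object just before it is requested again, whereas `replay(..., prefetch=True)` yields 5 hits once the predictor has seen one full cycle. The trace is contrived to make the effect visible and says nothing about real workloads.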
