Building a semi intelligent web cache with light weight machine learning

This paper proposes a novel admission and replacement technique for web caching, which utilizes the multinomial logistic regression (MLR) as classifier. The MLR model is trained for classifying the web cache's object worthiness. The parameter object worthiness is a polytomous (discrete) variable which depends on the traffic and the object properties. Using worthiness as a key, an adaptive caching model is proposed. Trace driven simulations are used to evaluate the performance of the scheme. Test results show that a properly trained MLR model yields good cache performance in terms of hit ratios and disk space utilization, making the proposed scheme as a viable semi intelligent caching scheme.

[1]  Azer Bestavros,et al.  Popularity-aware greedy dual-size Web proxy caching algorithms , 2000, Proceedings 20th IEEE International Conference on Distributed Computing Systems.

[2]  Qiang Yang,et al.  Web-Log Mining for Predictive Web Caching , 2003, IEEE Trans. Knowl. Data Eng..

[3]  László Böszörményi,et al.  A survey of Web cache replacement strategies , 2003, CSUR.

[4]  Yong Tan,et al.  An admission-control technique for delay reduction in proxy caching , 2009, Decis. Support Syst..

[5]  Hala ElAarag,et al.  Web proxy cache replacement scheme based on back-propagation neural network , 2008, J. Syst. Softw..

[6]  Lev N. Shchur,et al.  On the universality of rank distributions of website popularity , 2004, Comput. Networks.

[7]  Ravi Kumar,et al.  Self-similarity in the web , 2001, TOIT.

[8]  David W. Hosmer,et al.  Applied Logistic Regression , 1991 .

[9]  Vir V. Phoha,et al.  An Adaptive Web Cache Access Predictor Using Neural Network , 2002, IEA/AIE.

[10]  Yu Hen Hu,et al.  Logistic Regression in an Adaptive Web Cache , 1999, IEEE Internet Comput..

[11]  Keqiu Li,et al.  A Minimal Access Cost-Based Multimedia Object Replacement Algorithm , 2007, 2007 IEEE International Parallel and Distributed Processing Symposium.

[12]  Li Fan,et al.  Web caching and Zipf-like distributions: evidence and implications , 1999, IEEE INFOCOM '99. Conference on Computer Communications. Proceedings. Eighteenth Annual Joint Conference of the IEEE Computer and Communications Societies. The Future is Now (Cat. No.99CH36320).

[13]  Ajith Abraham,et al.  Intelligent Web Caching Using Neurocomputing and Particle Swarm Optimization Algorithm , 2008, 2008 Second Asia International Conference on Modelling & Simulation (AMS).

[14]  Sandy Irani,et al.  Cost-Aware WWW Proxy Caching Algorithms , 1997, USENIX Symposium on Internet Technologies and Systems.

[15]  Hao Chen,et al.  A Least Grade Page Replacement Algorithm for Web Cache Optimization , 2008, First International Workshop on Knowledge Discovery and Data Mining (WKDD 2008).

[16]  Xin Chen,et al.  A Popularity-Based Prediction Model for Web Prefetching , 2003, Computer.