Efficiency of caches for content distribution on the Internet

Traffic engineering and an economical provisioning of bandwidth is crucial for network providers in times of high competition in broadband access networks. We investigate the efficiency of caching as an option to shorten end-to-end paths and delays while at the same time reducing traffic loads. The portion of HTTP based distribution of cacheable content on the Internet is increasing in recent time. In addition, the favourable effect of Zipf-like access pattern on caches is also confirmed for currently most popular web sites with user generated content. Content delivery (CDN) and peer-to-peer (P2P) networks are distributing a major portion of IP traffic with different impact on caching. P2P traffic is subject to long transport paths although appropriate for caching in principle. CDNs are based on server infrastructures allowing for shorter paths on a global scale on top of network provider platforms. We give a brief overview of the options for deploying caches by content and network providers at different points in the interconnection, backbone or aggregation. The main part of the work focuses on the analysis of replacement strategies with regard to Zipf-like and fixed or slowly varying access pattern. A comparative evaluation shows that least recently used (LRU) essentially differs from caching strategies based on access statistics in terms of the achievable hit rates.

[1]  David R. Choffnes,et al.  Drafting Behind Akamai: Inferring Network Conditions Based on CDN Redirections , 2009, IEEE/ACM Transactions on Networking.

[2]  Cheng Huang,et al.  Understanding hybrid CDN-P2P: why limelight needs its own Red Swoosh , 2008, NOSSDAV.

[3]  Jiangchuan Liu,et al.  Statistics and Social Network of YouTube Videos , 2008, 2008 16th Interntional Workshop on Quality of Service.

[4]  Joachim Charzinski,et al.  Traffic Properties, Client Side Cachability and CDN Usage of Popular Web Sites , 2010, MMB/DFT.

[5]  Gerhard Haßlinger ISP Platforms Under a Heavy Peer-to-Peer Workload , 2005, Peer-to-Peer Systems and Applications.

[6]  Matteo Sereno,et al.  A measurement study supporting P2P file-sharing community models , 2009, Comput. Networks.

[7]  Pablo Rodriguez,et al.  I tube, you tube, everybody tubes: analyzing the world's largest user generated content video system , 2007, IMC '07.

[8]  David Tse,et al.  Probabilistic methods for web caching , 2001, Perform. Evaluation.

[9]  Li Fan,et al.  Web caching and Zipf-like distributions: evidence and implications , 1999, IEEE INFOCOM '99. Conference on Computer Communications. Proceedings. Eighteenth Annual Joint Conference of the IEEE Computer and Communications Societies. The Future is Now (Cat. No.99CH36320).

[10]  Abraham Silberschatz,et al.  P4p: provider portal for applications , 2008, SIGCOMM '08.

[11]  G. Barish,et al.  World Wide Web caching: trends and techniques , 2000, IEEE Commun. Mag..

[12]  Aleksandar Kuzmanovic,et al.  Drafting behind Akamai (travelocity-based detouring) , 2006, SIGCOMM 2006.

[13]  Asit Dan,et al.  An approximate analysis of the LRU and FIFO buffer replacement schemes , 1990, SIGMETRICS '90.

[14]  Mohamed Hefeeda,et al.  Modeling and Caching of Peer-to-Peer Traffic , 2006, Proceedings of the 2006 IEEE International Conference on Network Protocols.

[15]  Christian Scheideler,et al.  Can ISPS and P2P users cooperate for improved performance? , 2007, CCRV.

[16]  Gerhard Haßlinger,et al.  Measurement and Characteristics of Aggregated Traffic in Broadband Access Networks , 2007, International Teletraffic Congress.

[17]  Terence Kelly,et al.  Design, Implementation, and Evaluation of Duplicate Transfer Detection in HTTP , 2004, NSDI.

[18]  Zongpeng Li,et al.  Youtube traffic characterization: a view from the edge , 2007, IMC '07.

[19]  Songqing Chen,et al.  Does internet media traffic really follow Zipf-like distribution? , 2007, SIGMETRICS '07.

[20]  Kensuke Fukuda,et al.  The impact and implications of the growth in residential user-to-user traffic , 2006, SIGCOMM 2006.

[21]  Andrew Odlyzko,et al.  Internet traffic growth: sources and implications , 2003, SPIE ITCom.

[22]  Anja Feldmann,et al.  Revisiting Cacheability in Times of User Generated Content , 2010, 2010 INFOCOM IEEE Conference on Computer Communications Workshops.

[23]  Anirban Mahanti,et al.  Traffic analysis of a Web proxy caching hierarchy , 2000 .

[24]  I. Lazar,et al.  The state of the Internet , 2000 .

[25]  Erol Gelenbe,et al.  A Unified Approach to the Evaluation of a Class of Replacement Algorithms , 1973, IEEE Transactions on Computers.

[26]  Gerhard Haßlinger,et al.  User Access to Popular Data on the Internet and Approaches for IP Traffic Flow Optimization , 2009, ASMTA.