Content Delivery Networks: State of the Art, Insights, and Imperatives

Over the last decades, users have witnessed the growth and maturity of the Internet which has caused enormous growth in network traffic, driven by the rapid acceptance of broadband access, the increases in systems complexity, and the content richness. The over-evolving nature of the Internet brings new challenges in managing and delivering content to users, since for example, popular Web services often suffer congestion and bottlenecks due to the large demands posed on their services. Such a sudden spike in Web content requests (e.g. the one occurred during the 9/11 incident in USA) is often termed as flash crowds [14] or SlashDot [11] effects. It may cause heavy workload on particular Web server(s), and as a result a “hotspot” [14] can be generated. Coping with such unexpected demand causes significant strain on a Web server and eventually the Web servers are totally overwhelmed with the sudden increase in traffic, and the Web site holding the content becomes temporarily unavailable. A Content Delivery Network (CDN) [47, 51, 54, 61, 63] is a collaborative collection of network elements spanning the Internet, where content is replicated over several mirrored Web servers in order to perform transparent and effective delivery of content to the end users. Collaboration among distributed CDN components can occur over nodes in both homogeneous and heterogeneous environments. CDNs have evolved to overcome the inherent limitations of the Internet in terms of user perceived Quality of Service (QoS) when accessing Web content. They provide services that improve network performance by maximizing bandwidth, improving

[1]  Guillaume Urvoy-Keller,et al.  From content distribution networks to content networks - issues and challenges , 2006, Comput. Commun..

[2]  Gary Scott Malkin,et al.  Traceroute Using an IP Option , 1993, RFC.

[3]  Alexander S. Szalay,et al.  The world-wide telescope , 2001, CACM.

[4]  George Pallis,et al.  Content Delivery Networks: Status and Trends , 2003, IEEE Internet Comput..

[5]  Jaideep Srivastava,et al.  Automatic personalization based on Web usage mining , 2000, CACM.

[6]  C. Linnhoff-Popien,et al.  A profit maximizing distribution strategy for context-aware services , 2005, Second IEEE International Workshop on Mobile Commerce and Services.

[7]  Yakov Rekhter,et al.  A Border Gateway Protocol 4 (BGP-4) , 1994, RFC.

[8]  Robert Cole,et al.  Computer Communications , 1982, Springer New York.

[9]  George Pallis,et al.  Insight and perspectives for content delivery networks , 2006, CACM.

[10]  Guillaume Pierre,et al.  Globule: a collaborative content delivery network , 2006, IEEE Communications Magazine.

[11]  Wei-Ying Ma,et al.  Content Services Network: The Architecture and Protocols , 2001 .

[12]  Krishna P. Gummadi,et al.  An analysis of Internet content delivery systems , 2002, OPSR.

[13]  Rajkumar Buyya,et al.  An architecture for virtual organization (VO)-based effective peering of content delivery networks , 2007, UPGRADE '07.

[14]  Patrick Valduriez,et al.  Principles of Distributed Database Systems , 1990 .

[15]  Ian Cooper,et al.  Internet Web Replication and Caching Taxonomy , 2001, RFC.

[16]  Fred Douglis,et al.  CDN brokering , 2002, Comput. Commun..

[17]  Mikel Izal,et al.  Dissecting BitTorrent: Five Months in a Torrent's Lifetime , 2004, PAM.

[18]  Dinesh C. Verma,et al.  Content Distribution Networks , 2002 .

[19]  Gang Peng,et al.  CDN: Content Distribution Network , 2004, ArXiv.

[20]  Arun Venkataramani,et al.  The potential costs and benefits of long-term prefetching for content distribution , 2002, Comput. Commun..

[21]  Hilarie Orman,et al.  Policy, Authorization, and Enforcement Requirements of the Open Pluggable Edge Services (OPES) , 2004, RFC.

[22]  Larry L. Peterson,et al.  The dark side of the Web , 2004, Comput. Commun. Rev..

[23]  Larry L. Peterson,et al.  Reliability and Security in the CoDeeN Content Distribution Network , 2004, USENIX Annual Technical Conference, General Track.

[24]  Diomidis Spinellis,et al.  A survey of peer-to-peer content distribution technologies , 2004, CSUR.

[25]  Jussi Kangasharju,et al.  Object replication strategies in content distribution networks , 2002, Comput. Commun..

[26]  Rajkumar Buyya,et al.  Content Delivery Networks , 2008 .

[27]  Markus Hofmann,et al.  Content Networking - Architecture, Protocols, and Practice , 2005, The Morgan Kaufmann series in networking.

[28]  James F. Doyle,et al.  Peer-to-Peer: harnessing the power of disruptive technologies , 2001, UBIQ.

[29]  Andrew B. Whinston,et al.  Scaling Web Services with Capacity Provision Networks , 2003, Computer.

[30]  Rajkumar Buyya,et al.  Decentralized media streaming infrastructure (DeMSI): An adaptive and high-performance peer-to-peer content delivery network , 2006, J. Syst. Archit..

[31]  KyoungSoo Park,et al.  Scale and Performance in the CoBlitz Large-File Distribution Service , 2006, NSDI.

[32]  Dinesh C. Verma,et al.  Content Distribution Networks: An Engineering Approach , 2001 .

[33]  P. Lebrun THE LARGE HADRON COLLIDER, A MEGASCIENCE PROJECT , 2000 .

[34]  Balachander Krishnamurthy,et al.  Flash crowds and denial of service attacks: characterization and implications for CDNs and web sites , 2002, WWW.

[35]  Martin Arlitt,et al.  A workload characterization study of the 1998 World Cup Web site , 2000, IEEE Netw..

[36]  Data processing-open systems interconnection - basic reference model , 1981, CCRV.

[37]  I. Lazar,et al.  Exploring content delivery networking , 2001 .

[38]  Zhen Xiao,et al.  Moving Edge-Side Includes to the Real Edge - the Clients , 2003, USENIX Symposium on Internet Technologies and Systems.

[39]  Gustavo Alonso,et al.  Analysis of Caching and Replication Strategies for Web Applications , 2007, IEEE Internet Computing.

[40]  Ian T. Foster,et al.  The data grid: Towards an architecture for the distributed management and analysis of large scientific datasets , 2000, J. Netw. Comput. Appl..

[41]  Carlos E. Palau,et al.  Modeling content delivery networks and their performance , 2004, Comput. Commun..

[42]  Brad Cain,et al.  A Model for Content Internetworking (CDI) , 2003, RFC.

[43]  Reinaldo Penno,et al.  An Architecture for Open Pluggable Edge Services (OPES) , 2004, RFC.

[44]  Stefano Ceri,et al.  Distributed Databases: Principles and Systems , 1984 .

[45]  Rajkumar Buyya,et al.  A taxonomy of Data Grids for distributed data sharing, management, and processing , 2005, CSUR.

[46]  Bruce M. Maggs,et al.  Globally Distributed Content Delivery , 2002, IEEE Internet Comput..

[47]  Jussara M. Almeida,et al.  Dynamic content distribution for mobile enterprise networks , 2005, IEEE Journal on Selected Areas in Communications.

[48]  Novella Bartolini,et al.  A Walk through Content Delivery Networks , 2003, MASCOTS Tutorials.

[49]  Michiaki Tatsubori,et al.  Efficient Web services response caching by selecting optimal data representation , 2004, 24th International Conference on Distributed Computing Systems, 2004. Proceedings..

[50]  David Mazières,et al.  Democratizing Content Publication with Coral , 2004, NSDI.

[51]  Reagan Moore,et al.  Data-intensive computing and digital libraries , 1998, CACM.