Measuring and Evaluating Live Content Consistency in a Large-Scale CDN

Content Delivery Networks (CDNs) play a central role in today's Internet infrastructure and have seen a sharp increase in scale. More and more Internet sites serve dynamic (or live) content, such as live sports game statistics, e-commerce, and online auctions, and there is a need to deliver fresh dynamic content at scale. At this scale, maintaining consistency for dynamic content (content with frequent updates) served by CDNs is non-trivial. The large number of widely scattered replicas helps guarantee the QoS experienced by end users, but it also greatly increases the complexity of consistency maintenance. Current consistency maintenance infrastructures and methods cannot satisfy both requirements, scalability and consistency, at the same time. In this paper, we first analyze trace data that we crawled for cached sports game content on thousands of content servers of a major CDN. We examine content consistency from several perspectives and use the results to break down the reasons for inconsistency among content servers. Finally, we evaluate the consistency, scalability, and overhead of different infrastructures combined with different update methods. Through this evaluation, we itemize the advantages and disadvantages of the different methods and infrastructures in different scenarios. We aim to give guidance on selecting appropriate consistency maintenance infrastructures and methods for a CDN, and on choosing a CDN service under different considerations.
