Autonomic Data Placement Strategies for Update-intensive Web Applications

Edge computing infrastructures have become the leading platform for hosting Web applications. One of the key challenges in these infrastructures is the replication of application data. In our earlier research, we presented GlobeDB, a middleware for edge computing infrastructures that performs autonomic replication of application data. In this paper, we study the problem of data unit placement for update-intensive Web applications in the context of GlobeDB. Our hypothesis is that there exists a continuous spectrum of placement choices between complete partitioning of sets of data units across edge servers and full replication of data units to all servers. We propose and evaluate different families of heuristics for this replica placement problem. As our experiments show, a heuristic that takes into account both the individual characteristics of data units and the overall system load performs best.
