GlobeCBC: Content-blind Result Caching for Dynamic Web Applications

In this paper, we present GlobeCBC, a content-blind query caching middleware for hosting Web applications in an edge computing infrastructure. Unlike existing data caching middleware systems, GlobeCBC stores the query results independently and does not merge different query results. We study the potential performance of this approach using extensive experimentations on our prototype implementation and compare it with other systems over an emulated wide-area network. Our evaluations show that content-blind caching performs well in terms of client latency for applications that exhibit high locality. It allows the system to sustain higher throughput by offloading the origin server database. We also present the design and evaluation of different online cache replacement algorithms for edge servers that have limited resource capabilities. In our evaluations, we find that the best heuristic must exploit temporal locality and take into account the query execution cost simultaneously.

[1]  Amin Vahdat,et al.  Design and evaluation of a conit-based continuous consistency model for replicated services , 2002, TOCS.

[2]  Bruce M. Maggs,et al.  A Scalability Service for Dynamic Web Applications , 2005, CIDR.

[3]  Arthur J. Bernstein,et al.  Bounded ignorance: a technique for increasing concurrency in a replicated system , 1994, TODS.

[4]  Sriram Padmanabhan,et al.  Scalable template-based query containment checking for Web semantic caches , 2003, Proceedings 19th International Conference on Data Engineering (Cat. No.03CH37405).

[5]  Gustavo Alonso,et al.  GlobeDB: autonomic data replication for web applications , 2005, WWW '05.

[6]  Jonathan Goldstein,et al.  MTCache: Mid-Tier Database Caching for SQL Server , 2004, IEEE Data Eng. Bull..

[7]  Arun Iyengar,et al.  A fragment-based approach for efficiently creating dynamic web content , 2005, TOIT.

[8]  Bruce M. Maggs,et al.  Globally Distributed Content Delivery , 2002, IEEE Internet Comput..

[9]  Sriram Padmanabhan,et al.  DBProxy: a dynamic data cache for web applications , 2003, Proceedings 19th International Conference on Data Engineering (Cat. No.03CH37405).

[10]  Sandy Irani,et al.  Cost-Aware WWW Proxy Caching Algorithms , 1997, USENIX Symposium on Internet Technologies and Systems.

[11]  Hamid Pirahesh,et al.  Adaptive Database Caching with DBCache , 2004, IEEE Data Eng. Bull..

[12]  Gustavo Alonso,et al.  Ganymed: Scalable Replication for Transactional Web Applications , 2004, Middleware.

[13]  Amit Aggarwal,et al.  Computing on the Edge: A Platform for Replicating Internet Applications , 2003, WCW.

[14]  Erik Brynjolfsson,et al.  Consumer Surplus in the Digital Economy: Estimating the Value of Increased Product Variety at Online Booksellers , 2003, Manag. Sci..

[15]  Krithi Ramamritham,et al.  Proxy-based acceleration of dynamically generated content on the world wide web: An approach and implementation , 2004, ACM Trans. Database Syst..

[16]  Wayne D. Smith,et al.  TPC-W: Benchmarking An Ecommerce Solution , 2001 .

[17]  Divyakant Agrawal,et al.  Engineering and hosting adaptive freshness-sensitive web applications on data centers , 2003, WWW '03.

[18]  Michal Szymaniak,et al.  Replication for web hosting systems , 2004, CSUR.

[19]  Guillaume Pierre,et al.  Globule: a User-Centric Content Delivery Network , 2004 .

[20]  Zhen Xiao,et al.  Moving Edge-Side Includes to the Real Edge - the Clients , 2003, USENIX Symposium on Internet Technologies and Systems.

[21]  Jin Zhang,et al.  Active Cache: caching dynamic contents on the Web , 1999, Distributed Syst. Eng..

[22]  Alan L. Cox,et al.  Conflict-Aware Scheduling for Dynamic Content Applications , 2003, USENIX Symposium on Internet Technologies and Systems.

[23]  Divyakant Agrawal,et al.  Partial database replication using epidemic communication , 2002, Proceedings 22nd International Conference on Distributed Computing Systems.

[24]  Emmanuel Cecchet,et al.  C-JDBC: a Middleware Framework for Database Clustering , 2004, IEEE Data Eng. Bull..

[25]  Ricardo Jiménez-Peris,et al.  Middleware based data replication providing snapshot isolation , 2005, SIGMOD '05.

[26]  Divesh Srivastava,et al.  Semantic Data Caching and Replacement , 1996, VLDB.

[27]  Ellen W. Zegura,et al.  A novel server selection technique for improving the response time of a replicated service , 1998, Proceedings. IEEE INFOCOM '98, the Conference on Computer Communications. Seventeenth Annual Joint Conference of the IEEE Computer and Communications Societies. Gateway to the 21st Century (Cat. No.98.

[28]  Sriram Padmanabhan,et al.  On space management in a dynamic edge data cache , 2002, WebDB.