Minimizing Latency in Serving Requests through Differential Template Caching in a Cloud

In Software-as-a-Service (SaaS) cloud delivery model, a hosting center deploys a Virtual Machine (VM) image template on a server on demand. Image templates are usually maintained in a central repository. With geographically dispersed hosting centers, time to transfer a large, often GigaByte sized, template file from the repository faces high latency due to low Internet bandwidth. An architecture that maintains a template cache, collocated with the hosting centers, can reduce request service latency. Since templates are large in size, caching complete templates is prohibitive in terms of storage space. In order to optimize cache space requirement, as well as, to reduce transfers from the repository, we propose a differential template caching technique, called DiffCache. A difference file or a patch between two templates, that have common components, is small in size. DiffCache computes an optimal selection of templates and patches based on the frequency of requests for specific templates. A template missing in the cache can be generated if any cached template can be patched with a cached patch file, thereby saving the transfer time from the repository at the cost of relatively small patching time. We show that patch based caching coupled with intelligent population of the cache can lead to a 90% improvement in service request latency when compared with caching only template files.

[1]  Ethan L. Miller,et al.  The effectiveness of deduplication on virtual machine disk images , 2009, SYSTOR '09.

[2]  Monica S. Lam,et al.  The collective: a cache-based system management architecture , 2005, NSDI.

[3]  Bruce M. Maggs,et al.  Globally Distributed Content Delivery , 2002, IEEE Internet Comput..

[4]  Sebastien Goasguen,et al.  Image Distribution Mechanisms in Large Scale Cloud Providers , 2010, 2010 IEEE Second International Conference on Cloud Computing Technology and Science.

[5]  Yu Chen,et al.  Machine Bank: Own Your Virtual Personal Computer , 2007, 2007 IEEE International Parallel and Distributed Processing Symposium.

[6]  Chunyi Peng,et al.  An empirical analysis of similarity in virtual machine images , 2011, Middleware '11.

[7]  Stefan Berger,et al.  RC2 - A Living Lab for Cloud Computing , 2010, LISA.

[8]  Jianhui Lin,et al.  Research on WEB Cache Prediction Recommend Mechanism Based on Usage Pattern , 2008, WKDD.

[9]  Yang Chao,et al.  Research on WEB Cache Prediction Recommend Mechanism Based on Usage Pattern , 2008, First International Workshop on Knowledge Discovery and Data Mining (WKDD 2008).

[10]  Ying Chen,et al.  Rapid Provisioning of Cloud Infrastructure Leveraging Peer-to-Peer Networks , 2009, 2009 29th IEEE International Conference on Distributed Computing Systems Workshops.

[11]  Darrell Reimer,et al.  Virtual Machine Images as Structured Data: The Mirage Image Library , 2011, HotCloud.