Learning-based Dynamic Cache Management in a Cloud

Caches are an important component of modern computing systems given their significant impact on performance. In particular, caches play a key role in the cloud due to the nature of large-scale, data-intensive processing. One of the key challenges for the cloud providers is how to share the caching capacity among tenants, under the circumstance that each often requires a different degree of quality of service (QoS) with respect to data access performance. The invariant is that the individual tenants' QoS requirements should be satisfied while the cache usage is optimized in a system-wide manner. In this paper, we introduce a learning-based approach for dynamic cache management in a cloud, which is based on the estimation of data access pattern of a tenant and the prediction of cache performance for the access pattern in question. We consider a variety of probability distributions to estimate the data access pattern, and examine a set of learning-based regression techniques to predict the cache hit rate for the access pattern. The predicted cache hit rate is then used to make a decision whether reallocating cache space is needed to meet the QoS requirement for the tenant. Our experimental results with an extensive set of synthetic traces and the YCSB benchmark show that the proposed method consistently optimizes the cache space while satisfying the QoS requirement.

[1]  Haifeng Chen,et al.  Proactive Workload Management in Hybrid Cloud Computing , 2014, IEEE Transactions on Network and Service Management.

[2]  Alexandros G. Dimakis,et al.  Femtocaching and device-to-device collaboration: A new architecture for wireless video distribution , 2012, IEEE Communications Magazine.

[3]  Wei Wang,et al.  OpuS: Fair and Efficient Cache Sharing for In-Memory Data Analytics , 2018, 2018 IEEE 38th International Conference on Distributed Computing Systems (ICDCS).

[4]  Jinoh Kim,et al.  Unsupervised Labeling for Supervised Anomaly Detection in Enterprise and Cloud Networks , 2017, 2017 IEEE 4th International Conference on Cyber Security and Cloud Computing (CSCloud).

[5]  Ymir Vigfusson,et al.  Design and implementation of caching services in the cloud , 2011, IBM J. Res. Dev..

[6]  George C. Polyzos,et al.  GlobeTraff: A Traffic Workload Generator for the Performance Evaluation of Future Internet Architectures , 2012, 2012 5th International Conference on New Technologies, Mobility and Security (NTMS).

[7]  Li Fan,et al.  Web caching and Zipf-like distributions: evidence and implications , 1999, IEEE INFOCOM '99. Conference on Computer Communications. Proceedings. Eighteenth Annual Joint Conference of the IEEE Computer and Communications Societies. The Future is Now (Cat. No.99CH36320).

[8]  Songqing Chen,et al.  The stretched exponential distribution of internet media access patterns , 2008, PODC '08.

[9]  Min Chen,et al.  Green and Mobility-Aware Caching in 5G Networks , 2017, IEEE Transactions on Wireless Communications.

[10]  Mariacarla Calzarossa,et al.  Workload Characterization , 2016, ACM Comput. Surv..

[11]  Gregory Chockler,et al.  Data caching as a cloud service , 2010, LADIS '10.

[12]  David A. Padua,et al.  Calculating stack distances efficiently , 2002, MSP/ISMM.

[13]  Ludmila Cherkasova,et al.  Analysis of enterprise media server workloads: access patterns, locality, content evolution, and rates of change , 2004, IEEE/ACM Transactions on Networking.

[14]  Alexandros G. Dimakis,et al.  Scaling Behavior for Device-to-Device Communications With Distributed Caching , 2014, IEEE Transactions on Information Theory.

[15]  Krishna P. Gummadi,et al.  Measurement, modeling, and analysis of a peer-to-peer file-sharing workload , 2003, SOSP '03.

[16]  Donald F. Towsley,et al.  Optimal proxy cache allocation for efficient streaming media distribution , 2002, IEEE Transactions on Multimedia.

[17]  Jun Zhang,et al.  Network Traffic Classification Using Correlation Information , 2013, IEEE Transactions on Parallel and Distributed Systems.

[18]  Xu Du,et al.  Internal popularity of streaming video and its implication on caching , 2006, 20th International Conference on Advanced Information Networking and Applications - Volume 1 (AINA'06).

[19]  Muttukrishnan Rajarajan,et al.  Privacy-Preserving Multi-Class Support Vector Machine for Outsourcing the Data Classification in Cloud , 2014, IEEE Transactions on Dependable and Secure Computing.

[20]  Ali Ghodsi,et al.  FairRide: Near-Optimal, Fair Cache Sharing , 2016, NSDI.

[21]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[22]  Andrew Warfield,et al.  Characterizing Storage Workloads with Counter Stacks , 2014, OSDI.

[23]  Sujit Dey,et al.  Video caching in Radio Access Network: Impact on delay and capacity , 2012, 2012 IEEE Wireless Communications and Networking Conference (WCNC).

[24]  Jaesik Choi,et al.  Relational Dynamic Bayesian Networks with Locally Exchangeable Measures , 2013 .

[25]  Meng Chang Chen,et al.  A Workload Analysis of Live Event Broadcast Service in Cloud , 2013, ANT/SEIT.

[26]  Anshul Gandhi,et al.  ElMem: Towards an Elastic Memcached System , 2018, 2018 IEEE 38th International Conference on Distributed Computing Systems (ICDCS).

[27]  Brad Fitzpatrick,et al.  Distributed caching with memcached , 2004 .

[28]  Niklas Carlsson,et al.  Dynamic content allocation for cloud-assisted service of periodic workloads , 2014, IEEE INFOCOM 2014 - IEEE Conference on Computer Communications.

[29]  Lakshmish Ramaswamy,et al.  Cache Clouds: Cooperative Caching of Dynamic Documents in Edge Networks , 2005, 25th IEEE International Conference on Distributed Computing Systems (ICDCS'05).

[30]  Sachin Katti,et al.  Dynacache: Dynamic Cloud Caching , 2015, HotStorage.

[31]  Dongbing Gu,et al.  Spatial Gaussian Process Regression With Mobile Sensor Networks , 2012, IEEE Transactions on Neural Networks and Learning Systems.

[32]  M LevyHenry,et al.  Measurement, modeling, and analysis of a peer-to-peer file-sharing workload , 2003 .

[33]  Adam Silberstein,et al.  Benchmarking cloud serving systems with YCSB , 2010, SoCC '10.