DeepChunk: Deep Q-Learning for Chunk-Based Caching in Wireless Data Processing Networks

A Data Processing Network (DPN) streams massive volumes of data collected and stored by the network to multiple processing units to compute desired results in a timely fashion. Due to ever-increasing traffic, distributed cache nodes can be deployed to store hot data and rapidly deliver them for consumption. However, prior work on caching policies has primarily focused on the potential gains in network performance, e.g., cache hit ratio and download latency, while neglecting the impact of cache on data processing and consumption. In this paper, we propose a novel framework, DeepChunk, which leverages deep Q-learning for chunk-based caching in wireless DPN. We show that cache policies must be optimized for both network performance during data delivery and processing efficiency during data consumption. Specifically, DeepChunk utilizes a model-free approach by jointly learning limited network, data streaming, and processing statistics at runtime and making cache update decisions under the guidance of deep Q-learning. It enables a joint optimization of multiple objectives including chunk hit ratio, processing stall time, and object download time while being self-adaptive under the time-varying workload and network conditions. We build a prototype implementation of DeepChunk with Ceph, a popular distributed object storage system. Based on real-world Wifi and 4G traces, our extensive experiments and evaluation demonstrate significant improvement, i.e., 52% increase in total reward and 68% decrease in processing stall time, over a number of baseline caching policies.

[1]  Vaneet Aggarwal,et al.  Generalization of LRU Cache Replacement Policy with Applications to Video Streaming , 2018, ACM Trans. Model. Perform. Evaluation Comput. Syst..

[2]  Vaneet Aggarwal,et al.  FastTrack: Minimizing Stalls for CDN-based Over-the-top Video Streaming Systems , 2018, ArXiv.

[3]  Jie Wu,et al.  Dache: A data aware caching for big-data applications using the MapReduce framework , 2013, 2013 Proceedings IEEE INFOCOM.

[4]  Vaneet Aggarwal,et al.  Coded Caching With Distributed Storage , 2019, IEEE Transactions on Information Theory.

[5]  Michele Garetto,et al.  A unified approach to the performance analysis of caching systems , 2013, IEEE INFOCOM 2014 - IEEE Conference on Computer Communications.

[6]  Lukás Burget,et al.  Recurrent neural network based language model , 2010, INTERSPEECH.

[7]  Rajkumar Buyya,et al.  Distributed data stream processing and edge computing: A survey on resource elasticity and future directions , 2017, J. Netw. Comput. Appl..

[8]  Lada A. Adamic,et al.  Zipf's law and the Internet , 2002, Glottometrics.

[9]  Yongbo Li,et al.  MobiQoR: Pushing the Envelope of Mobile Edge Computing Via Quality-of-Result Optimization , 2017, 2017 IEEE 37th International Conference on Distributed Computing Systems (ICDCS).

[10]  Vaneet Aggarwal,et al.  DeepPool: Distributed Model-Free Algorithm for Ride-Sharing Using Deep Reinforcement Learning , 2019, IEEE Transactions on Intelligent Transportation Systems.

[11]  Ramesh K. Sitaraman,et al.  AdaptSize: Orchestrating the Hot Object Memory Cache in a Content Delivery Network , 2017, NSDI.

[12]  Gustavo Alonso,et al.  Analysis of Caching and Replication Strategies for Web Applications , 2007, IEEE Internet Computing.

[13]  Sujit Dey,et al.  Video-Aware Scheduling and Caching in the Radio Access Network , 2014, IEEE/ACM Transactions on Networking.

[14]  Hongzi Mao,et al.  Neural Adaptive Video Streaming with Pensieve , 2017, SIGCOMM.

[15]  Ronald Fagin,et al.  Asymptotic Miss Ratios over Independent References , 1977, J. Comput. Syst. Sci..

[16]  Klaus Moessner,et al.  Dynamic Heterogeneous Learning Games for Opportunistic Access in LTE-Based Macro/Femtocell Deployments , 2015, IEEE Transactions on Wireless Communications.

[17]  Vaneet Aggarwal,et al.  LBP: Robust Rate Adaptation Algorithm for SVC Video Streaming , 2018, IEEE/ACM Transactions on Networking.

[18]  Yongbo Li,et al.  Capitalizing on the Promise of Ad Prefetching in Real-World Mobile Systems , 2017, 2017 IEEE 14th International Conference on Mobile Ad Hoc and Sensor Systems (MASS).

[19]  Mehdi Bennis,et al.  Living on the edge: The role of proactive caching in 5G wireless networks , 2014, IEEE Communications Magazine.

[20]  Dennis Shasha,et al.  2Q: A Low Overhead High Performance Buffer Management Replacement Algorithm , 1994, VLDB.

[21]  Michele Garetto,et al.  A unified approach to the performance analysis of caching systems , 2014, INFOCOM.

[22]  Tom Schaul,et al.  Dueling Network Architectures for Deep Reinforcement Learning , 2015, ICML.

[23]  Yu Xiang,et al.  Sprout: A Functional Caching Approach to Minimize Service Latency in Erasure-Coded Storage , 2016, IEEE/ACM Transactions on Networking.

[24]  David Silver,et al.  Deep Reinforcement Learning with Double Q-Learning , 2015, AAAI.

[25]  Zhiyuan Xu,et al.  Model-free Control for Distributed Stream Data Processing using Deep Reinforcement Learning , 2018, Proc. VLDB Endow..

[26]  Carlos Maltzahn,et al.  Ceph: a scalable, high-performance distributed file system , 2006, OSDI '06.

[27]  Yongbo Li,et al.  Multichoice Games for Optimizing Task Assignment in Edge Computing , 2018, 2018 IEEE Global Communications Conference (GLOBECOM).

[28]  Sujit Dey,et al.  Hierarchical video caching in wireless cloud: Approaches and algorithms , 2012, 2012 IEEE International Conference on Communications (ICC).

[29]  Dilip Kumar Krishnappa,et al.  On the Feasibility of Prefetching and Caching for Online TV Services: A Measurement Study on Hulu , 2011, PAM.

[30]  Nathan Beckmann,et al.  LHD: Improving Cache Hit Rate by Maximizing Hit Density , 2018, NSDI.

[31]  Tom Schaul,et al.  Rainbow: Combining Improvements in Deep Reinforcement Learning , 2017, AAAI.

[32]  Peter Stone,et al.  Deep Recurrent Q-Learning for Partially Observable MDPs , 2015, AAAI Fall Symposia.

[33]  Muhammad Ali Imran,et al.  A Cell Outage Management Framework for Dense Heterogeneous Networks , 2016, IEEE Transactions on Vehicular Technology.

[34]  Alex Graves,et al.  Playing Atari with Deep Reinforcement Learning , 2013, ArXiv.

[35]  Alfred V. Aho,et al.  Principles of Optimal Page Replacement , 1971, J. ACM.

[36]  Filip De Turck,et al.  HTTP/2-Based Adaptive Streaming of HEVC Video Over 4G/LTE Networks , 2016, IEEE Communications Letters.

[37]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[38]  Vaneet Aggarwal,et al.  Multi-Tier Caching Analysis in CDN-Based Over-the-Top Video Streaming Systems , 2019, IEEE/ACM Transactions on Networking.

[39]  Yongbo Li,et al.  A Reinforcement Learning Approach for Online Service Tree Placement in Edge Computing , 2019, 2019 IEEE 27th International Conference on Network Protocols (ICNP).

[40]  Michael Dory,et al.  Introduction to Tornado , 2012 .

[41]  Roy D. Yates,et al.  Real-time status: How often should one update? , 2012, 2012 Proceedings IEEE INFOCOM.

[42]  Beng Chin Ooi,et al.  The performance of MapReduce , 2010, Proc. VLDB Endow..

[43]  Ikjun Yeom,et al.  Performance analysis of in-network caching for content-centric networking , 2013, Comput. Networks.

[44]  Hao Che,et al.  Hierarchical Web caching systems: modeling, design and experimental results , 2002, IEEE J. Sel. Areas Commun..

[45]  Chi Harold Liu,et al.  Experience-driven Networking: A Deep Reinforcement Learning based Approach , 2018, IEEE INFOCOM 2018 - IEEE Conference on Computer Communications.