Caching Transient Data for Internet of Things: A Deep Reinforcement Learning Approach

Connected devices in Internet-of-Things (IoT) continuously generate enormous amount of data, which is transient and would be requested by IoT application users, such as autonomous vehicles. Transmitting IoT data through wireless networks would lead to congestions and long delays, which can be tackled by caching IoT data at the network edge. However, it is challenging to jointly consider IoT data-transiency and dynamic context characteristics. In this paper, we advocate the use of deep reinforcement learning (DRL) to solve the problem of caching IoT data at the edge without knowing future IoT data popularity, user request pattern, and other context characteristics. By defining data freshness metrics, the aim of determining IoT data caching policy is to strike a balance between the communication cost and the loss of data freshness. Extensive simulation results corroborate that the proposed DRL-based IoT data caching policy outperforms other baseline policies.

[1]  Ralf Tönjes,et al.  A Comprehensive Ontology for Knowledge Representation in the Internet of Things , 2012, 2012 IEEE 11th International Conference on Trust, Security and Privacy in Computing and Communications.

[2]  Tao Zhang,et al.  Fog and IoT: An Overview of Research Opportunities , 2016, IEEE Internet of Things Journal.

[3]  Victor C. M. Leung,et al.  Developing IoT applications in the Fog: A Distributed Dataflow approach , 2015, 2015 5th International Conference on the Internet of Things (IOT).

[4]  M. Nitti,et al.  Exploiting Social Internet of Things Features in Cognitive Radio , 2016, IEEE Access.

[5]  Nirwan Ansari,et al.  EdgeIoT: Mobile Edge Computing for the Internet of Things , 2016, IEEE Communications Magazine.

[6]  Yunhao Liu,et al.  CitySee: not only a wireless sensor network , 2013, IEEE Network.

[7]  Vikram Krishnamurthy,et al.  Adaptive Scheme for Caching YouTube Content in a Cellular Network: Machine Learning Approach , 2017, IEEE Access.

[8]  Tao Jiang,et al.  Edge Computing Framework for Cooperative Video Processing in Multimedia IoT Systems , 2018, IEEE Transactions on Multimedia.

[9]  Deniz Gündüz,et al.  Learning-based optimization of cache content in a small cell base station , 2014, 2014 IEEE International Conference on Communications (ICC).

[10]  Paolo Bellavista,et al.  Convergence of MANET and WSN in IoT Urban Scenarios , 2013, IEEE Sensors Journal.

[11]  Yi Xu,et al.  Wireless Mesh Network in Smart Grid: Modeling and Analysis for Time Critical Communications , 2013, IEEE Transactions on Wireless Communications.

[12]  Marc Peter Deisenroth,et al.  Deep Reinforcement Learning: A Brief Survey , 2017, IEEE Signal Processing Magazine.

[13]  Ness B. Shroff,et al.  Heterogeneous Delay Tolerant Task Scheduling and Energy Management in the Smart Grid with Renewable Energy , 2013, IEEE Journal on Selected Areas in Communications.

[14]  Tao Jiang,et al.  QoE-aware resource allocation for adaptive device-to-device video streaming , 2015, IEEE Network.

[15]  Alex Graves,et al.  Asynchronous Methods for Deep Reinforcement Learning , 2016, ICML.

[16]  Min Sheng,et al.  Learning-Based Content Caching and Sharing for Wireless Networks , 2017, IEEE Transactions on Communications.

[17]  R. L. Armentano,et al.  Integrated e-Health Approach Based on Vascular Ultrasound and Pulse Wave Analysis for Asymptomatic Atherosclerosis Detection and Cardiovascular Risk Stratification in the Community , 2012, IEEE Transactions on Information Technology in Biomedicine.

[18]  Rahim Tafazolli,et al.  In-network caching of Internet-of-Things data , 2014, 2014 IEEE International Conference on Communications (ICC).

[19]  Timo Ojala,et al.  Traffic in the Smart City: Exploring City-Wide Sensing for Traffic Control Center Augmentation , 2013, IEEE Internet Computing.

[20]  John N. Tsitsiklis,et al.  Actor-Critic Algorithms , 1999, NIPS.

[21]  Ioannis Lambadaris,et al.  IoT Data Lifetime-Based Cooperative Caching Scheme for ICN-IoT Networks , 2018, 2018 IEEE International Conference on Communications (ICC).

[22]  M. Draief,et al.  Placing dynamic content in caches with small population , 2016, IEEE INFOCOM 2016 - The 35th Annual IEEE International Conference on Computer Communications.

[23]  Mehdi Bennis,et al.  Living on the edge: The role of proactive caching in 5G wireless networks , 2014, IEEE Communications Magazine.

[24]  Zhu Han,et al.  Multi-Hop Cooperative Caching in Social IoT Using Matching Theory , 2018, IEEE Transactions on Wireless Communications.

[25]  Shane Legg,et al.  Human-level control through deep reinforcement learning , 2015, Nature.

[26]  Rahim Tafazolli,et al.  Caching Transient Data in Internet Content Routers , 2017, IEEE/ACM Transactions on Networking.