Smart duty cycle control with reinforcement learning for machine to machine communications

Machine to machine (M2M) communications is one of the key underpinning technologies for Internet of Things (IoT) applications in 5G networks. The large scale of M2M devices imposes challenge on conventional medium access control protocols. In this paper, we propose a reinforcement learning (RL) based duty cycle control for dominant short-range technology IEEE 802.15.4 to provide high performance and reliable M2M communication. We first model a practical multi-hop M2M communication network that takes various network dynamics into consideration. Then, we mathematically derive the distributed optimal duty cycle control policy to optimise the energy efficiency, end-to-end delay and transmission reliability. Finally, a RL based practical duty cycle control is developed to learn the optimal policy directly without priori network information, which contributes to the smart duty cycle control under various network dynamics. Simulation results show that the proposed RL based duty cycle control achieves the best balance between optimality and stability, compared with the optimal and the existing IEEE 802.15.4 duty cycle controls.

[1]  Chiara Buratti,et al.  Performance Analysis of IEEE 802.15.4 Beacon-Enabled Mode , 2010, IEEE Transactions on Vehicular Technology.

[2]  Dirk Pesch,et al.  Duty cycle learning algorithm (DCLA) for IEEE 802.15.4 beacon-enabled wireless sensor networks , 2012, Ad Hoc Networks.

[3]  Dimitri P. Bertsekas,et al.  Dynamic Programming and Optimal Control, Two Volume Set , 1995 .

[4]  LooJonathan,et al.  Duty cycle control with joint optimisation of delay and energy efficiency for capillary machine-to-machine networks in 5G communication system , 2015 .

[5]  Athanasios S. Lioumpas,et al.  Analytical modelling and performance evaluation of realistic time-controlled M2M scheduling over LTE cellular networks , 2013, Trans. Emerg. Telecommun. Technol..

[6]  Zuoyin Tang,et al.  Evaluating effectiveness of IEEE 802.15.4 networks for M2M communications , 2014 .

[7]  Wook Hyun Kwon,et al.  Throughput and energy consumption analysis of IEEE 802.15.4 slotted CSMA/CA , 2005 .

[8]  Hsiao-Hwa Chen,et al.  Evaluating Effectiveness of IEEE 802.15.4 Networks for M2M Communications , 2014 .

[9]  Wei Xiang,et al.  Radio resource allocation in LTE-advanced cellular networks with M2M communications , 2012, IEEE Communications Magazine.

[10]  H. Scarf THE OPTIMALITY OF (S,S) POLICIES IN THE DYNAMIC INVENTORY PROBLEM , 1959 .

[11]  Dimitri P. Bertsekas,et al.  Dynamic programming and optimal control, 3rd Edition , 2005 .

[12]  Mao Jianlin,et al.  RL-based superframe order adaptation algorithm for IEEE 802.15.4 networks , 2009, 2009 Chinese Control and Decision Conference.

[13]  Zhenzhen Liu,et al.  RL-MAC: a reinforcement learning based MAC protocol for wireless sensor networks , 2006, Int. J. Sens. Networks.

[14]  Carlo Fischione,et al.  Duty-cycle optimization for IEEE 802.15.4 wireless sensor networks , 2013, ACM Trans. Sens. Networks.

[15]  Wendi B. Heinzelman,et al.  Energy-Efficient Duty Cycle Assignment for Receiver-Based Convergecast in Wireless Sensor Networks , 2010, 2010 IEEE Global Telecommunications Conference GLOBECOM 2010.

[16]  Junglok Yu,et al.  Adaptive Duty Cycle Control with Queue Management in Wireless Sensor Networks , 2013, IEEE Transactions on Mobile Computing.

[17]  Tiankui Zhang,et al.  Opportunistic User Association for Multi-Service HetNets Using Nash Bargaining Solution , 2014, IEEE Communications Letters.

[18]  Jonathan Loo,et al.  Optimised delay-energy aware duty cycle control for IEEE 802.15.4 with cumulative acknowledgement , 2014, 2014 IEEE 25th Annual International Symposium on Personal, Indoor, and Mobile Radio Communication (PIMRC).

[19]  Monique Becker,et al.  Effect of Topology on the Performance of Mobile Heterogeneous Sensor Networks , 2007 .

[20]  Gen-Huey Chen,et al.  Utilization based duty cycle tuning MAC protocol for wireless sensor networks , 2005, GLOBECOM '05. IEEE Global Telecommunications Conference, 2005..

[21]  Jonathan Loo,et al.  Duty cycle control with joint optimisation of delay and energy efficiency for capillary machine‐to‐machine networks in 5G communication system , 2015, Trans. Emerg. Telecommun. Technol..

[22]  Guoliang Xing,et al.  Dynamic duty cycle control for end-to-end delay guarantees in wireless sensor networks , 2010, 2010 IEEE 18th International Workshop on Quality of Service (IWQoS).