Reinforcement Learning-Enabled Cross-Layer Optimization for Low-Power and Lossy Networks under Heterogeneous Traffic Patterns

The next generation of Internet of Things (IoT) networks is expected to support massive sensor deployments with radically heterogeneous application traffic, which leads to network congestion and calls for new mechanisms to improve network efficiency. Existing protocols rely on simple heuristic mechanisms, and the probability of collision remains one of the significant challenges for future IoT networks. The medium access control layer of IEEE 802.15.4 uses a distributed coordination function that determines how efficiently wireless channels are accessed in IoT networks. Similarly, the network layer uses a ranking mechanism to route packets. The objective of this study was to intelligently exploit cooperation among multiple communication layers in an IoT network. Recently, Q-learning (QL), a machine learning algorithm, has emerged as a way to solve learning problems on energy- and computation-constrained sensor devices. We therefore present a QL-based intelligent collision probability inference algorithm that optimizes the performance of sensor nodes by using channel collision probability and network-layer ranking states together with an accumulated reward function. Simulation results showed that the proposed scheme achieved a higher packet reception ratio, produced significantly lower control overhead, and consumed less energy than current state-of-the-art mechanisms.
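
As a rough illustration of the kind of learner the abstract describes, the following is a minimal sketch of tabular Q-learning in which a state combines a discretized channel collision probability with a network-layer rank. It is not the authors' algorithm: the state discretization, the action set (here just integer indices, for example candidate parents or backoff settings), the reward signal, and the constants ALPHA, GAMMA, and EPSILON are all assumptions made for the example.

```python
import random
from collections import defaultdict

ALPHA = 0.5    # learning rate (assumed value)
GAMMA = 0.9    # discount factor (assumed value)
EPSILON = 0.1  # exploration probability (assumed value)


def make_state(collision_prob, rank, bins=5, max_rank=10):
    """Map (collision probability in [0, 1], integer rank) to a discrete state.

    The bin count and rank cap are hypothetical choices for this sketch.
    """
    p_bin = min(int(collision_prob * bins), bins - 1)
    r_bin = min(rank, max_rank)
    return (p_bin, r_bin)


def choose_action(q, state, n_actions, rng=random):
    """Epsilon-greedy selection over action indices 0..n_actions-1."""
    if rng.random() < EPSILON:
        return rng.randrange(n_actions)
    return max(range(n_actions), key=lambda a: q[(state, a)])


def q_update(q, state, action, reward, next_state, n_actions):
    """Standard one-step Q-learning update; returns the new Q-value."""
    best_next = max(q[(next_state, a)] for a in range(n_actions))
    td_target = reward + GAMMA * best_next
    q[(state, action)] += ALPHA * (td_target - q[(state, action)])
    return q[(state, action)]


# Q-table with a default value of 0.0 for unseen (state, action) pairs.
q_table = defaultdict(float)
```

In such a sketch a node would call make_state after each observation window, pick an action with choose_action, and feed the observed reward (for instance, one derived from successful deliveries) into q_update, so that the Q-values accumulate the discounted reward over time.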
