Cooperative Communications With Relay Selection Based on Deep Reinforcement Learning in Wireless Sensor Networks

Cooperative communication technology has become a research hotspot in wireless sensor networks (WSNs) in recent years, and will become one of the key technologies for improving spectrum utilization in wireless communication systems in the future. It leverages cooperation among multiple relay nodes in the wireless network to realize path transmission sharing, thereby improving the system throughput. In this paper, we model the process of cooperative communications with relay selection in WSNs as a Markov decision process and propose DQ-RSS, a deep-reinforcement-learning-based relay selection scheme, in WSNs. In DQ-RSS, a deep-Q-network (DQN) is trained according to the outage probability and mutual information, and the optimal relay is selected from a plurality of relay nodes without the need for a network model or prior data. More specifically, we use DQN to process high-dimensional state spaces and accelerate the learning rate. We compare DQ-RSS with the Q-learning-based relay selection scheme and evaluate the network performance on the basis of three aspects: outage probability, system capacity, and energy consumption. Simulation results indicate that DQ-RSS can achieve better performance on these elements and save the convergence time compared with existing schemes.

[1]  Feng Liu,et al.  Energy-efficient cooperative communication for data transmission in wireless sensor networks , 2010, IEEE Transactions on Consumer Electronics.

[2]  Li Li,et al.  Cooperative communication based on random beamforming strategy in wireless sensor networks , 2012, 2012 IEEE Global Communications Conference (GLOBECOM).

[3]  Mohsen Guizani,et al.  An effective key management scheme for heterogeneous sensor networks , 2007, Ad Hoc Networks.

[4]  Larry J. Greenstein,et al.  An empirically based path loss model for wireless channels in suburban environments , 1999, IEEE J. Sel. Areas Commun..

[5]  Mohsen Guizani,et al.  LTE-U and Wi-Fi Coexistence Algorithm Based on Q-Learning in Multi-Channel , 2018, IEEE Access.

[6]  Hyundong Shin,et al.  Cooperative Communications with Outage-Optimal Opportunistic Relaying , 2007, IEEE Transactions on Wireless Communications.

[7]  Mohsen Guizani,et al.  Transactions papers a routing-driven Elliptic Curve Cryptography based key management scheme for Heterogeneous Sensor Networks , 2009, IEEE Transactions on Wireless Communications.

[8]  Takeshi Shibuya,et al.  Q-Learning in Continuous State-Action Space with Noisy and Redundant Inputs by Using a Selective Desensitization Neural Network , 2015, J. Adv. Comput. Intell. Intell. Informatics.

[9]  Weihua Zhuang,et al.  Anti-Jamming Communication Game for UAV-Aided VANETs , 2017, GLOBECOM 2017 - 2017 IEEE Global Communications Conference.

[10]  Mark Humphrys W-learning: Competition among selfish Q-learners , 1995 .

[11]  Giancarlo Fortino,et al.  QL-MAC: A Q-Learning Based MAC for Wireless Sensor Networks , 2013, ICA3PP.

[12]  Gregory W. Wornell,et al.  Cooperative diversity in wireless networks: Efficient protocols and outage behavior , 2004, IEEE Transactions on Information Theory.

[13]  Li Sun,et al.  Cooperative communications with relay selection in wireless sensor networks , 2009, IEEE Transactions on Consumer Electronics.

[14]  Vincent W. S. Wong,et al.  Cooperative Protocols Design for Wireless Ad-Hoc Networks with Multi-hop Routing , 2008, QShine '08.

[15]  Derong Liu,et al.  A Novel Dual Iterative $Q$-Learning Method for Optimal Battery Management in Smart Residential Environments , 2015, IEEE Transactions on Industrial Electronics.

[16]  Wenhao Huang,et al.  Deep Architecture for Traffic Flow Prediction: Deep Belief Networks With Multitask Learning , 2014, IEEE Transactions on Intelligent Transportation Systems.

[17]  Andrew W. Moore,et al.  Reinforcement Learning: A Survey , 1996, J. Artif. Intell. Res..

[18]  Demis Hassabis,et al.  Mastering the game of Go with deep neural networks and tree search , 2016, Nature.

[19]  Aria Nosratinia,et al.  Cooperative communication in wireless networks , 2004, IEEE Communications Magazine.

[20]  Aggelos Bletsas,et al.  A simple Cooperative diversity method based on network path selection , 2005, IEEE Journal on Selected Areas in Communications.

[21]  Zhu Han,et al.  Distributed Energy-Efficient Cooperative Routing in Wireless Networks , 2007, IEEE GLOBECOM 2007 - IEEE Global Telecommunications Conference.

[22]  Li Ren,et al.  A Multiagent Q-Learning-Based Optimal Allocation Approach for Urban Water Resource Management System , 2014, IEEE Transactions on Automation Science and Engineering.

[23]  Giancarlo Fortino,et al.  Lightweight Reinforcement Learning for Energy Efficient Communications in Wireless Sensor Networks , 2019, IEEE Access.

[24]  Sunghwan Kim,et al.  Relay selection Algorithm for wireless cooperative networks: a learning-based approach , 2017, IET Commun..

[25]  Weihua Zhuang,et al.  UAV Relay in VANETs Against Smart Jamming With Reinforcement Learning , 2018, IEEE Transactions on Vehicular Technology.

[26]  Ah-Hwee Tan,et al.  Fast Reinforcement Learning under Uncertainties with Self-Organizing Neural Networks , 2015, 2015 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT).

[27]  Lianfen Huang,et al.  QRED: A Q-Learning-based Active Queue Management Scheme , 2018 .

[28]  Victor C. M. Leung,et al.  Cooperative Communications with Relay Selection for QoS Provisioning in Wireless Sensor Networks , 2009, GLOBECOM 2009 - 2009 IEEE Global Telecommunications Conference.

[29]  Jianlin Cheng,et al.  A Deep Learning Network Approach to ab initio Protein Secondary Structure Prediction , 2015, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[30]  Shane Legg,et al.  Human-level control through deep reinforcement learning , 2015, Nature.

[31]  Xiaojiang Du,et al.  A survey of key management schemes in wireless sensor networks , 2007, Comput. Commun..

[32]  K. J. Ray Liu,et al.  Cooperative communications with relay-selection: when to cooperate and whom to cooperate with? , 2008, IEEE Transactions on Wireless Communications.