Intelligent Power Control for Spectrum Sharing in Cognitive Radios: A Deep Reinforcement Learning Approach

We consider the problem of spectrum sharing in a cognitive radio system consisting of a primary user and a secondary user. The primary user and the secondary user work in a non-cooperative manner. Specifically, the primary user is assumed to update its transmitted power based on a pre-defined power control policy. The secondary user does not have any knowledge about the primary user’s transmit power, or its power control strategy. The objective of this paper is to develop a learning-based power control method for the secondary user in order to share the common spectrum with the primary user. To assist the secondary user, a set of sensor nodes are spatially deployed to collect the received signal strength information at different locations in the wireless environment. We develop a deep reinforcement learning-based method, which the secondary user can use to intelligently adjust its transmit power such that after a few rounds of interaction with the primary user, both users can transmit their own data successfully with required qualities of service. Our experimental results show that the secondary user can interact with the primary user efficiently to reach a goal state (defined as a state in which both users can successfully transmit their data) from any initial states within a few number of steps.

[1]  Mohammed Nafie,et al.  Admission and Power Control for Spectrum Sharing Cognitive Radio Networks , 2011, IEEE Transactions on Wireless Communications.

[2]  Kobi Cohen,et al.  Deep Multi-User Reinforcement Learning for Distributed Dynamic Spectrum Access , 2017, IEEE Transactions on Wireless Communications.

[3]  Xianfu Chen,et al.  Stochastic Power Adaptation with Multiagent Reinforcement Learning for Cognitive Wireless Mesh Networks , 2013, IEEE Transactions on Mobile Computing.

[4]  Ness B. Shroff,et al.  A utility-based power-control scheme in wireless cellular systems , 2003, TNET.

[5]  Amr El-Keyi,et al.  Power Control for Constrained Throughput Maximization in Spectrum Shared Networks , 2010, 2010 IEEE Global Telecommunications Conference GLOBECOM 2010.

[6]  R. Bellman Dynamic programming. , 1957, Science.

[7]  Husheng Li Multiagent Q-Learning for Aloha-Like Spectrum Access in Cognitive Radio Systems , 2010, EURASIP J. Wirel. Commun. Netw..

[8]  Demis Hassabis,et al.  Mastering the game of Go with deep neural networks and tree search , 2016, Nature.

[9]  Mihaela van der Schaar,et al.  Learning to Compete for Resources in Wireless Stochastic Games , 2009, IEEE Transactions on Vehicular Technology.

[10]  Ying-Chang Liang,et al.  Distributed Power and Admission Control for Cognitive Radio Networks Using Antenna Arrays , 2007, 2007 2nd IEEE International Symposium on New Frontiers in Dynamic Spectrum Access Networks.

[11]  Ioannis Mitliagkas,et al.  Joint Power and Admission Control for Ad-Hoc and Cognitive Underlay Networks: Convex Approximation and Distributed Implementation , 2011, IEEE Transactions on Wireless Communications.

[12]  Jun Fang,et al.  Multiantenna-Assisted Spectrum Sensing for Cognitive Radio , 2010, IEEE Transactions on Vehicular Technology.

[13]  Rajarathnam Chandramouli,et al.  Dynamic Spectrum Access with QoS and Interference Temperature Constraints , 2007, IEEE Transactions on Mobile Computing.

[14]  Sirin Tekinay,et al.  Optimal Power Allocation in NOMA Systems with Imperfect Channel Estimation , 2017, GLOBECOM 2017 - 2017 IEEE Global Communications Conference.

[15]  Rekha Jain,et al.  Wireless Sensor Network -A Survey , 2013 .

[16]  Ya-Feng Liu,et al.  Sample Approximation-Based Deflation Approaches for Chance SINR-Constrained Joint Power and Admission Control , 2013, IEEE Transactions on Wireless Communications.

[17]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[18]  Kobi Cohen,et al.  Deep Multi-User Reinforcement Learning for Dynamic Spectrum Access in Multichannel Wireless Networks , 2017, GLOBECOM 2017 - 2017 IEEE Global Communications Conference.

[19]  Lin Gao,et al.  Two-Sided Matching Based Cooperative Spectrum Sharing , 2016, IEEE Transactions on Mobile Computing.

[20]  Dong In Kim,et al.  Joint rate and power allocation for cognitive radios in dynamic spectrum access environment , 2008, IEEE Transactions on Wireless Communications.

[21]  Bin Li,et al.  Adaptive power control algorithm in cognitive radio based on game theory , 2015, IET Commun..

[22]  Mehdi Bennis,et al.  A Q-learning based approach to interference avoidance in self-organized femtocell networks , 2010, 2010 IEEE Globecom Workshops.

[23]  Tao Jiang,et al.  Deep learning for wireless physical layer: Opportunities and challenges , 2017, China Communications.

[24]  Yuan Wu,et al.  Revenue Sharing Based Resource Allocation for Dynamic Spectrum Access Networks , 2014, IEEE Journal on Selected Areas in Communications.

[25]  Marko Beko,et al.  RSS-Based Localization in Wireless Sensor Networks Using Convex Relaxation: Noncooperative and Cooperative Schemes , 2015, IEEE Transactions on Vehicular Technology.

[26]  Anjali Agarwal,et al.  Spectrum sharing in multi-service cognitive network using reinforcement learning , 2009, 2009 First UK-India International Workshop on Cognitive Wireless Systems (UKIWCWS).

[27]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[28]  H. Vincent Poor,et al.  Reinforcement learning based distributed multiagent sensing policy for cognitive radio networks , 2011, 2011 IEEE International Symposium on Dynamic Spectrum Access Networks (DySPAN).

[29]  Roy D. Yates,et al.  Constrained power control , 1994, Wirel. Pers. Commun..

[30]  Weifeng Su,et al.  Active Cooperation Between Primary Users and Cognitive Radio Users in Heterogeneous Ad-Hoc Networks , 2012, IEEE Transactions on Signal Processing.

[31]  Tamer A. ElBatt,et al.  Joint scheduling and power control for wireless ad hoc networks , 2002, IEEE Transactions on Wireless Communications.

[32]  Tiina Heikkinen,et al.  A potential game approach to distributed power control and scheduling , 2006, Comput. Networks.

[33]  Vijay K. Bhargava,et al.  Cognitive Wireless Communication Networks , 2007 .

[34]  Peter Dayan,et al.  Q-learning , 1992, Machine Learning.

[35]  Yuan Wu,et al.  Cooperative spectrum sharing in cognitive radio networks with proactive primary system , 2013, 2013 IEEE/CIC International Conference on Communications in China - Workshops (CIC/ICCC).

[36]  Shiqian Ma,et al.  Joint Power and Admission Control: Non-Convex $L_{q}$ Approximation and An Effective Polynomial Time Deflation Approach , 2013, IEEE Transactions on Signal Processing.

[37]  Theodore S. Rappaport,et al.  Wireless communications - principles and practice , 1996 .

[38]  Simon Haykin,et al.  Cognitive radio: brain-empowered wireless communications , 2005, IEEE Journal on Selected Areas in Communications.

[39]  Shane Legg,et al.  Human-level control through deep reinforcement learning , 2015, Nature.

[40]  Rui Zhang,et al.  Retrodirective Multi-User Wireless Power Transfer With Massive MIMO , 2017, IEEE Wireless Communications Letters.

[41]  Alex Graves,et al.  Playing Atari with Deep Reinforcement Learning , 2013, ArXiv.