Performance Optimization for Cooperative Multiuser Cognitive Radio Networks with RF Energy Harvesting Capability

We study the performance optimization problem for a cognitive radio network with radio frequency (RF) energy harvesting capability for secondary users. In such networks, the secondary users are able to not only transmit packets on a channel licensed to a primary user when the channel is idle, but also harvest RF energy from the primary users' transmissions when the channel is busy. Specifically, we propose a system model where the secondary users are able to cooperate to maximize the overall network throughput through sensing a set of common channels. We first consider the case where the secondary users cooperate in a TDMA fashion and propose a novel solution based on a learning algorithm to find optimal channel access policies for the secondary users. Then, we examine the case where the secondary users cooperate in a decentralized manner and we formulate the cooperative decentralized optimization problem as a decentralized partially observable Markov decision process (DEC-POMDP). To solve the cooperative decentralized stochastic optimization problem, we apply a decentralized learning algorithm based on the policy gradient and the Lagrange multiplier method to obtain optimal channel access policies. Extensive performance evaluation is conducted and it shows the efficiency and the convergence of the learning algorithms.

[1]  Mustafa Cenk Gursoy,et al.  Cognitive Radio Transmissions Exploiting Multi-User Diversity under Channel and Sensing Uncertainty , 2013, IEEE Communications Letters.

[2]  Abhijit Gosavi,et al.  Simulation-Based Optimization: Parametric Optimization Techniques and Reinforcement Learning , 2003 .

[3]  Ahmed Sultan Sensing and Transmit Energy Optimization for an Energy Harvesting Cognitive Radio , 2012, IEEE Wireless Communications Letters.

[4]  Dinh Thai Hoang,et al.  Performance analysis of cognitive machine-to-machine communications , 2012, 2012 IEEE International Conference on Communication Systems (ICCS).

[5]  Dong In Kim,et al.  Opportunistic Channel Access and RF Energy Harvesting in Cognitive Radio Networks , 2014, IEEE Journal on Selected Areas in Communications.

[6]  Neil Immerman,et al.  The Complexity of Decentralized Control of Markov Decision Processes , 2000, UAI.

[7]  Dong In Kim,et al.  Channel selection in cognitive radio networks with opportunistic RF energy harvesting , 2014, 2014 IEEE International Conference on Communications (ICC).

[8]  C. Mikeka,et al.  Design Issues in Radio Frequency Energy Harvesting System , 2011 .

[9]  Zhu Han,et al.  Dynamic Spectrum Access and Management in Cognitive Radio Networks: References , 2009 .

[10]  John N. Tsitsiklis,et al.  Simulation-based optimization of Markov reward processes , 1998, Proceedings of the 37th IEEE Conference on Decision and Control (Cat. No.98CH36171).

[11]  John N. Tsitsiklis,et al.  Gradient Convergence in Gradient methods with Errors , 1999, SIAM J. Optim..

[12]  K. Schittkowski,et al.  NONLINEAR PROGRAMMING , 2022 .

[13]  V. Borkar Stochastic Approximation: A Dynamical Systems Viewpoint , 2008 .

[14]  Fernando J. Velez,et al.  Electromagnetic Energy Harvesting for Wireless Body Area Networks with Cognitive Radio Capabilities , 2012 .

[15]  Xiaodong Lin,et al.  On exploiting polarization for energy-harvesting enabled cooperative cognitive radio networking , 2013, IEEE Wireless Communications.

[16]  Peter L. Bartlett,et al.  Experiments with Infinite-Horizon, Policy-Gradient Estimation , 2001, J. Artif. Intell. Res..

[17]  Sungsoo Park,et al.  Cognitive Radio Networks with Energy Harvesting , 2013, IEEE Transactions on Wireless Communications.

[18]  Dimitri P. Bertsekas,et al.  Nonlinear Programming , 1997 .

[19]  Prusayon Nintanavongsa,et al.  Design Optimization and Implementation for RF Energy Harvesting Circuits , 2012, IEEE Journal on Emerging and Selected Topics in Circuits and Systems.

[20]  Ekram Hossain,et al.  Dynamic Spectrum Access and Management in Cognitive Radio Networks: Introduction , 2009 .

[21]  Kaibin Huang,et al.  Opportunistic Wireless Energy Harvesting in Cognitive Radio Networks , 2013, IEEE Transactions on Wireless Communications.

[22]  Jiaru Lin,et al.  An online energy allocation strategy for energy harvesting cognitive radio systems , 2013, 2013 International Conference on Wireless Communications and Signal Processing.

[23]  Sungsoo Park,et al.  Optimal mode selection for cognitive radio sensor networks with RF energy harvesting , 2012, 2012 IEEE 23rd International Symposium on Personal, Indoor and Mobile Radio Communications - (PIMRC).

[24]  Mischa Schwartz,et al.  Mobile Wireless Communications: Access and scheduling techniques in cellular systems , 2004 .

[25]  Olivier Buffet,et al.  Shaping multi-agent systems with gradient reinforcement learning , 2007, Autonomous Agents and Multi-Agent Systems.

[26]  Kah Phooi Seng,et al.  Radio Frequency Energy Harvesting and Management for Wireless Sensor Networks , 2012, ArXiv.

[27]  Mani Srivastava,et al.  Energy-aware wireless microsensor networks , 2002, IEEE Signal Process. Mag..

[28]  Sungsoo Park,et al.  Optimal Spectrum Access for Energy Harvesting Cognitive Radio Networks , 2013, IEEE Transactions on Wireless Communications.

[29]  J. Tsitsiklis,et al.  Convergence rate of linear two-time-scale stochastic approximation , 2004, math/0405287.

[30]  Abhijit Gosavi,et al.  Simulation-Based Optimization: Parametric Optimization Techniques and Reinforcement Learning , 2003 .