Optimal cooperative cognitive relaying and spectrum access for an energy harvesting cognitive radio: Reinforcement learning approach

In this paper, we consider a cognitive radio setting with cooperative communications, in which the cognitive radio (CR) user acts as a self-organized relay for the network. Both the CR user and the primary user (PU) are assumed to be energy harvesters. The CR user cooperatively relays some of the PU's undelivered packets: it stores a fraction of them in a relaying queue (buffer) and controls the flow of undelivered primary packets into that queue by choosing appropriate actions over time slots. In idle time slots (slots in which the PU's queue is empty), it also decides which of its queues is used to access the channel. One data packet transmission is assumed to dissipate one energy packet. The optimal policy depends on the arrival rates at the data and energy queues of the primary and CR users, as well as on the connectivity of the channels. By taking responsibility for relaying the undelivered primary packets, the CR user saves energy for the PU. It optimally manages its own energy packets to maximize its payoff over time.
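
The queue-selection decision described above can be illustrated with a minimal tabular Q-learning sketch. It is a toy model under stated assumptions, not the formulation used in the paper: every slot is treated as idle, arrivals are Bernoulli, queues are capped at a small size, and the names `step`, `train`, `CAP`, the arrival probabilities, and the reward weights are all hypothetical. The only rule taken from the abstract is that one data packet transmission dissipates one energy packet.

```python
import random

# Hypothetical action set: which CR queue accesses the channel in an idle slot.
ACTIONS = ("own", "relay")
CAP = 3  # assumed maximum length of each queue (keeps the state space small)


def step(state, action, rng, p_data=0.3, p_relay=0.3, p_energy=0.5,
         r_own=1.0, r_relay=0.5):
    """Advance the toy model by one idle slot.

    state is (own data queue, relaying queue, energy queue).
    Arrival probabilities and rewards are illustrative assumptions.
    """
    own, relay, energy = state
    reward = 0.0
    # One data packet transmission dissipates one energy packet.
    if energy > 0:
        if action == "own" and own > 0:
            own, energy, reward = own - 1, energy - 1, r_own
        elif action == "relay" and relay > 0:
            relay, energy, reward = relay - 1, energy - 1, r_relay
    # Assumed Bernoulli arrivals to the data, relaying, and energy queues.
    own = min(CAP, own + int(rng.random() < p_data))
    relay = min(CAP, relay + int(rng.random() < p_relay))
    energy = min(CAP, energy + int(rng.random() < p_energy))
    return (own, relay, energy), reward


def q_learning_step(q, state, action, reward, next_state, alpha=0.1, gamma=0.9):
    """Standard update: Q(s,a) += alpha * (r + gamma * max_a' Q(s',a') - Q(s,a))."""
    best_next = max(q.get((next_state, a), 0.0) for a in ACTIONS)
    old = q.get((state, action), 0.0)
    q[(state, action)] = old + alpha * (reward + gamma * best_next - old)
    return q[(state, action)]


def train(slots=5000, alpha=0.1, gamma=0.9, epsilon=0.1, seed=0):
    """Learn a Q-table with epsilon-greedy exploration over the toy model."""
    rng = random.Random(seed)
    q = {}
    state = (0, 0, 0)
    for _ in range(slots):
        if rng.random() < epsilon:
            action = rng.choice(ACTIONS)  # explore
        else:
            action = max(ACTIONS, key=lambda a: q.get((state, a), 0.0))  # exploit
        next_state, reward = step(state, action, rng)
        q_learning_step(q, state, action, reward, next_state, alpha, gamma)
        state = next_state
    return q
```

Calling `train()` returns a dictionary keyed by `(state, action)`; the greedy action in each state is the one with the larger learned value. Changing the assumed arrival probabilities changes which queue the learned policy prefers, which mirrors the abstract's remark that the optimal policy depends on the arrival rates.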
