Model free dynamic sensing order selection for imperfect sensing multichannel cognitive radio networks: A Q-learning approach

In multichannel cognitive radio networks (CRN), the sequence of sensing order is essential to determine the efficiency of detecting an idle channel and the effective time for spectrum access. Current works on the optimal channel sensing order have to know priori knowledge such as the channel idle probabilities, channel gains and achievable rates of different channels, which dynamically evolve and are difficult to obtain in practice. We are thus motivated to study a model-free sensing order selection scheme for multichannel CRN. In particular, we consider a time-slotted imperfect sensing CRN, where sensing errors impose more challenges to the selection of sensing order. The sensing order selection is modeled as a Q-Learning problem, where secondary users (SU) make intelligent decision on sensing order selection by learning from historical sensing and transmissions. Simulation results show that theq proposed scheme is robust in dynamic environment and can achieve significant throughput improvement compared with random sequential sensing.

[1]  H. Vincent Poor,et al.  Optimal selection of channel sensing order in cognitive radio , 2009, IEEE Transactions on Wireless Communications.

[2]  Joseph Mitola,et al.  Cognitive radio: making software radios more personal , 1999, IEEE Wirel. Commun..

[3]  Xuemin Shen,et al.  HC-MAC: A Hardware-Constrained Cognitive MAC for Efficient Spectrum Management , 2008, IEEE Journal on Selected Areas in Communications.

[4]  Shaojie Tang,et al.  Almost Optimal Dynamically-Ordered Channel Sensing and Accessing for Cognitive Networks , 2014, IEEE Transactions on Mobile Computing.

[5]  Ananthram Swami,et al.  Joint Design and Separation Principle for Opportunistic Spectrum Access in the Presence of Sensing Errors , 2007, IEEE Transactions on Information Theory.

[6]  Xuemin Shen,et al.  Guest Editorial Vehicular Communications and Networks—Part II , 2011 .

[7]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[8]  Qing Zhao,et al.  Learning in a Changing World: Restless Multiarmed Bandit With Unknown Dynamics , 2010, IEEE Transactions on Information Theory.

[9]  Martin Herdegen Optimal Stopping and Applications Example 2 : American options , 2009 .

[10]  José Ferreira de Rezende,et al.  Channel sensing order for cognitive radio networks using reinforcement learning , 2011, 2011 IEEE 36th Conference on Local Computer Networks.

[11]  Weihua Zhuang,et al.  Simple Channel Sensing Order in Cognitive Radio Networks , 2011, IEEE Journal on Selected Areas in Communications.

[12]  Ananthram Swami,et al.  Decentralized cognitive MAC for opportunistic spectrum access in ad hoc networks: A POMDP framework , 2007, IEEE Journal on Selected Areas in Communications.

[13]  H. Vincent Poor,et al.  Cognitive Medium Access: Exploration, Exploitation, and Competition , 2007, IEEE Transactions on Mobile Computing.