Almost Optimal Dynamically-Ordered Channel Sensing and Accessing for Cognitive Networks

For cognitive wireless networks, one challenge is that the status and statistics of the channels' availability are difficult to predict. Numerous learning based online channel sensing and accessing strategies have been proposed to address such challenge. In this work, we propose a novel channel sensing and accessing strategy that carefully balances the channel statistics exploration and multichannel diversity exploitation. Unlike traditional MAB-based approaches, in our scheme, a secondary cognitive radio user will sequentially sense the status of multiple channels in a carefully designed order. We formulate the online sequential channel sensing and accessing problem as a sequencing multi-armed bandit problem, and propose a novel policy whose regret is in optimal logarithmic rate in time and polynomial in the number of channels. We conduct extensive simulations to compare the performance of our method with traditional MAB-based approach. Simulation results show that the proposed scheme improves the throughput by more than 30% and speeds up the learning process by more than 100%.

[1]  R. Agrawal Sample mean based index policies by O(log n) regret for the multi-armed bandit problem , 1995, Advances in Applied Probability.

[2]  Peter Auer,et al.  Finite-time Analysis of the Multiarmed Bandit Problem , 2002, Machine Learning.

[3]  Edward W. Knightly,et al.  Opportunistic Spectral Usage: Bounds and a Multi-Band CSMA/CA Protocol , 2007, IEEE/ACM Transactions on Networking.

[4]  Demosthenis Teneketzis,et al.  Multi-Armed Bandit Problems , 2008 .

[5]  Thomas P. Hayes,et al.  Stochastic Linear Optimization under Bandit Feedback , 2008, COLT.

[6]  Prasanna Chaporkar,et al.  Optimal joint probing and transmission strategy for maximizing throughput in wireless systems , 2008, IEEE Journal on Selected Areas in Communications.

[7]  Sudipto Guha,et al.  Information Acquisition and Exploitation in Multichannel Wireless Networks , 2008, ArXiv.

[8]  Marwan Krunz,et al.  Throughput-efficient sequential channel sensing and probing in cognitive radio networks under sensing errors , 2009, MobiCom '09.

[9]  John N. Tsitsiklis,et al.  A Structured Multiarmed Bandit Problem and the Greedy Policy , 2008, IEEE Transactions on Automatic Control.

[10]  A. Sahai,et al.  How much white space is there ? , 2009 .

[11]  Hai Jiang,et al.  Channel Sensing-Order Setting in Cognitive Radio Networks: A Two-User Case , 2009, IEEE Transactions on Vehicular Technology.

[12]  H. Vincent Poor,et al.  Optimal selection of channel sensing order in cognitive radio , 2009, IEEE Transactions on Wireless Communications.

[13]  Mingyan Liu,et al.  Optimal Channel Probing and Transmission Scheduling for Opportunistic Spectrum Access , 2007, IEEE/ACM Transactions on Networking.

[14]  Yi Gai,et al.  Learning Multiuser Channel Allocations in Cognitive Radio Networks: A Combinatorial Multi-Armed Bandit Formulation , 2010, 2010 IEEE Symposium on New Frontiers in Dynamic Spectrum (DySPAN).

[15]  Keith D. Kastella,et al.  Foundations and Applications of Sensor Management , 2010 .

[16]  Ao Tang,et al.  Opportunistic Spectrum Access with Multiple Users: Learning under Competition , 2010, 2010 Proceedings IEEE INFOCOM.

[17]  Qing Zhao,et al.  Distributed Learning in Multi-Armed Bandit With Multiple Players , 2009, IEEE Transactions on Signal Processing.

[18]  Mingyan Liu,et al.  Online learning in opportunistic spectrum access: A restless bandit approach , 2010, 2011 Proceedings IEEE INFOCOM.

[19]  Rong Zheng,et al.  Sequential learning for optimal monitoring of multi-channel wireless networks , 2011, 2011 Proceedings IEEE INFOCOM.

[20]  H. Vincent Poor,et al.  Cognitive Medium Access: Exploration, Exploitation, and Competition , 2007, IEEE Transactions on Mobile Computing.

[21]  Yiyang Pei,et al.  Energy-Efficient Design of Sequential Channel Sensing in Cognitive Radio Networks: Optimal Sensing Strategy, Power Allocation, and Sensing Order , 2011, IEEE Journal on Selected Areas in Communications.

[22]  Ananthram Swami,et al.  Distributed Algorithms for Learning and Cognitive Medium Access with Logarithmic Regret , 2010, IEEE Journal on Selected Areas in Communications.

[23]  Bhaskar Krishnamachari,et al.  Combinatorial Network Optimization With Unknown Variables: Multi-Armed Bandits With Linear Rewards and Individual Observations , 2010, IEEE/ACM Transactions on Networking.

[24]  Xin Wang,et al.  Channel sensing order in multi-user cognitive radio networks , 2012, 2012 IEEE International Symposium on Dynamic Spectrum Access Networks.

[25]  T. L. Lai Andherbertrobbins Asymptotically Efficient Adaptive Allocation Rules , 2022 .