Stable multiuser channel allocations in opportunistic spectrum access

We consider the distributed channel allocation problem in an asymmetrical opportunistic spectrum access (OSA) system where each secondary user possibly has different channel reward even in the same channel due to geographic dispersion. We formulate this problem as a Gale-Shapley stable theorem using game theory to optimize the sum reward of all secondary users. It is challenging to achieve the stable matching of user-channel pairs without centralized control and prior knowledge of channel availability statistics. In this paper, we present a novel decentralized order-optimal learning Gale-Shapley scheme (OLGS) in which secondary users learn from their local history data and individually adjust their behaviors in a time-varying OSA system. The proposed scheme eliminates collisions among secondary users by a one-to-one user-channel matching policy. It also achieves stable spectral allocations using learning method without assuming known channel parameters and independent of information exchange among secondary users. Simulation results show that the system regret of the OLGS solution grows with time at the logarithmic order with low complexity.

[1]  L. Shapley,et al.  College Admissions and the Stability of Marriage , 1962 .

[2]  Ian F. Akyildiz,et al.  NeXt generation/dynamic spectrum access/cognitive radio wireless networks: A survey , 2006, Comput. Networks.

[3]  Amir Leshem,et al.  Multichannel Opportunistic Carrier Sensing for Stable Channel Access Control in Cognitive Radio Systems , 2012, IEEE Journal on Selected Areas in Communications.

[4]  Ananthram Swami,et al.  Joint Design and Separation Principle for Opportunistic Spectrum Access , 2006, ASILOMAR 2006.

[5]  Yi Gai,et al.  Learning Multiuser Channel Allocations in Cognitive Radio Networks: A Combinatorial Multi-Armed Bandit Formulation , 2010, 2010 IEEE Symposium on New Frontiers in Dynamic Spectrum (DySPAN).

[6]  H. Vincent Poor,et al.  Cognitive Medium Access: Exploration, Exploitation, and Competition , 2007, IEEE Transactions on Mobile Computing.

[7]  Peter Auer,et al.  Finite-time Analysis of the Multiarmed Bandit Problem , 2002, Machine Learning.

[8]  Ananthram Swami,et al.  Distributed Algorithms for Learning and Cognitive Medium Access with Logarithmic Regret , 2010, IEEE Journal on Selected Areas in Communications.

[9]  Andrea J. Goldsmith,et al.  Breaking Spectrum Gridlock With Cognitive Radios: An Information Theoretic Perspective , 2009, Proceedings of the IEEE.

[10]  Qing Zhao,et al.  Distributed Learning in Multi-Armed Bandit With Multiple Players , 2009, IEEE Transactions on Signal Processing.

[11]  Alagan Anpalagan,et al.  Opportunistic Spectrum Access in Unknown Dynamic Environment: A Game-Theoretic Stochastic Learning Solution , 2012, IEEE Transactions on Wireless Communications.