Cooperative Q-learning for multiple secondary users in dynamic spectrum access

In this paper, we present and evaluate learning schemes that allow multiple secondary users to locate and use spectrum opportunities effectively, thus improving efficiency of dynamic spectrum access (DSA) systems. Using simulations, we show that the proposed schemes achieve good performances in terms of throughput and fairness, and does so by interacting with and learning from the environment only, without requiring prediction models of the environment's dynamics and behaviors.

[1]  Kang G. Shin,et al.  What and how much to gain by spectrum agility? , 2007, IEEE Journal on Selected Areas in Communications.

[2]  K. J. Ray Liu,et al.  Multi-Stage Pricing Game for Collusion-Resistant Dynamic Spectrum Allocation , 2008, IEEE Journal on Selected Areas in Communications.

[3]  Ryszard Kowalczyk,et al.  Dynamic analysis of multiagent Q-learning with ε-greedy exploration , 2009, ICML '09.

[4]  Hanif D. Sherali,et al.  Spectrum Sharing for Multi-Hop Networking with Cognitive Radios , 2008, IEEE Journal on Selected Areas in Communications.

[5]  Kang G. Shin,et al.  Efficient Discovery of Spectrum Opportunities with MAC-Layer Sensing in Cognitive Radio Networks , 2008, IEEE Transactions on Mobile Computing.

[6]  Richard S. Sutton,et al.  Dimensions of Reinforcement Learning , 1998 .

[7]  Hüseyin Arslan,et al.  Exploiting location awareness toward improved wireless system design in cognitive radio , 2008, IEEE Communications Magazine.

[8]  Hang Su,et al.  Cross-Layer Based Opportunistic MAC Protocols for QoS Provisionings Over Cognitive Radio Wireless Networks , 2008, IEEE Journal on Selected Areas in Communications.

[9]  Andrew G. Barto,et al.  Reinforcement learning , 1998 .

[10]  P. Ballon,et al.  Flexible Spectrum Management and the Need for Controlling Entities for Reconfigurable Wireless Systems , 2007, 2007 2nd IEEE International Symposium on New Frontiers in Dynamic Spectrum Access Networks.

[11]  Amir Ghasemi,et al.  Interference Aggregation in Spectrum-Sensing Cognitive Wireless Networks , 2008, IEEE Journal of Selected Topics in Signal Processing.

[12]  Syed Ali Jafar,et al.  Cognitive Radio Networks: How Much Spectrum Sharing is Optimal? , 2007, IEEE GLOBECOM 2007 - IEEE Global Telecommunications Conference.

[13]  Fangwen Fu,et al.  Detection of Spectral Resources in Cognitive Radios Using Reinforcement Learning , 2008, 2008 3rd IEEE Symposium on New Frontiers in Dynamic Spectrum Access Networks.

[14]  Mark McHenry Dupont Circle Spectrum Utilization During Peak Hours , 2003 .

[15]  Zhu Han,et al.  Distributive Opportunistic Spectrum Access for Cognitive Radio using Correlated Equilibrium and No-Regret Learning , 2007, 2007 IEEE Wireless Communications and Networking Conference.

[16]  Shuguang Cui,et al.  Optimal Linear Cooperation for Spectrum Sensing in Cognitive Radio Networks , 2008, IEEE Journal of Selected Topics in Signal Processing.