Multichannel non-persistent CSMA MAC schemes with reinforcement learning for cognitive radio networks

This paper presents two multichannel non-persistent CSMA (M-np-CSMA) MAC schemes using Simple Reinforcement Learning and State-Action-Reward-State-Action (SARSA) learning respectively for distributed cognitive radio networks. The two learning schemes both use reinforcement learning to help the users learn the environment and historical transmissions. Compared with M-np-CSMA MAC protocol with random channel choice, the learning schemes can help the cognitive users choose the best channels which offer more spectrum access opportunities to sense and access. The results show that both learning schemes can effectively improve the throughput and decrease the packet delay at heavy traffic loads and with a large number of cognitive users. The Simple Reinforcement Learning scheme and SARSA scheme can achieves a 15% and 25% improvement in the maximum throughput respectively, compared with the M-np-CSMA without learning.

[1]  Joseph Mitola,et al.  Cognitive radio: making software radios more personal , 1999, IEEE Wirel. Commun..

[2]  Andrew W. Moore,et al.  Reinforcement Learning: A Survey , 1996, J. Artif. Intell. Res..

[3]  Zhu Han,et al.  Dynamics of Multiple-Seller and Multiple-Buyer Spectrum Trading in Cognitive Radio Networks: A Game-Theoretic Modeling Approach , 2009, IEEE Transactions on Mobile Computing.

[4]  Marco Ajmone Marsan,et al.  Multichannel Local Area Network Protocols , 1983, IEEE J. Sel. Areas Commun..

[5]  Simon Haykin,et al.  Cognitive radio: brain-empowered wireless communications , 2005, IEEE Journal on Selected Areas in Communications.

[6]  E. Protonotarios,et al.  Queueing analysis of CSMA protocols , 1986, 1986 25th IEEE Conference on Decision and Control.

[7]  V. Veeravalli,et al.  Dynamic spectrum access with learning for cognitive radio , 2008 .

[8]  F. L. Lo,et al.  Performance comparison of single and multi-channel CSMA-CD wireless networks using equilibrium point analysis , 1996, Proceedings of Vehicular Technology Conference - VTC.

[9]  Simon Haykin,et al.  A dynamic channel assignment policy through Q-learning , 1999, IEEE Trans. Neural Networks.

[10]  David Grace,et al.  Collision reduction in cognitive radio using multichannel 1-persistent CSMA combined with reinforcement learning , 2010, 2010 Proceedings of the Fifth International Conference on Cognitive Radio Oriented Wireless Networks and Communications.

[11]  L. Kleinrock,et al.  Packet Switching in Radio Channels: Part I - Carrier Sense Multiple-Access Modes and Their Throughput-Delay Characteristics , 1975, IEEE Transactions on Communications.

[12]  S.D. Jones,et al.  An experiment for sensing-based opportunistic spectrum access in CSMA/CA networks , 2005, First IEEE International Symposium on New Frontiers in Dynamic Spectrum Access Networks, 2005. DySPAN 2005..

[13]  Mahesan Niranjan,et al.  On-line Q-learning using connectionist systems , 1994 .

[14]  D. Grace,et al.  Performance of cognitive radio reinforcement spectrum sharing using different weighting factors , 2008, 2008 Third International Conference on Communications and Networking in China.