Coordinating Secondary-User Behaviors for Inelastic Traffic Reward Maximization in Large-Scale \osa Networks

We develop efficient coordination techniques that support inelastic traffic in large-scale distributed dynamic spectrum access (DSA) networks. By means of any learning algorithm, the proposed techniques enable DSA users to locate and exploit spectrum opportunities effectively, thereby increasing their achieved throughput (or “rewards” to be more general). Basically, learning algorithms allow DSA users to learn by interacting with the environment, and use their acquired knowledge to select the proper actions that maximize their own objectives, thereby “hopefully” maximizing their long-term cumulative received reward. However, when DSA users' objectives are not carefully coordinated, learning algorithms can lead to poor overall system performance, resulting in lesser per-user average achieved rewards. In this paper, we derive efficient objective functions that DSA users can aim to maximize, and that by doing so, users' collective behavior also leads to good overall system performance, thus maximizing each user's long-term cumulative received rewards. We show that the proposed techniques are: (i) efficient by enabling users to achieve high rewards, (ii) scalable by performing well in systems with a small as well as a large number of users, (iii) learnable by allowing users to reach up high rewards very quickly, and (iv) distributive by being implementable in a decentralized manner.

[1]  Baochun Li,et al.  A Secondary Market for Spectrum , 2010, 2010 Proceedings IEEE INFOCOM.

[2]  M. Chatterjee,et al.  An Economic Framework for Dynamic Spectrum Access and Service Pricing , 2009, IEEE/ACM Transactions on Networking.

[3]  G. Tesauro Practical Issues in Temporal Difference Learning , 1992 .

[4]  Ananthram Swami,et al.  Joint Design and Separation Principle for Opportunistic Spectrum Access in the Presence of Sensing Errors , 2007, IEEE Transactions on Information Theory.

[5]  Mingyan Liu,et al.  Revenue generation for truthful spectrum auction in dynamic spectrum access , 2009, MobiHoc '09.

[6]  Kagan Tumer,et al.  Distributed agent-based air traffic flow management , 2007, AAMAS '07.

[7]  Bechir Hamdaoui,et al.  Aligning Spectrum-User Objectives for Maximum Inelastic-Traffic Reward , 2011, 2011 Proceedings of 20th International Conference on Computer Communications and Networks (ICCCN).

[8]  Shuguang Cui,et al.  Optimal Linear Cooperation for Spectrum Sensing in Cognitive Radio Networks , 2008, IEEE Journal of Selected Topics in Signal Processing.

[9]  Jeffrey H. Reed,et al.  Designing and deploying a building-wide cognitive radio network testbed , 2010, IEEE Communications Magazine.

[10]  Kaushik R. Chowdhury,et al.  A survey on MAC protocols for cognitive radio networks , 2009, Ad Hoc Networks.

[11]  Bhaskar Krishnamachari,et al.  Dynamic Multichannel Access With Imperfect Channel State Detection , 2010, IEEE Transactions on Signal Processing.

[12]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[13]  Bechir Hamdaoui,et al.  iMAC: improved Medium Access Control for multi-channel multi-hop wireless networks , 2013, Wirel. Commun. Mob. Comput..

[14]  Marceau Coupechoux,et al.  An Auction Framework for Spectrum Allocation with Interference Constraint in Cognitive Radio Networks , 2010, 2010 Proceedings IEEE INFOCOM.

[15]  Kagan Tumer,et al.  Efficient Evaluation Functions for Evolving Coordination , 2008, Evolutionary Computation.

[16]  Husheng Li Multiagent Q-Learning for Aloha-Like Spectrum Access in Cognitive Radio Systems , 2010, EURASIP J. Wirel. Commun. Netw..

[17]  Maria-Gabriella Di Benedetto,et al.  A Survey on MAC Strategies for Cognitive Radio Networks , 2012, IEEE Communications Surveys & Tutorials.

[18]  Kang G. Shin,et al.  OS-MAC: An Efficient MAC Protocol for Spectrum-Agile Wireless Networks , 2008, IEEE Transactions on Mobile Computing.

[19]  Sachin Shetty,et al.  A Learning-based Multiuser Opportunistic Spectrum Access Approach in Unslotted Primary Networks , 2009, IEEE INFOCOM 2009.

[20]  Hua Liu,et al.  Channel Selection in Multi-Channel Opportunistic Spectrum Access Networks with Perfect Sensing , 2010, 2010 IEEE Symposium on New Frontiers in Dynamic Spectrum (DySPAN).

[21]  W. Marsden I and J , 2012 .

[22]  Kang G. Shin,et al.  Efficient Discovery of Spectrum Opportunities with MAC-Layer Sensing in Cognitive Radio Networks , 2008, IEEE Transactions on Mobile Computing.

[23]  Aaas News,et al.  Book Reviews , 1893, Buffalo Medical and Surgical Journal.

[24]  Liesbet Van der Perre,et al.  A Distributed Multichannel MAC Protocol for Multihop Cognitive Radio Networks , 2010, IEEE Transactions on Vehicular Technology.

[25]  Venugopal V. Veeravalli,et al.  Algorithms for Dynamic Spectrum Access With Learning for Cognitive Radio , 2008, IEEE Transactions on Signal Processing.

[26]  Zhu Han,et al.  Distributive Opportunistic Spectrum Access for Cognitive Radio using Correlated Equilibrium and No-Regret Learning , 2007, 2007 IEEE Wireless Communications and Networking Conference.

[27]  Lang Tong,et al.  Optimal Cognitive Access of Markovian Channels under Tight Collision Constraints , 2011, IEEE J. Sel. Areas Commun..

[28]  TesauroGerald Practical Issues in Temporal Difference Learning , 1992 .

[29]  Fangwen Fu,et al.  Detection of Spectral Resources in Cognitive Radios Using Reinforcement Learning , 2008, 2008 3rd IEEE Symposium on New Frontiers in Dynamic Spectrum Access Networks.

[30]  Venugopal V. Veeravalli,et al.  Cooperative Sensing for Primary Detection in Cognitive Radio , 2008, IEEE Journal of Selected Topics in Signal Processing.

[31]  John N. Tsitsiklis,et al.  Neuro-Dynamic Programming , 1996, Encyclopedia of Machine Learning.

[32]  V. Veeravalli,et al.  Dynamic spectrum access with learning for cognitive radio , 2008 .

[33]  Jeffrey H. Reed,et al.  Cyclostationary Approaches to Signal Detection and Classification in Cognitive Radio , 2007, 2007 2nd IEEE International Symposium on New Frontiers in Dynamic Spectrum Access Networks.

[34]  Kagan Tumer,et al.  Unifying temporal and structural credit assignment problems , 2004, Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems, 2004. AAMAS 2004..

[35]  Chao Zou,et al.  On game theoretic DSA-driven MAC for cognitive radio networks , 2009, Comput. Commun..

[36]  Saswati Sarkar,et al.  Spectrum Auction Framework for Access Allocation in Cognitive Radio Networks , 2010, IEEE/ACM Transactions on Networking.

[37]  Shamik Sengupta,et al.  An economic framework for dynamic spectrum access and service pricing , 2009, IEEE/ACM Trans. Netw..

[38]  Jianwei Huang,et al.  Competition with Dynamic Spectrum Leasing , 2010, 2010 IEEE Symposium on New Frontiers in Dynamic Spectrum (DySPAN).

[39]  Mingyan Liu,et al.  Optimality of Myopic Sensing in Multi-Channel Opportunistic Access , 2008, 2008 IEEE International Conference on Communications.

[40]  Hüseyin Arslan,et al.  A survey of spectrum sensing algorithms for cognitive radio applications , 2009, IEEE Communications Surveys & Tutorials.

[41]  Kagan Tumer,et al.  Analyzing and visualizing multiagent rewards in dynamic and stochastic domains , 2008, Autonomous Agents and Multi-Agent Systems.

[42]  Danny H. K. Tsang,et al.  Joint design of spectrum sharing and routing with channel heterogeneity in cognitive radio networks , 2009, Phys. Commun..

[43]  Neil Genzlinger A. and Q , 2006 .

[44]  Hua Liu,et al.  Cooperation and Learning in Multiuser Opportunistic Spectrum Access , 2008, ICC Workshops - 2008 IEEE International Conference on Communications Workshops.

[45]  Xin Liu,et al.  Optimal Bandwidth Selection in Multi-Channel Cognitive Radio Networks: How Much is Too Much? , 2008, 2008 3rd IEEE Symposium on New Frontiers in Dynamic Spectrum Access Networks.

[46]  Qing Zhao,et al.  Distributed Learning in Multi-Armed Bandit With Multiple Players , 2009, IEEE Transactions on Signal Processing.