Learning Temporal–Spatial Spectrum Reuse

We formulate and study a multi-user multi-armed bandit problem that exploits the temporal-spatial opportunistic spectrum access (OSA) of primary user (PU) channels, so that secondary users (SUs) who do not interfere with each other can make use of the same PU channel. We first propose a centralized channel allocation policy that has logarithmic regret, but requires a central processor to solve an NP-complete optimization problem at exponentially increasing time intervals. To overcome the high computation complexity at the central processor, we also propose heuristic distributed policies that, however, have linear regrets. Our first distributed policy utilizes a distributed graph coloring and consensus algorithm to determine SUs' channel access ranks, while our second distributed policy incorporates channel access rank learning in a local procedure at each SU at the cost of a higher regret. We compare the performance of our proposed policies with other distributed policies recently proposed for temporal (but not spatial) OSA. We show that all these policies have linear regrets in our temporal-spatial OSA framework. Simulations suggest that our proposed policies have significantly smaller regrets than the other policies when spectrum temporal-spatial reuse is allowed.

[1]  Orhan Dagdeviren,et al.  PERFORMANCE EVALUATION OF DISTRIBUTED SYNCHRONOUS GREEDY GRAPH COLORING ALGORITHMS ON W IRELESS AD HOC AND SENSOR NETWORKS , 2013 .

[2]  Yi Gai,et al.  Distributed Stochastic Online Learning Policies for Opportunistic Spectrum Access , 2014, IEEE Transactions on Signal Processing.

[3]  Nello Cristianini,et al.  An Introduction to Support Vector Machines and Other Kernel-based Learning Methods , 2000 .

[4]  Daniel Brélaz,et al.  New methods to color the vertices of a graph , 1979, CACM.

[5]  Mohamed-Slim Alouini,et al.  Generalized Mean Detector for Collaborative Spectrum Sensing , 2013, IEEE Transactions on Communications.

[6]  Brian L. Mark,et al.  Estimation of maximum interference-free power level for opportunistic spectrum access , 2009, IEEE Transactions on Wireless Communications.

[7]  Ananthram Swami,et al.  Distributed Algorithms for Learning and Cognitive Medium Access with Logarithmic Regret , 2010, IEEE Journal on Selected Areas in Communications.

[8]  G. Nemhauser,et al.  Integer Programming , 2020 .

[9]  Richard M. Karp,et al.  Reducibility among combinatorial problems" in complexity of computer computations , 1972 .

[10]  Wee-Peng Tay,et al.  The Value of Feedback in Decentralized Detection , 2011, IEEE Transactions on Information Theory.

[11]  Venugopal V. Veeravalli,et al.  Cooperative Sensing for Primary Detection in Cognitive Radio , 2008, IEEE Journal of Selected Topics in Signal Processing.

[12]  Narsingh Deo,et al.  Graph Theory with Applications to Engineering and Computer Science , 1975, Networks.

[13]  Gerd Kortuem,et al.  Smart objects as building blocks for the Internet of things , 2010, IEEE Internet Computing.

[14]  Naumaan Nayyar,et al.  Decentralized Learning for Multiplayer Multiarmed Bandits , 2014, IEEE Transactions on Information Theory.

[15]  Kang G. Shin,et al.  Detection of Small-Scale Primary Users in Cognitive Radio Networks , 2011, IEEE Journal on Selected Areas in Communications.

[16]  Alagan Anpalagan,et al.  Opportunistic Spectrum Access with Spatial Reuse: Graphical Game and Uncoupled Learning Solutions , 2013, IEEE Transactions on Wireless Communications.

[17]  Zhijin Qin,et al.  Wideband Spectrum Sensing on Real-Time Signals at Sub-Nyquist Sampling Rates in Single and Cooperative Multiple Nodes , 2016, IEEE Transactions on Signal Processing.

[18]  Moe Z. Win,et al.  On the Impact of Node Failures and Unreliable Communications in Dense Sensor Networks , 2008, IEEE Transactions on Signal Processing.

[19]  Qing Zhao,et al.  Distributed Learning in Multi-Armed Bandit With Multiple Players , 2009, IEEE Transactions on Signal Processing.

[20]  Qihui Wu,et al.  Cellular-Base-Station-Assisted Device-to-Device Communications in TV White Space , 2015, IEEE Journal on Selected Areas in Communications.

[21]  Qian Zhang,et al.  Cooperative Boundary Detection for Spectrum Sensing Using Dedicated Wireless Sensor Networks , 2010, 2010 Proceedings IEEE INFOCOM.

[22]  Yi Zhang,et al.  Distributed opportunistic spectrum access with spatial reuse in cognitive radio networks , 2014, 2014 IEEE Global Conference on Signal and Information Processing (GlobalSIP).

[23]  Adrian Kosowski,et al.  Classical Coloring of Graphs , 2008 .

[24]  Saswati Sarkar,et al.  Spectrum Pricing Games with Spatial Reuse in Cognitive Radio Networks , 2012, IEEE Journal on Selected Areas in Communications.

[25]  Qihui Wu,et al.  Spectrum Sensing in Opportunity-Heterogeneous Cognitive Sensor Networks: How to Cooperate? , 2013, IEEE Sensors Journal.

[26]  Shridhar Mubaraq Mishra Maximizing available spectrum for cognitive radios , 2009 .

[27]  Alex Olshevsky,et al.  Linear Time Average Consensus on Fixed Graphs and Implications for Decentralized Optimization and Multi-Agent Control , 2014, 1411.4186.

[28]  Yi Zhang,et al.  Distributed Boundary Estimation for Spectrum Sensing in Cognitive Radio Networks , 2014, IEEE J. Sel. Areas Commun..

[29]  Rajarathnam Chandramouli,et al.  Distributed Learning in Secondary Spectrum Sharing Graphical Game , 2011, 2011 IEEE Global Telecommunications Conference - GLOBECOM 2011.

[30]  Yi Zhang,et al.  Distributed boundary estimation for spectrum sensing in cognitive radio networks , 2013, 2013 IEEE Wireless Communications and Networking Conference (WCNC).

[31]  Hyundong Shin,et al.  Distributed Local Linear Parameter Estimation Using Gaussian SPAWN , 2014, IEEE Transactions on Signal Processing.

[32]  Jean-Pierre Hubaux,et al.  Efficient MAC in cognitive radio systems: A game-theoretic approach , 2009, IEEE Transactions on Wireless Communications.

[33]  Xu Chen,et al.  Distributed Spectrum Access with Spatial Reuse , 2012, IEEE Journal on Selected Areas in Communications.

[34]  H. Poor,et al.  Fully Distributed State Estimation for Wide-Area Monitoring Systems , 2012, IEEE Transactions on Smart Grid.

[35]  Seong-Lyun Kim,et al.  Optimal Detection of Spatial Opportunity in Wireless Networks , 2011, IEEE Communications Letters.

[36]  K. J. Ray Liu,et al.  Multi-Channel Sensing and Access Game: Bayesian Social Learning with Negative Network Externality , 2014, IEEE Transactions on Wireless Communications.

[37]  Sinan Gezici,et al.  Error Rate Analysis of Cognitive Radio Transmissions with Imperfect Channel Sensing , 2013, 2013 IEEE 78th Vehicular Technology Conference (VTC Fall).

[38]  Yongsheng Ding,et al.  An Intelligent Self-Organization Scheme for the Internet of Things , 2013, IEEE Computational Intelligence Magazine.

[39]  Wee Peng Tay Whose Opinion to Follow in Multihypothesis Social Learning? A Large Deviations Perspective , 2014, IEEE Journal of Selected Topics in Signal Processing.

[40]  K. J. Ray Liu,et al.  Joint Spectrum Sensing and Access Evolutionary Game in Cognitive Radio Networks , 2013, IEEE Transactions on Wireless Communications.

[41]  Richard M. Karp,et al.  Reducibility Among Combinatorial Problems , 1972, 50 Years of Integer Programming.

[42]  Qing Zhao,et al.  Decentralized dynamic spectrum access for cognitive radios: cooperative design of a non-cooperative game , 2009, IEEE Transactions on Communications.

[43]  H. Robbins,et al.  Asymptotically efficient adaptive allocation rules , 1985 .

[44]  Alagan Anpalagan,et al.  Opportunistic Spectrum Access in Unknown Dynamic Environment: A Game-Theoretic Stochastic Learning Solution , 2012, IEEE Transactions on Wireless Communications.

[45]  Lazaros F. Merakos,et al.  Spatial Spectrum Reuse for Opportunistic Spectrum Access in Infrastructure-Based Systems , 2013, Wirel. Pers. Commun..

[46]  Qihui Wu,et al.  Spatial-Temporal Opportunity Detection for Spectrum-Heterogeneous Cognitive Radio Networks: Two-Dimensional Sensing , 2013, IEEE Transactions on Wireless Communications.

[47]  Alan M. Frieze,et al.  Random graphs , 2006, SODA '06.

[48]  Haitao Zheng,et al.  Distributed spectrum allocation via local bargaining , 2005, 2005 Second Annual IEEE Communications Society Conference on Sensor and Ad Hoc Communications and Networks, 2005. IEEE SECON 2005..

[49]  Liuqing Yang,et al.  Cooperative Diversity of Spectrum Sensing for Cognitive Radio Systems , 2010, IEEE Transactions on Signal Processing.

[50]  Yi Zhang,et al.  Opportunistic spectrum access with temporal-spatial reuse in cognitive radio networks , 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[51]  K. J. Ray Liu,et al.  Renewal-theoretical dynamic spectrum access in cognitive radio network with unknown primary behavior , 2011, IEEE Journal on Selected Areas in Communications.

[52]  Peter Auer,et al.  Finite-time Analysis of the Multiarmed Bandit Problem , 2002, Machine Learning.

[53]  O. Dagdeviren,et al.  PERFORMANCE EVALUATION OF DISTRIBUTED SYNCHRONOUS GREEDY GRAPH COLORING ALGORITHMS ON WIRELESS AD HOC AND SENSOR NETWORKS , 2013 .