Game-based hierarchical multi-armed bandit learning algorithm for joint channel and power allocation in underwater acoustic communication networks

Abstract This study considers a joint channel and power allocation for multiple users in underwater acoustic communication networks as a formulated multiplayer MAB game. This study also proposes hierarchical learning algorithms, which do not need any prior environmental information and direct information exchange among users, to improve the learning ability. In upper sub-learning, each user generates a strategy through the traditional UCB1 strategy. In lower sub-learning, the concept of virtual learning information, which can be obtained as the reward of the last actual played strategy, is introduced to enrich the learning information. Users can enhance their learning ability by learning the outdated virtual learning information in lower sub-learning. As a result, the learning time it takes to achieve the NE is effectively decreased, and the cost of the algorithm is reduced. A distributed optimal NE selection mechanism is proposed to avoid falling into an inadequate local extreme value. Simulation results show high convergence speed and achieved utility of the proposed algorithm.

[1]  T.M. Duman,et al.  High-Rate Communication for Underwater Acoustic Channels Using Multiple Transmitters and Space–Time Coding: Receiver Structures and Experimental Results , 2007, IEEE Journal of Oceanic Engineering.

[2]  Seong-Lyun Kim,et al.  Joint subcarrier and power allocation in uplink OFDMA systems , 2005, IEEE Communications Letters.

[3]  Qing Zhao,et al.  Learning and sharing in a changing world: Non-Bayesian restless bandit with multiple players , 2011, 2011 Information Theory and Applications Workshop.

[4]  Mehul Motani,et al.  Price-Based Resource Allocation for Spectrum-Sharing Femtocell Networks: A Stackelberg Game Approach , 2012, 2011 IEEE Global Telecommunications Conference - GLOBECOM 2011.

[5]  Michael Zuba,et al.  DA-Sync: A Doppler-Assisted Time-Synchronization Scheme for Mobile Underwater Sensor Networks , 2014, IEEE Transactions on Mobile Computing.

[6]  Wei Yu,et al.  Iterative water-filling for Gaussian vector multiple-access channels , 2001, IEEE Transactions on Information Theory.

[7]  Kenneth W. Shum,et al.  Convergence of Iterative Waterfilling Algorithm for Gaussian Interference Channels , 2007, IEEE Journal on Selected Areas in Communications.

[8]  Song Han,et al.  Hierarchical-game-based algorithm for downlink joint subchannel and power allocation in OFDMA femtocell networks , 2016, J. Netw. Comput. Appl..

[9]  Sudarshan Guruacharya,et al.  Hierarchical Competition for Downlink Power Allocation in OFDMA Femtocell Networks , 2013, IEEE Transactions on Wireless Communications.

[10]  Peter Auer,et al.  Finite-time Analysis of the Multiarmed Bandit Problem , 2002, Machine Learning.

[11]  Milica Stojanovic,et al.  On the Achievable Rate of a Class of Acoustic Channels and Practical Power Allocation Strategies for OFDM Systems , 2015, IEEE Journal of Oceanic Engineering.

[12]  Milica Stojanovic,et al.  On Joint Frequency and Power Allocation in a Cross-Layer Protocol for Underwater Acoustic Networks , 2010 .

[13]  M. Stojanovic,et al.  Statistical Characterization and Computationally Efficient Modeling of a Class of Underwater Acoustic Communication Channels , 2013, IEEE Journal of Oceanic Engineering.

[14]  Yi Gai,et al.  Online learning algorithms for stochastic water-filling , 2011, 2012 Information Theory and Applications Workshop.

[15]  Tho Le-Ngoc,et al.  Joint Subchannel Assignment and Power Allocation for OFDMA Femtocell Networks , 2014, IEEE Transactions on Wireless Communications.

[16]  Yi Gai,et al.  Distributed Stochastic Online Learning Policies for Opportunistic Spectrum Access , 2014, IEEE Transactions on Signal Processing.

[17]  Jun-Hong Cui,et al.  A joint power control and rate adaptation MAC protocol for underwater sensor networks , 2015, Ad Hoc Networks.

[18]  Dong In Kim,et al.  Tier-Aware Resource Allocation in OFDMA Macrocell-Small Cell Networks , 2014, IEEE Transactions on Communications.

[19]  Michele Zorzi,et al.  Protocol design issues in underwater acoustic networks , 2011, Comput. Commun..

[20]  J. Walrand,et al.  Asymptotically efficient allocation rules for the multiarmed bandit problem with multiple plays-Part II: Markovian rewards , 1987 .

[21]  Lei Yan,et al.  Relay Selection in Underwater Acoustic Cooperative Networks: A Contextual Bandit Approach , 2017, IEEE Communications Letters.

[22]  Yi Gai,et al.  Learning Multiuser Channel Allocations in Cognitive Radio Networks: A Combinatorial Multi-Armed Bandit Formulation , 2010, 2010 IEEE Symposium on New Frontiers in Dynamic Spectrum (DySPAN).

[23]  M. Stojanovic,et al.  Underwater acoustic networks , 2000, IEEE Journal of Oceanic Engineering.

[24]  Zhu Han,et al.  Learning for Robust Routing Based on Stochastic Game in Cognitive Radio Networks , 2016, IEEE Transactions on Communications.

[25]  Setareh Maghsudi,et al.  Joint Channel Selection and Power Control in Infrastructureless Wireless Networks: A Multiplayer Multiarmed Bandit Framework , 2014, IEEE Transactions on Vehicular Technology.

[26]  Milica Stojanovic,et al.  Underwater acoustic communication channels: Propagation models and statistical characterization , 2009, IEEE Communications Magazine.

[27]  Yuanqing Xia,et al.  Power Allocation Robust to Time-Varying Wireless Channels in Femtocell Networks , 2016, IEEE Transactions on Vehicular Technology.

[28]  Gábor Lugosi,et al.  Prediction, learning, and games , 2006 .

[29]  Yueming Cai,et al.  Optimal Power Allocation and User Scheduling in Multicell Networks: Base Station Cooperation Using a Game-Theoretic Approach , 2014, IEEE Transactions on Wireless Communications.

[30]  Kai Ma,et al.  Hierarchical-Game-Based Uplink Power Control in Femtocell Networks , 2014, IEEE Transactions on Vehicular Technology.

[31]  Yueming Cai,et al.  Optimal Power Control in Ultra-Dense Small Cell Networks: A Game-Theoretic Approach , 2017, IEEE Transactions on Wireless Communications.

[32]  Michael Zuba,et al.  Challenges and Opportunities of Underwater Cognitive Acoustic Networks , 2014, IEEE Transactions on Emerging Topics in Computing.