Joint resource allocation in underwater acoustic communication networks: A game-based hierarchical adversarial multiplayer multiarmed bandit algorithm

Abstract In this study, an adversarial multiplayer multiarmed bandit (MAB) game is employed to model the problem of joint channel and power allocation in multiuser underwater acoustic communication networks (UACNs). Moreover, this study presents a distributed hierarchical learning algorithm that does not require any prior environmental information and direct information exchange among users. This algorithm has a two-tier learning approach that effectively improves user learning ability and decreases learning time. In upper learning, each user formulates a strategy by learning the actual played reward. Outdated virtual information, which can be obtained as the reward of a past-played strategy, is learned in lower learning. The dynamic lower learning mechanism is proposed to prevent falling into an inadequate local extreme value. The algorithm has high tolerance for delay and noncomplete information because of its unique learning behavior. Simulation results showed that the proposed algorithm achieves a high level of performance.

[1]  Yang Liu,et al.  A new Q-learning algorithm based on the metropolis criterion , 2004, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[2]  Hao Zhou,et al.  OFDM-Modulated Dynamic Coded Cooperation in Underwater Acoustic Channels , 2015, IEEE Journal of Oceanic Engineering.

[3]  Mari Carmen Domingo,et al.  Marine communities based congestion control in underwater wireless sensor networks , 2013, Inf. Sci..

[4]  Mehul Motani,et al.  Price-Based Resource Allocation for Spectrum-Sharing Femtocell Networks: A Stackelberg Game Approach , 2012, 2011 IEEE Global Telecommunications Conference - GLOBECOM 2011.

[5]  Michael Zuba,et al.  Challenges and Opportunities of Underwater Cognitive Acoustic Networks , 2014, IEEE Transactions on Emerging Topics in Computing.

[6]  Yi Gai,et al.  Distributed Stochastic Online Learning Policies for Opportunistic Spectrum Access , 2014, IEEE Transactions on Signal Processing.

[7]  Johanne Cohen,et al.  Multi-Armed Bandit for distributed Inter-Cell Interference Coordination , 2015, 2015 IEEE International Conference on Communications (ICC).

[8]  Song Han,et al.  Hierarchical-game-based algorithm for downlink joint subchannel and power allocation in OFDMA femtocell networks , 2016, J. Netw. Comput. Appl..

[9]  Yoshua Bengio,et al.  Generative Adversarial Nets , 2014, NIPS.

[10]  Yueting Zhang,et al.  Synthetic Aperture Radar Image Synthesis by Using Generative Adversarial Nets , 2017, IEEE Geoscience and Remote Sensing Letters.

[11]  Milica Stojanovic,et al.  On the Achievable Rate of a Class of Acoustic Channels and Practical Power Allocation Strategies for OFDM Systems , 2015, IEEE Journal of Oceanic Engineering.

[12]  Lei Yan,et al.  Relay Selection in Underwater Acoustic Cooperative Networks: A Contextual Bandit Approach , 2017, IEEE Communications Letters.

[13]  Zhu Han,et al.  Learning for Robust Routing Based on Stochastic Game in Cognitive Radio Networks , 2016, IEEE Transactions on Communications.

[14]  Kenneth W. Shum,et al.  Convergence of Iterative Waterfilling Algorithm for Gaussian Interference Channels , 2007, IEEE Journal on Selected Areas in Communications.

[15]  Kin-Man Lam,et al.  The OUC-vision large-scale underwater image database , 2017, 2017 IEEE International Conference on Multimedia and Expo (ICME).

[16]  Xiang Cheng,et al.  Index modulated OFDM for underwater acoustic communications , 2016, IEEE Communications Magazine.

[17]  Nicolò Cesa-Bianchi,et al.  Gambling in a rigged casino: The adversarial multi-armed bandit problem , 1995, Proceedings of IEEE 36th Annual Foundations of Computer Science.

[18]  Tho Le-Ngoc,et al.  Power allocation in wireless multi-user relay networks , 2009, IEEE Transactions on Wireless Communications.

[19]  Milica Stojanovic,et al.  Underwater acoustic communication channels: Propagation models and statistical characterization , 2009, IEEE Communications Magazine.

[20]  Yu Zheng,et al.  Urban Water Quality Prediction Based on Multi-Task Multi-View Learning , 2016, IJCAI.

[21]  Yi Gai,et al.  Online learning algorithms for stochastic water-filling , 2011, 2012 Information Theory and Applications Workshop.

[22]  Jun-Hong Cui,et al.  A joint power control and rate adaptation MAC protocol for underwater sensor networks , 2015, Ad Hoc Networks.

[23]  Raphaël Féraud,et al.  The non-stationary stochastic multi-armed bandit problem , 2017, International Journal of Data Science and Analytics.

[24]  Kai Ma,et al.  Hierarchical-Game-Based Uplink Power Control in Femtocell Networks , 2014, IEEE Transactions on Vehicular Technology.

[25]  Yi Gai,et al.  Learning Multiuser Channel Allocations in Cognitive Radio Networks: A Combinatorial Multi-Armed Bandit Formulation , 2010, 2010 IEEE Symposium on New Frontiers in Dynamic Spectrum (DySPAN).

[26]  LinLin Shen,et al.  Visual-Patch-Attention-Aware Saliency Detection , 2015, IEEE Transactions on Cybernetics.

[27]  Dong In Kim,et al.  Tier-Aware Resource Allocation in OFDMA Macrocell-Small Cell Networks , 2014, IEEE Transactions on Communications.

[28]  M. Stojanovic,et al.  Underwater acoustic networks , 2000, IEEE Journal of Oceanic Engineering.

[29]  Tho Le-Ngoc,et al.  Joint Subchannel Assignment and Power Allocation for OFDMA Femtocell Networks , 2014, IEEE Transactions on Wireless Communications.

[30]  Luming Zhang,et al.  Action2Activity: Recognizing Complex Activities from Sensor Data , 2015, IJCAI.

[31]  Wei Yu,et al.  Iterative water-filling for Gaussian vector multiple-access channels , 2001, IEEE Transactions on Information Theory.

[32]  Richard Lippmann,et al.  Machine learning in adversarial environments , 2010, Machine Learning.

[33]  Peter Auer,et al.  The Nonstochastic Multiarmed Bandit Problem , 2002, SIAM J. Comput..

[34]  P. Willett,et al.  MIMO-OFDM for High-Rate Underwater Acoustic Communications , 2009, IEEE Journal of Oceanic Engineering.

[35]  Michele Zorzi,et al.  Protocol design issues in underwater acoustic networks , 2011, Comput. Commun..

[36]  Kin-Man Lam,et al.  Saliency detection using quaternionic distance based weber local descriptor and level priors , 2017, Multimedia Tools and Applications.

[37]  Setareh Maghsudi,et al.  Joint Channel Selection and Power Control in Infrastructureless Wireless Networks: A Multiplayer Multiarmed Bandit Framework , 2014, IEEE Transactions on Vehicular Technology.

[38]  Michael Zuba,et al.  DA-Sync: A Doppler-Assisted Time-Synchronization Scheme for Mobile Underwater Sensor Networks , 2014, IEEE Transactions on Mobile Computing.

[39]  Yilong Yin,et al.  Integrating QDWD with pattern distinctness and local contrast for underwater saliency detection , 2018, J. Vis. Commun. Image Represent..

[40]  M. Stojanovic,et al.  Low Complexity OFDM Detector for Underwater Acoustic Channels , 2006, OCEANS 2006.

[41]  Sudarshan Guruacharya,et al.  Hierarchical Competition for Downlink Power Allocation in OFDMA Femtocell Networks , 2013, IEEE Transactions on Wireless Communications.

[42]  Milica Stojanovic,et al.  On Joint Frequency and Power Allocation in a Cross-Layer Protocol for Underwater Acoustic Networks , 2010 .

[43]  Daniel W. C. Ho,et al.  Consensus control for multiple AUVs under imperfect information caused by communication faults , 2016, Inf. Sci..

[44]  Yuanqing Xia,et al.  Power Allocation Robust to Time-Varying Wireless Channels in Femtocell Networks , 2016, IEEE Transactions on Vehicular Technology.

[45]  Gábor Lugosi,et al.  Prediction, learning, and games , 2006 .

[46]  Petar Popovski,et al.  The METIS 5G System Concept: Meeting the 5G Requirements , 2016, IEEE Communications Magazine.