Relay Selection for Underwater Acoustic Sensor Networks: A Multi-User Multi-Armed Bandit Formulation

Multi-user cooperative transmission is an attractive architecture for underwater acoustic sensor networks (UASNs). Cooperative transmission depends on careful allocations of resources such as relay selection, but traditional relay selection requires precise measurements of channel state information, which is infeasible for multi-user cooperative transmission due to the unique features and hardware restrictions of UASNs. In this paper, we model multi-user relay selection under a multiuser multi-armed bandit (MU-MAB) framework, whereby users are not provided any prior knowledge about underwater acoustic channel conditions. We first exploit a novel MU-MAB algorithm, DSMU-MAB, for relay selection, assuming that the reward distributions are initially unknown but remain constant. Second, we consider an evolving environment in which the reward distributions undergo changes in time, and DSMU-rMAB, a derivative of DSMU-MAB, is proposed, which can be robust to abrupt changes in underwater communication environments. The proposed algorithms not only help sources find the suitable relays to achieve a high quality transmission and avoid collisions among users but also reduce the mass of information exchanged among users. We established the effectiveness of our proposed algorithms using theoretical and numerical analyses.

[1]  Ning Sun,et al.  Secure communication for underwater acoustic sensor networks , 2015, IEEE Communications Magazine.

[2]  Setareh Maghsudi,et al.  Multi-armed bandits with application to 5G small cells , 2015, IEEE Wireless Communications.

[3]  Mandar Chitre,et al.  Tuning an underwater communication link , 2013, 2013 MTS/IEEE OCEANS - Bergen.

[4]  A. J. Han Vinck,et al.  Relay selection in cooperative power line communication: A multi-armed bandit approach , 2017, Journal of Communications and Networks.

[5]  Rica Gonen,et al.  An incentive-compatible multi-armed bandit mechanism , 2007, PODC '07.

[6]  Yunsi Fei,et al.  QELAR: A Machine-Learning-Based Adaptive Routing Protocol for Energy-Efficient and Lifetime-Extended Underwater Sensor Networks , 2010, IEEE Transactions on Mobile Computing.

[7]  Lei Yan,et al.  Relay Selection in Underwater Acoustic Cooperative Networks: A Contextual Bandit Approach , 2017, IEEE Communications Letters.

[8]  Tolga M. Duman,et al.  Cooperative underwater acoustic communications [Accepted From Open Call] , 2013, IEEE Communications Magazine.

[9]  Ataollah Ebrahimzadeh,et al.  Adaptive Relay Selection and Power Allocation for OFDM Cooperative Underwater Acoustic Systems , 2018, IEEE Transactions on Mobile Computing.

[10]  Jun-Hong Cui,et al.  A joint power control and rate adaptation MAC protocol for underwater sensor networks , 2015, Ad Hoc Networks.

[11]  Peter Auer,et al.  Finite-time Analysis of the Multiarmed Bandit Problem , 2002, Machine Learning.

[12]  Setareh Maghsudi,et al.  Joint channel allocation and power control for underlay D2D transmission , 2015, 2015 IEEE International Conference on Communications (ICC).

[13]  Amir Leshem,et al.  Multichannel Opportunistic Carrier Sensing for Stable Channel Access Control in Cognitive Radio Systems , 2012, IEEE Journal on Selected Areas in Communications.

[14]  Alagan Anpalagan,et al.  A Survey of Distributed Relay Selection Schemes in Cooperative Wireless Ad hoc Networks , 2012, Wirel. Pers. Commun..

[15]  Yang Wei,et al.  Exploiting Cooperative Relay for Reliable Communications in Underwater Acoustic Sensor Networks , 2014, 2014 IEEE Military Communications Conference.

[16]  Bhaskar Krishnamachari,et al.  On myopic sensing for multi-channel opportunistic access: structure, optimality, and performance , 2007, IEEE Transactions on Wireless Communications.

[17]  Khaled Ben Letaief,et al.  Multiuser OFDM with adaptive subcarrier, bit, and power allocation , 1999, IEEE J. Sel. Areas Commun..

[18]  Yi Gai,et al.  Learning Multiuser Channel Allocations in Cognitive Radio Networks: A Combinatorial Multi-Armed Bandit Formulation , 2010, 2010 IEEE Symposium on New Frontiers in Dynamic Spectrum (DySPAN).

[19]  T. C. Yang Distributed Underwater Sensing: A Paradigm Change for the Future , 2014 .

[20]  Xiaojiang Du,et al.  Stable multiuser channel allocations in opportunistic spectrum access , 2013, 2013 IEEE Wireless Communications and Networking Conference (WCNC).

[21]  Walid Saad,et al.  Matching theory for future wireless networks: fundamentals and applications , 2014, IEEE Communications Magazine.

[22]  Gábor Lugosi,et al.  Minimax Policies for Combinatorial Prediction Games , 2011, COLT.

[23]  L. S. Shapley,et al.  College Admissions and the Stability of Marriage , 2013, Am. Math. Mon..

[24]  Mingyan Liu,et al.  On the Combinatorial Multi-Armed Bandit Problem with Markovian Rewards , 2011, 2011 IEEE Global Telecommunications Conference - GLOBECOM 2011.

[25]  Eric Moulines,et al.  On Upper-Confidence Bound Policies for Switching Bandit Problems , 2011, ALT.

[26]  Kapil R. Dandekar,et al.  Learning State Selection for Reconfigurable Antennas: A Multi-Armed Bandit Approach , 2014, IEEE Transactions on Antennas and Propagation.

[27]  Tao Jiang,et al.  Toward Optimal Adaptive Wireless Communications in Unknown Environments , 2015, IEEE Transactions on Wireless Communications.

[28]  Martial Hebert,et al.  Multi-armed recommendation bandits for selecting state machine policies for robotic systems , 2013, 2013 IEEE International Conference on Robotics and Automation.

[29]  Zhong Zhou,et al.  Effective Relay Selection for Underwater Cooperative Acoustic Networks , 2013, 2013 IEEE 10th International Conference on Mobile Ad-Hoc and Sensor Systems.

[30]  Tao Qin,et al.  Multi-Armed Bandit with Budget Constraint and Variable Costs , 2013, AAAI.

[31]  Mauro Birattari,et al.  Multi-armed Bandit Formulation of the Task Partitioning Problem in Swarm Robotics , 2012, ANTS.

[32]  Yi Gai,et al.  Distributed Stochastic Online Learning Policies for Opportunistic Spectrum Access , 2014, IEEE Transactions on Signal Processing.

[33]  Henk Wymeersch,et al.  An overview of project COMPOUND: Cooperative communications and positioning in mobile underwater networks , 2012, 2012 Future Network & Mobile Summit (FutureNetw).

[34]  Chris Murphy,et al.  CAPTURE: A Communications Architecture for Progressive Transmission via Underwater Relays With Eavesdropping , 2014, IEEE Journal of Oceanic Engineering.

[35]  Qing Zhao,et al.  A Restless Bandit Formulation of Opportunistic Access: Indexablity and Index Policy , 2008, 2008 5th IEEE Annual Communications Society Conference on Sensor, Mesh and Ad Hoc Communications and Networks Workshops.

[36]  Di He,et al.  Online learning for auction mechanism in bandit setting , 2013, Decis. Support Syst..

[37]  Setareh Maghsudi,et al.  Joint Channel Selection and Power Control in Infrastructureless Wireless Networks: A Multiplayer Multiarmed Bandit Framework , 2014, IEEE Transactions on Vehicular Technology.